Posts

Showing posts from December 2021

Comparing Decision Trees

In the last article of the current year, we will examine and compare some of the tree algorithms for classification. The dataset we are going to use consists of the answers given to loan applicants and the features evaluated for them. The first algorithm we will ... Continue reading: Comparing Decision Trees http://dlvr.it/SGLWSQ

How to find a Trimmed Mean in R

Visit finnstats for the most up-to-date information on Data Science, employment, and tutorials. If you want to read the original article, click here How to find a Trimmed Mean in R. Knowing a few algorithms in depth is preferable to knowing a bit about a lot of algorithms, visit finnstats. ... Continue reading: How to find a Trimmed Mean in R http://dlvr.it/SGKT4h

A Real World Use Case for a Bayesian Reliability Model – How to Incorporate FEA into Risk Estimates

Frequentist statistical methods, despite their flaws, are generally serviceable for a large suite of practical problems faced by engineers during product development of medical devices. But even in domains where simple models usually do the trick, there remain instances where a Bayesian approach is the best (and perhaps only logical) ... Continue reading: A Real World Use Case for a Bayesian Reliability Model – How to Incorporate FEA into Risk Estimates http://dlvr.it/SGJ336

Battery Storage ROI Analysis

It's been a while since my last post and I'm taking my occasional blog in a different direction. Over the years I've learned that utility tariffs, although hard to follow, vary widely. They can have a significant financial impact on behind-the-meter activities, depending on the numbers. The map below shows ... Continue reading: Battery Storage ROI Analysis http://dlvr.it/SGHBrG

Simulating dice bingo

Note: This post was inspired by the “Classroom Bingo” probability puzzle in the Royal Statistical Society’s Significance magazine (Dec 2021 edition). Set-up: Imagine that we are playing bingo, but where the numbers are generated by the roll of two 6-sided … Continue reading: Simulating dice bingo http://dlvr.it/SGFSBX

Get and Set working directory (setwd / getwd) in R

Visit finnstats for the most up-to-date information on Data Science, employment, and tutorials. If you want to read the original article, go here: Get and Set working directory (setwd / getwd) in R. Your environment is always pointed to some working location when you ... Continue reading: Get and Set working directory (setwd / getwd) in R http://dlvr.it/SGFS92

R User Group Kano, Nigeria, is Bringing in the Next Generation of Students to the R Community

The R Consortium recently reached out to Umar Isah Adam to find out more about the status of the R User Group Kano. Umar is a freelance data analyst who... The post R User Group Kano, Nigeria, is Bringing in the Next Generation of Students to the R Community appeared ... Continue reading: R User Group Kano, Nigeria, is Bringing in the Next Generation of Students to the R Community http://dlvr.it/SGFS4j

Estimating recent population history from linkage disequilibrium with GONE and SNeP

In this post, we will look at running two programs that infer population history — understood as changes in linkage disequilibrium over time — from genotype data. The post will chronicle running them on some simulated data; it will be light on theory, and light on methods evaluation. Linkage disequilibrium, i.e. ... Continue reading: Estimating recent population history from linkage disequilibrium with GONE and SNeP http://dlvr.it/SGF8l0

Examining College Football Conference Realignment with {ggraph}

In my previous post I looked at College Football Non-Conference games to create a network map overlaid on top of the United States using the {ggraph} package. In this post I’ll be extending that to examine Conference Realignment, which is when colleges change from one conference to the next. ... Continue reading: Examining College Football Conference Realignment with {ggraph} http://dlvr.it/SGDGWB

Some Interesting Issues in VECM using R

This post deals with some interesting issues regarding the VECM model. Among them are the VAR representation of VECM, the weak exogeneity restrictions, user-defined cointegrating vectors, and so on. With the help of useful R packages, these is... Continue reading: Some Interesting Issues in VECM using R http://dlvr.it/SG9JJ8

Sample and Population Variance in R

Visit finnstats for the most up-to-date information on Data Science, employment, and tutorials. If you want to read the original article, go here: Sample and Population Variance in R. The variance is a metric for determining how dispersed data values are around the mean. ... Continue reading: Sample and Population Variance in R http://dlvr.it/SG8sqx
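
The excerpt above is truncated, so the original article's own code is not shown here. As a minimal sketch of the distinction named in the title, assuming a small made-up vector `x` (hypothetical values, not from the article): base R's `var()` computes the sample variance (divisor n - 1), and the population variance (divisor n) can be obtained by rescaling it.

```r
# Hypothetical data, chosen only for illustration (not from the article)
x <- c(2, 4, 4, 4, 5, 5, 7, 9)
n <- length(x)

# Sample variance: var() divides the sum of squared deviations by n - 1
sample_var <- var(x)

# Population variance: divide by n instead, i.e. rescale var() by (n - 1) / n
population_var <- sample_var * (n - 1) / n

# Cross-check against the definition: mean squared deviation from the mean
population_var_direct <- mean((x - mean(x))^2)

print(c(sample = sample_var, population = population_var))
```

For this vector the mean is 5 and the squared deviations sum to 32, so the sample variance is 32/7 and the population variance is 4. The difference between the two shrinks as n grows.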

How the Lagos R User Group Has Leveraged the Pandemic

Folajimi Aroloye is a data analytics professional and educator. He is the organizer for the Lagos R User Group. He is involved in capacity building. Folajimi started his data science... The post How the Lagos R User Group Has Leveraged the Pandemic appeared first on R Consortium. Continue reading: How the Lagos R User Group Has Leveraged the Pandemic http://dlvr.it/SG71S3

Exploring College Football Non-Conference Rivalries with {ggraph}

We’re in the middle of College Football’s bowl post-season and I’d been wanting to do a more in-depth post on networks using {tidygraph} and {ggraph} for a while. So now seemed like as good a time as any to explore some College Football data. I had used {... Continue reading: Exploring College Football Non-Conference Rivalries with {ggraph} http://dlvr.it/SG5zKg

R-Ladies Lagos on Data Visualization and Hybrid Events

Alimi Eyitayo is a Graduate of Computer Engineering and a social engineering professional with over 12 years of experience in the field of information technology. She is an international speaker... The post R-Ladies Lagos on Data Visualization and Hybrid Events appeared first on R Consortium. Continue reading: R-Ladies Lagos on Data Visualization and Hybrid Events http://dlvr.it/SG4CZ5

Oslo UseR! Group’s Diverse and Inclusive Environment Has Fostered a Resilient R Community

R Consortium talked to Raoul Wolf of the Oslo UseR! Group about the wide adoption of R in Norway, both in academia and industry. He explained how the pandemic initially... The post Oslo UseR! Group’s Diverse and Inclusive Environment Has Fostered a Resilient R Community appeared first on R ... Continue reading: Oslo UseR! Group’s Diverse and Inclusive Environment Has Fostered a Resilient R Community http://dlvr.it/SFJ9bk

Top 21 #RStats tweets of 2021

13 December 2021 The R Community has been as active as ever on Twitter during 2021. Here’s a selection of the top 21 tweets of the year. Not on Twitter and don’t know where to start? Don’t worry! I co-authored the free book Twitter for R programmers with Veerle van … The ... Continue reading: Top 21 #RStats tweets of 2021 http://dlvr.it/SFGVrP

Advent of 2021, Day 12 – Spark SQL

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec… Continue reading: Advent of 2021, Day 12 – Spark SQL http://dlvr.it/SFDr54

An easy to convert list to long table

Say you have a list with different lengths of vectors, e.g. head(genesets_list) $KEGG_GLYCOLYSIS_GLUCONEOGENESIS [1] "ACSS2" "GCK" "PGK2" "PGK1" "PDHB" "PDHA1" "PDHA... Continue reading: An easy to convert list to long table http://dlvr.it/SFCnc6
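
The excerpt stops before showing the conversion, so the post's actual method is not known from this page. One possible approach in base R, sketched under assumptions: `genesets_list` below is a hypothetical stand-in whose first element reuses the gene names visible in the excerpt, while `HYPOTHETICAL_SET` and its members are invented for illustration. `stack()` turns a named list of vectors into a two-column data frame.

```r
# Hypothetical stand-in for the list in the excerpt; the second set is invented
genesets_list <- list(
  KEGG_GLYCOLYSIS_GLUCONEOGENESIS = c("ACSS2", "GCK", "PGK2", "PGK1", "PDHB", "PDHA1"),
  HYPOTHETICAL_SET = c("GENE_A", "GENE_B")
)

# stack() returns a data frame with two columns:
#   values: every element of every vector
#   ind:    the name of the list element each value came from (a factor)
long_table <- stack(genesets_list)

# Rename the columns to something more descriptive (names are our choice)
names(long_table) <- c("gene", "geneset")

print(long_table)
```

Because each row carries its own set name, vectors of different lengths are no obstacle: the result has one row per (gene, geneset) pair.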

How to Make R Markdown Snow

Last year, I tweeted about how to spread holiday cheer by letting your R Markdown documents snow. After all, what better to put people in the holiday spirit than to add a random 5% probability that whatever part of a document they are trying to read wi... Continue reading: How to Make R Markdown Snow http://dlvr.it/SFBnbY

Pastor and Stambaugh (2003) Liquidity Measure – Individual Stock

This post shows how to calculate the liquidity measure (gamma) of Pastor and Stambaugh (2003) using R code. For exposition purposes we use only two individual series: small firm index and ESG firm index. As expected, a PS liquidity measure for smal... Continue reading: Pastor and Stambaugh (2003) Liquidity Measure – Individual Stock http://dlvr.it/SFB5kf

Advent of 2021, Day 10 – Working with data frames

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec… Continue reading: Advent of 2021, Day 10 – Working with data frames http://dlvr.it/SF88bH

Advent of 2021, Day 9 – RDD Operations

Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode Dec 05: Setting up Spark Cluster Dec 06: Setting up IDE Dec 07: Starting Spark with R and Python… Continue reading: Advent of 2021, Day 9 – RDD Operations http://dlvr.it/SF4cNL

Why R? 2021 Starts Tomorrow

The time has come! We are running another Why R? Conference tomorrow! We will have streams from 7am UTC till 7pm UTC! Speakers from 5 continents, participants from all over the globe! Join us on our Slack channel for discussions and visit YouTube cha... Continue reading: Why R? 2021 Starts Tomorrow http://dlvr.it/SF3cTk

Successful R-based Test Package Submitted to FDA

The R Consortium is happy to announce that on Nov 22nd, 2021, the R Submissions Working Group successfully submitted an R-based test submission package through the FDA eCTD gateway! The... The post Successful R-based Test Package Submitted to FDA appeared first on R Consortium. Continue reading: Successful R-based Test Package Submitted to FDA http://dlvr.it/SF1Mbg

Rules of thumb for the design of ecological experiments

Are you planning an experiment in the lab or field, or an observational study? It's a good idea to talk to a statistician before you start. Students often ask me for help with statistics after th... Continue reading: Rules of thumb for the design of ecological experiments http://dlvr.it/SF111p

Why R? 2021 Discussion Panels

This year we are organizing the fifth edition of Why R? 2021 conference. As the meetings are held in English we invite R users from all over the globe! The program is available on our youtube.com/WhyRFoundationVideos channel. Go and check what lecture... Continue reading: Why R? 2021 Discussion Panels http://dlvr.it/SF02y9

The rise of R in public health research institutes

Presentation by dr. Robby De Pauw, researcher at Sciensano. Robby explains how R is used by researchers at Sciensano (the national health institute in Belgium). There are about 300 researchers in the institute and about 80% are using R. Robby covers h... Continue reading: The rise of R in public health research institutes http://dlvr.it/SDxSJr

Shinytableau – How To Create Tableau Dashboard Extensions With R Shiny

If you’re in the business of business analytics, you know what Tableau is. It’s one of the top drag-and-drop BI tools for analyzing and visualizing data. Pretty much anyone can use it to make decent dashboards. But here’s the problem – it’s quite limiting for developers with ... Continue reading: Shinytableau – How To Create Tableau Dashboard Extensions With R Shiny http://dlvr.it/SDx0QW

Why R? 2021 Pearson Session

This year we are organizing the fifth edition of Why R? 2021 conference. As the meetings are held in English we invite R users from all over the globe! The program is available on our youtube.com/WhyRFoundationVideos channel. Go and check what lecture... Continue reading: Why R? 2021 Pearson Session http://dlvr.it/SDwR4R

Need to Code a Difficult Pharma Stats Table? The R Tables for Regulatory Submissions (RTRS) Working Group Wants to Know

The R Consortium’s R Tables for Regulatory Submissions (RTRS) Working Group has made considerable progress in identifying and working through the issues involved with developing a modern R based framework... The post Need to Code a Difficult Pharma Stats Table? The R Tables for Regulatory Submissions (RTRS) Working Group ... Continue reading: Need to Code a Difficult Pharma Stats Table? The R Tables for Regulatory Submissions (RTRS) Working Group Wants to Know http://dlvr.it/SDvHXd

Heterogeneous Treatment Effects with Instrumental Variables: A Causal Machine Learning Approach

Problem Setting: In our forthcoming paper in Annals of Applied Statistics, we propose a new method – which we call Bayesian Causal Forest with Instrumental Variable (BCF-IV) – to interpretably discover the subgroups with the largest or smallest cau... Continue reading: Heterogeneous Treatment Effects with Instrumental Variables: A Causal Machine Learning Approach http://dlvr.it/SDt9pC

Shiny Weekly: News from the R Shiny Community

Shiny Weekly has officially launched! The Shiny Weekly newsletter is our way of creating a central resource for R Shiny news from the R community. We’re so thankful to those who manage R news aggregators and to those of you who post quality, informative content. We feel that a ... Continue reading: Shiny Weekly: News from the R Shiny Community http://dlvr.it/SDsQ5Y

A 2nd look at vaccination breakthroughs in Switzerland

Our Covid19 app provides a global view of the pandemic, but how effective is vaccination in Switzerland? Since May 2020 we have been showing a dashboard in our gallery that contains a global view of the COVID-19 pandemic, with a further split by conti... Continue reading: A 2nd look at vaccination breakthroughs in Switzerland http://dlvr.it/SDsQ5F

Why R? 2021 Jumping Rivers Session

This year we are organizing the fifth edition of Why R? 2021 conference. As the meetings are held in English we invite R users from all over the globe! The program is available on our youtube.com/WhyRFoundationVideos channel. Go and check what lecture... Continue reading: Why R? 2021 Jumping Rivers Session http://dlvr.it/SDrtQS

9 new books added to Big Book of R

06 December 2021 Every time I update Big Book of R I’m blown away by how much good stuff is out there! In this release there are 9 new books which cover the widest range of topics of any release to date :). Thanks to Sivuyile Nzimeni for one of the … The post 9 ... Continue reading: 9 new books added to Big Book of R http://dlvr.it/SDqq7K

Advent of 2021, Day 5 – Setting up Spark Cluster

Series of Apache Spark posts: Dec 01: What is Apache Spark Dec 02: Installing Apache Spark Dec 03: Getting around CLI and WEB UI in Apache Spark Dec 04: Spark Architecture – Local and cluster mode We have explored the Spark architecture and look… Continue reading: Advent of 2021, Day 5 – Setting up Spark Cluster http://dlvr.it/SDpXyH

Easy Interpretations of ADF Test in R

This post shows how to interpret the results of the augmented Dickey-Fuller (ADF) test easily with the help of Hank Roark's R function. His R function provides kind descriptions of the results of a unit root ADF test. I explain why this description... Continue reading: Easy Interpretations of ADF Test in R http://dlvr.it/SDntjL

Writing Functions in R: Working Example One

A. Background We usually want to write reusable code that is compact and could be utilized in a number of use cases. That involves writing user-defined functions that can take particular input values, perform some set of steps, and return an output. I... Continue reading: Writing Functions in R: Working Example One http://dlvr.it/SDnthG

Sea Monsters that Lost their Way

library(tidyverse) library(tidymodels) library(probably) library(finetune) library(textrecipes) library(stopwords) library(wesanderson) library(kableExtra) library(clock) library(glue) library(janitor) library(vip) library(ggrepel) library(tictoc) library(doParallel) registerDoParallel(cores = 6) theme_set(theme_bw()) (cols % mutate( date = date_parse(date, format = "%d/%m/%Y"), length = parse_number(length_et), species_lumped = fct_lump_n(species, 20), across(ends_with("_val"), as.integer) ) # glimpse(strandings_df) Exploratory Some of the species labels contain a question mark or forward slash. This indicates uncertainty, so it might be fun to see if a machine ... Continue reading: Sea Monsters that Lost their Way http://dlvr.it/SDmCVR

Why R? 2021 Invited Talks

This year we are organizing the fifth edition of Why R? 2021 conference. As the meetings are held in English we invite R users from all over the globe! The program is available on our youtube.com/WhyRFoundationVideos channel. Go and check what lecture... Continue reading: Why R? 2021 Invited Talks http://dlvr.it/SDlZ6J

Augmented Dickey-Fuller (ADF) Test in R

This post explains how to use the augmented Dickey-Fuller (ADF) test in R. The ADF Test is a common statistical test to determine whether a given time series is stationary or not. We explain the interpretation of ADF test results from R package by m... Continue reading: Augmented Dickey-Fuller (ADF) Test in R http://dlvr.it/SDlZ4W

Where to ask for help when coding in R

When learning R, it can be tough to figure out how to apply what you’ve learned to your own data. We often learn general skills that are helpful for manipulating our data, but things aren’t always so simple when it comes to your own analysi... Continue reading: Where to ask for help when coding in R http://dlvr.it/SDjfWr

Mapping iNaturalist Data in R

This post was originally published from IGIS as a Tech Note. Enjoy! Introduction iNaturalist is an enormously popular platform for recording and sharing observations of nature. Most people interact with iNaturalist through the Android or iOS phone app, but a little known fact is the platform also has an API (... Continue reading: Mapping iNaturalist Data in R http://dlvr.it/SDjfT8

My Courses are now Free

A few months ago I announced a change in direction: rather than continuing to focus on Choroplethr-related development and introductory R training, I was going to focus on the broader field of Analytics Engineering. Today I am happy to announce that that effort has resulted in a new job. I ... Continue reading: My Courses are now Free http://dlvr.it/SDgbTv

What statistical test should I do?

Being a teaching assistant in statistics for students with diverse backgrounds, I have the chance to see what is globally not well understood by students. I have realized that it is usually not a problem for students to do a specific statistical te... Continue reading: What statistical test should I do? http://dlvr.it/SDf68F

Why R? 2021 Keynotes

This year we are organizing the fifth edition of Why R? 2021 conference. As the meetings are held in English we invite R users from all over the globe! The program is available on our youtube.com/WhyRFoundationVideos channel. Go and check what lecture... Continue reading: Why R? 2021 Keynotes http://dlvr.it/SDd5Ms

2021 Advent of Code Day 02 “Don’t Try This At Home” Edition

The Moderna booster level drained me all day on Dec 1 and did what jab two did during the overnight period (achy enough to wake me up and not get back to slumber easily). To try to wear myself down, I decided to practice a bit of R with the 2021 Advent ... Continue reading: 2021 Advent of Code Day 02 “Don’t Try This At Home” Edition http://dlvr.it/SDbRQv

R Consortium Funding Delivers New Format for the R Journal

By Di Cook, Mitch O’Hara-Wild, H. Sherry Zhang, Stephanie Kobakian, Michael Kane, Catherine Hurley, Simon Urbanek The R Journal is a primary outlet for publishing research of interest to the R community.... The post R Consortium Funding Delivers New Format for the R Journal appeared first on R Consortium. Continue reading: R Consortium Funding Delivers New Format for the R Journal http://dlvr.it/SDZm7H

Advent of 2021, Day 1 – What is Apache Spark?

Apache Spark is an open-source unified analytics engine for large-scale data processing. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Originally it was developed at Berkeley’s AMPLab, and later donated to the… Continue reading: Advent of 2021, Day 1 – What is Apache Spark? http://dlvr.it/SDZm4l

Advent of Code 2021

It’s that time of year again. And not just for Secret Santa—it’s time for the Advent of Code, a series of programming puzzles in the lead-up to Christmas. I’m doing the 2021 challenge in R—in the form of an open-source R package, to demonstrate a test-driven ... Continue reading: Advent of Code 2021 http://dlvr.it/SDYyXr

The virtual 2021 R Government & Public Sector Conference is coming December 9th & 10th

We are so excited to bring back the fourth annual Government and Public Sector R Conference to a computer screen near you! Join us on December 9th and 10th for a jam-packed two-day conference filled with an incredible lineup of speakers across the government and public sector space. Buy your ... Continue reading: The virtual 2021 R Government & Public Sector Conference is coming December 9th & 10th http://dlvr.it/SDYMV5