Posts

Showing posts from June 2023

Tidy Tuesday Revisited: Interactive Map of Arlington Historic Neighborhoods

This week’s TidyTuesday is about place names as recorded by the US Board on Geographic Names. The dataset has been cleaned to include only populated places. I ended up augmenting the dataset with information about Arlington Historic neighborhoods ... Continue reading: Tidy Tuesday Revisited: Interactive Map of Arlington Historic Neighborhoods http://dlvr.it/SrS9xj

Fast creation of LFR Benchmark graphs in R

Almost a year ago, I introduced my most recent network-related R package, netUtils. The package is meant as a catch-all where I gather network analytic methods that haven’t yet been implemented in R and may not deserve their own package. Bet... Continue reading: Fast creation of LFR Benchmark graphs in R http://dlvr.it/SrQgwN

Exploring Rolling Correlation with the rollapply Function: A Powerful Tool for Analyzing Time-Series Data

Introduction: In the world of data analysis, time-series data is a common sight. Whether it’s stock prices, weather patterns, or website traffic, understanding the relationship between variables over time is crucial. One valuable technique in thi... Continue reading: Exploring Rolling Correlation with the rollapply Function: A Powerful Tool for Analyzing Time-Series Data http://dlvr.it/SrNJP8

ggplotting power curves from simr package

The R package simr has greatly facilitated power analysis for mixed-effects models using Monte Carlo simulation (i.e., hundreds or thousands of tests under slight variations of the data). The powerCurve function is used to estimate the statistical power for various sample sizes in one go. Since it runs serially, ... Continue reading: ggplotting power curves from simr package http://dlvr.it/SrMQzP

A Journey through Arrow in R

How to join this free online event with Stephanie Hazlitt, Jonathan Keane and Nic Crane. Apache Arrow is a software development platform for building high performance applications that process and transport large data sets. It is designed to improve the performance of data analysis methods, and to increase the efficiency ... Continue reading: A Journey through Arrow in R http://dlvr.it/SrLcBX

Tidy Tuesday: US Populated Places

Today’s TidyTuesday is about place names as recorded by the US Board on Geographic Names. The dataset has been cleaned to include only populated places. This week will involve more libraries than normal, since I am going to play with mapping. library(tidyverse) # who doesn't want to be tidy? ... Continue reading: Tidy Tuesday: US Populated Places http://dlvr.it/SrLc3L

Tidy Freedom Index as an R Package

R and Shiny Training: If you find this blog to be interesting, please note that I offer personalized and group-based training sessions that may be reserved through Buy me a Coffee. Additionally, I provide training services in the Spanish language ... Continue reading: Tidy Freedom Index as an R Package http://dlvr.it/SrKV2K

The differences of left join in SQL and R

Recently, I encountered a situation where I needed to translate an Access SQL query to R, and I noticed the contrasting behaviors of these two languages when it comes to handling NA/NULL values in left joins. The impact of NA/NULL values on joins... Continue reading: The differences of left join in SQL and R http://dlvr.it/SrKTxY
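
The excerpt does not say which behavior differs, so the following is only a minimal sketch of one commonly cited contrast, under that assumption: in standard SQL a comparison with NULL is never true, so rows with NULL keys do not match each other in a join, whereas base R's merge() treats NA keys as equal by default. The tables `orders` and `lookup` are hypothetical toy data, not taken from the post.

```r
# Hypothetical toy tables; the key column has a missing value on both sides.
orders <- data.frame(key = c("a", NA, "c"), qty = c(1, 2, 3),
                     stringsAsFactors = FALSE)
lookup <- data.frame(key = c("a", NA), label = c("alpha", "missing-key"),
                     stringsAsFactors = FALSE)

# Base R left join: all.x = TRUE keeps every row of `orders`.
joined <- merge(orders, lookup, by = "key", all.x = TRUE)
print(joined)

# The NA key in `orders` is matched to the NA key in `lookup`,
# which a SQL LEFT JOIN on key would not do for NULL keys.
na_row <- joined[is.na(joined$key), ]
print(na_row$label)
```

If you use dplyr instead, its join functions accept `na_matches = "never"` to get the SQL-like behavior.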

R for Predictive Modeling and Data Visualization in Turkey

Mustafa Cavus, organizer of the Eskisehir R User Group, in Turkey, discussed the diverse and thriving R community in Eskisehir. He shared the details of a 4-day event hosted by... The post R for Predictive Modeling and Data Visualization in Turkey appeared first on R Consortium. Continue reading: R for Predictive Modeling and Data Visualization in Turkey http://dlvr.it/SrGt8N

Visualization in R: Unleashing the Power of the abline() Function

Introduction: Welcome to the world of data visualization in R! In this blog post, we will explore the abline() function, a versatile tool that allows you to add straight lines to your plots effortlessly. Whether you’re a beginner or an experience... Continue reading: Visualization in R: Unleashing the Power of the abline() Function http://dlvr.it/SrGW3n

Order Constraints in Bayes Models (with brms)

Over a year ago, while listening to a very not-at-all-statistical podcast, I discovered that Bayesian modeling has been widely used in archaeology since the mid-90s to calibrate carbon dating. Carbon dating is a scientific method used to determine the a... Continue reading: Order Constraints in Bayes Models (with brms) http://dlvr.it/SrG3xZ

Student’s t-test explained with R and Pokemon

Continue reading: Student’s t-test explained with R and Pokemon http://dlvr.it/SrDsbx

A new function to plot convergence diagnostics from lme4::allFit()

Linear mixed-effects models (LMM) offer a consistent way of performing regression and analysis of variance tests which allows accounting for non-independence in the data. Over the past decades, LMMs have subsumed most of the General Linear Model, with a steady increase in popularity (Meteyard & Davies, 2020). Since their conception, LMMs have ... Continue reading: A new function to plot convergence diagnostics from lme4::allFit() http://dlvr.it/SrCKCw

How to break up colour variable in sjPlot into equally-sized bins

Whereas the direction of main effects can be interpreted from the sign of the estimate, the interpretation of interaction effects often requires plots. This task is facilitated by the R package sjPlot (Lüdecke, 2022). For instance, using the plot_model function, I plotted the interaction between two continuous variables.

    library(lme4)
    #> Loading required package: Matrix
    library(sjPlot)
    library(ggplot2)
    theme_set(theme_sjplot())
    # Create data using code by Ben Bolker from
    # https://stackoverflow.com/a/38296264/7050882
    set.seed(101)
    spin = runif(600, 1, 24)
    reg = runif(600, 1, 15)
    ID = rep(c("1","2","3","4","5", "6", "7", "8", "9", "10"))
    day = rep(1:30, each = 10)
    testdata

http://dlvr.it/SrBn7J

survivoR 2.1.0 is now available

Good news everyone! survivoR 2.1.0 is now available. The most exciting feature of this release is the new confessional timing app. The post survivoR 2.1.0 is now available appeared first on Dan Oehm | Gradient Descending. Continue reading: survivoR 2.1.0 is now available http://dlvr.it/Sr94C3

rOpenSci News Digest, June 2023

Dear rOpenSci friends, it’s time for our monthly news roundup! You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci! rOpenSci HQ Meeting the stars of the R-universe: PEcAn, an Open Source Project ... Continue reading: rOpenSci News Digest, June 2023 http://dlvr.it/Sr7PSP

How Can Someone Else Use My Model?

This post is part of a series called The Missing Semester of Your DS Education. Introduction: At this point in this series, I’ve discussed a lot of aspects of putting machine learning into production. I’ve gone over workflow orchestration... Continue reading: How Can Someone Else Use My Model? http://dlvr.it/Sr5284

All About Coworking Sessions with rOpenSci

We first introduced our Coworking and Office Hour sessions in August 2021 after a successful pilot of several ’label-athon’s in the April and May prior. We’ve had a successful couple of years since then and the coworking sessions hav... Continue reading: All About Coworking Sessions with rOpenSci http://dlvr.it/Sr2j9L

Miami-Dade County Public Employee Salary Research – An Analysis in R, Python, and Julia

This analysis was co-authored by data scientists Scott Fisher and Douglas Davila-Pestana. Mr. Fisher developed the Python code for the […] The post Miami-Dade County Public Employee Salary Research – An Analysis in R, Python, and Julia first appeared on Remix Institute. Continue reading: Miami-Dade County Public Employee Salary Research – An Analysis in R, Python, and Julia http://dlvr.it/Sr1j2L

Mastering Data Aggregation with xtabs() in R

Introduction: As a programmer, you’re constantly faced with the task of organizing and analyzing data. One powerful tool in your R arsenal is the xtabs() function. In this blog post, we’ll explore the versatility and simplicity of xtabs() for agg... Continue reading: Mastering Data Aggregation with xtabs() in R http://dlvr.it/SqzXfF
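
As a rough illustration of the kind of aggregation xtabs() performs, here is a minimal sketch using the built-in mtcars data. The choice of dataset and columns is an assumption made for this example and is not taken from the post.

```r
# Count cars by number of cylinders and gears (formula with no left-hand side).
counts <- xtabs(~ cyl + gear, data = mtcars)
print(counts)

# With a left-hand side, xtabs() sums that variable within each group:
# here, total horsepower per cylinder count.
hp_by_cyl <- xtabs(hp ~ cyl, data = mtcars)
print(hp_by_cyl)
```

The one-sided formula gives a contingency table of counts, while the two-sided form aggregates a numeric column, which is why the same function covers both cross-tabulation and grouped sums.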

Finding logistic models to generate data with desired risk ratio, risk difference and AUC profiles

About two years ago, someone inquired whether simstudy had the functionality to generate data from a logistic model with a specific AUC. It did not, but now it does, thanks to a paper by Peter Austin that describes a nice algorithm to accomplish thi... Continue reading: Finding logistic models to generate data with desired risk ratio, risk difference and AUC profiles http://dlvr.it/SqxNbX

rOpenSci Champions Program Teams: Meet Alican Cagri Gokcek and Elio Campitelli

We designed the rOpenSci Champions Program with a mentorship aspect. Mentoring plays a significant role in the growth and development of both mentors and mentees alike. In our program, each Champion has a mentor who accompanies them during their train... Continue reading: rOpenSci Champions Program Teams: Meet Alican Cagri Gokcek and Elio Campitelli http://dlvr.it/SqwZqm

The Naming of Stats

The Naming of Stats is a difficult matter,
     It isn’t just one of your holiday games;
You may think at first I’m as mad as a hatter
When I tell you, a stat must have THREE DIFFERENT NAMES.
First of all are the names where usage is informal,
     ...
Continue reading: The Naming of Stats http://dlvr.it/SqwGb4

Leveraging generic type hints of classes in Python

Introduction: Python, a versatile and dynamically typed language, gained significant enhancements with the introduction of type hinting in Python 3.5. Often though, there are cases where the specific type of an input and output of a function are not kno... Continue reading: Leveraging generic type hints of classes in Python http://dlvr.it/SqtY7h

Simpler drawing of Pacific choropleth maps by @ellis2013nz

Simple use of the draw_pac_map() function Last year I blogged about making a choropleth map of the Pacific, dealing with a few technical details like getting the map centred on the Pacific, using exclusive economic zones (EEZs) to colour things in for... Continue reading: Simpler drawing of Pacific choropleth maps by @ellis2013nz http://dlvr.it/SqrWVh

Mastering the Power of R’s diff() Function: A Programmer’s Guide

Introduction: As a programmer, it’s crucial to have a deep understanding of the tools at your disposal. In the realm of data analysis and manipulation, R stands as a powerhouse. One function that proves to be invaluable in many scenarios is diff(... Continue reading: Mastering the Power of R’s diff() Function: A Programmer’s Guide http://dlvr.it/SqpYy5
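
To make the idea concrete, here is a small sketch of what diff() computes on a toy vector. The vector is made up for illustration, and `lag` and `differences` are the standard arguments of base R's diff().

```r
x <- c(1, 4, 9, 16, 25)

# Differences between consecutive elements: x[i + 1] - x[i].
d1 <- diff(x)                   # 3 5 7 9

# Differences between elements two positions apart: x[i + 2] - x[i].
d_lag2 <- diff(x, lag = 2)      # 8 12 16

# Second-order differences: diff() applied twice.
d2 <- diff(x, differences = 2)  # 2 2 2

print(d1); print(d_lag2); print(d2)
```

Note that each call returns a vector shorter than its input, by `lag * differences` elements.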

How do you measure the impact of a champions program?

This post can also be found on the CSCCE blog. How do you measure the impact of a champions program? This was the central question of a working session at CZI’s Accelerating Open Science in Latin America workshop, convened by rOpenSci’s Commun... Continue reading: How do you measure the impact of a champions program? http://dlvr.it/SqnCkd

Introduction to Linear Regression in R: Analyzing the mtcars Dataset with lm()

Introduction: The lm() function in R is used for fitting linear regression models. It stands for “linear model,” and it allows you to analyze the relationship between variables and make predictions based on the data. Let’s dive into the parameter... Continue reading: Introduction to Linear Regression in R: Analyzing the mtcars Dataset with lm() http://dlvr.it/SqltnW
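
A minimal sketch of fitting and using such a model on mtcars follows. The excerpt does not show which variables the post uses, so the formula `mpg ~ wt` (fuel economy against weight) and the `new_cars` weights are assumptions chosen for illustration.

```r
# Fit a simple linear regression of miles per gallon on weight (1000 lbs).
fit <- lm(mpg ~ wt, data = mtcars)

# Estimated intercept and slope.
print(coef(fit))

# Predict mpg for two hypothetical cars weighing 2,500 and 3,500 lbs.
new_cars <- data.frame(wt = c(2.5, 3.5))
pred <- predict(fit, newdata = new_cars)
print(pred)
```

Calling summary(fit) would additionally report standard errors, p-values and R-squared for the fitted model.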

World Cup – World Soccer Analytics

World Soccer Analytics: Analyzing the beautiful game with a data lens. Tag: World Cup. Stefan Gouyet, Winners at the World Cup (August 16, 2018). […] Continue reading: World Cup – World Soccer Analytics http://dlvr.it/SqltYm

Football – World Soccer Analytics

World Soccer Analytics: Analyzing the beautiful game with a data lens. Tag: Football. Stefan Gouyet, Winners at the World Cup (August 16, 2018); Stefan Gouyet, A Change in Age at the Olympics (August 11, 2018); […] Continue reading: Football – World Soccer Analytics http://dlvr.it/Sqj1gK

Winners at the World Cup – World Soccer Analytics

Winners at the World Cup: The 2018 FIFA World Cup has been over for a month now but the memories are far from fading. It was a fabulous tournament, combining underdog stories, upsets, and some pretty high quality football. The World Cup has been around for 88 years now, featuring 79 national teams ... Continue reading: Winners at the World Cup – World Soccer Analytics http://dlvr.it/SqgTp2

Comments on: A Change in Age at the Olympics

This article was first published on worldsocceranalytics.com, and kindly contributed to R-bloggers. Continue reading: Comments on: A Change in Age at the Olympics http://dlvr.it/SqcbRs

Kakhovka Dam Disaster

Some History: The war in Ukraine has spawned yet another disaster, the destruction of the dam across the Dnipro River, upstream from Kherson City. This is an ecological and humanitarian disaster, as vast areas of settlements, farmlands and wetlands have been destroyed. This marks the third time a dam in ... Continue reading: Kakhovka Dam Disaster http://dlvr.it/SqZghT

rOpenSci Champions Program Teams: Meet Cheryl Isabella Lim and Mauro Lepore

We designed the rOpenSci Champions Program with a mentorship aspect. Mentoring plays a significant role in the growth and development of both mentors and mentees alike. In our program, each Champion has a mentor who accompanies them during their train... Continue reading: rOpenSci Champions Program Teams: Meet Cheryl Isabella Lim and Mauro Lepore http://dlvr.it/SqZQ3j

Building a basic Shiny app with Golem – Part I (Video)

Continue reading: Building a basic Shiny app with Golem – Part I (Video) http://dlvr.it/SqSB2G

Why GLMs should be a priority when teaching statistics

There is often debate over the best place to start when teaching students inferential statistics. The teaching of statistics should start with a basic understanding of what data are, probability and... Continue reading: Why GLMs should be a priority when teaching statistics http://dlvr.it/SqPLS8

A Gentle Introduction to Docker

This post is part of a series called The Missing Semester of Your DS Education. Introduction: If you’re doing data science work, it’s likely you’ll eventually come across a situation where you need to run your code somewhere else. Whether... Continue reading: A Gentle Introduction to Docker http://dlvr.it/SqLJ2n

Debunking the idea that cpp11 does not offer OpenMP support

Continue reading: Debunking the idea that cpp11 does not offer OpenMP support http://dlvr.it/SqHBKW

Tidying the Freedom Index

Continue reading: Tidying the Freedom Index http://dlvr.it/SqD1j6

Upcoming changes to popular R packages for spatial data: what you need to do

The issue: Three popular R packages for spatial data handling won’t be available on CRAN after October 2023. These packages are: rgdal: a package that provides bindings to the GDAL and PROJ libraries. In other words, it gives a capability of rea... Continue reading: Upcoming changes to popular R packages for spatial data: what you need to do http://dlvr.it/SqC6Ll

Mastering file download in shiny

You can read the original post in its original format on the Rtask website by ThinkR here: Mastering file download in shiny. But why does downloadHandler now return an empty file!? Context: When we start working with {shiny}, we often reach a point where it is necessary to offer the user ... Continue reading: Mastering file download in shiny http://dlvr.it/Sq9Jv0

Simulating confounders, colliders and mediators by @ellis2013nz

Motivation and key points So this tweet came across my feed. It refers to this article, Statistical Control Requires Causal Justification by Wysocki, Lawson and Rhemtulla, published in June last year (2022) in Advances in Methods and Practices in Psyc... Continue reading: Simulating confounders, colliders and mediators by @ellis2013nz http://dlvr.it/Sq6jmx

Sorting, Ordering, and Ranking: Unraveling R’s Powerful Functions

Introduction: In the realm of data analysis and programming, organizing and sorting data efficiently is crucial. In R, a programming language renowned for its data manipulation capabilities, we have three powerful functions at our disposal: order... Continue reading: Sorting, Ordering, and Ranking: Unraveling R’s Powerful Functions http://dlvr.it/Sq4W4j
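
The excerpt is cut off after naming order(), so the sketch below assumes, based on the post title, that the other two functions are base R's sort() and rank(). It shows how the three differ on a made-up vector.

```r
x <- c(30, 10, 20)

# sort() returns the values themselves in increasing order.
sorted <- sort(x)   # 10 20 30

# order() returns the positions that would sort x.
ord <- order(x)     # 2 3 1

# rank() returns the rank of each element in its original position.
rk <- rank(x)       # 3 1 2

# Indexing by order() reproduces sort().
same <- identical(x[ord], sorted)

print(sorted); print(ord); print(rk); print(same)
```

In practice, order() is the one to reach for when sorting a data frame by a column, for example `df[order(df$col), ]`, because it yields row positions instead of sorted values.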

Troubleshooting Pandoc Problems as an R User

The Pandoc CLI by John MacFarlane is a really useful tool: for instance, rOpenSci community manager Yanina Bellini Saibene recently asked Maëlle whether she could convert a Google Document into a Quarto book. Maëlle solved the request with a combinati... Continue reading: Troubleshooting Pandoc Problems as an R User http://dlvr.it/Sq1cG0

Downloading snapshots and creating stable R packages repositories using r-universe

Downloading repository snapshots The new snapshot API lets you download a full copy of any CRAN-like repository on r-universe. You can use such a snapshot to mirror the entire CRAN-like repository on your own servers, or for example to build a stable... Continue reading: Downloading snapshots and creating stable R packages repositories using r-universe http://dlvr.it/SpyZz1