Posts

Showing posts from June 2024

Unlocking the Power of Administrative Data with healthyR.data

Introduction Hello, fellow data enthusiasts! Today, I’m excited to share insights into the {healthyR.data} package, an essential tool in the healthyverse that will streamline your data exploration and testing processes. Whether you’re a seasoned data scientist or just starting out in data analytics, this package is designed ... Continue reading: Unlocking the Power of Administrative Data with healthyR.data http://dlvr.it/T90Khc

Little useless-useful R functions – Dragon curve

Let’s play with some dragons. Dragons from Jurassic Park or the board game Dungeons & Dragons. The algorithm draws a fractal curve of Hausdorff dimension 2. One starts with one segment. In each iteration the number of segments is… Continue reading: Little useless-useful R functions – Dragon curve http://dlvr.it/T8zmTW

Calculating Z-Scores in R: A Step-by-Step Guide

The post Calculating Z-Scores in R: A Step-by-Step Guide appeared first on Data Science Tutorials. Z-scores measure how many standard deviations an individual data value is from the ... Continue reading: Calculating Z-Scores in R: A Step-by-Step Guide http://dlvr.it/T8zmLW

Big Update to Big Book of R (new look, new chapter, new books)

29 June 2024. I’m super-excited to announce a bumper Big Book of R update. Quarto update: Firstly, you’ll see that the site has been updated from bookdown to Quarto. Not only does it give us a nice visual update, the search function seems to work a lot better, and I ... Continue reading: Big Update to Big Book of R (new look, new chapter, new books) http://dlvr.it/T8xgNT

SHAP Values of Additive Models

This post investigates properties of SHAP values of additive models. Continue reading: SHAP Values of Additive Models http://dlvr.it/T8wScl

Shiny ducks: connecting to MotherDuck from Shiny

In a previous post I wrote about how to connect to MotherDuck from R. However, the process described there, where you click in the browser to authenticate, wouldn’t really work with a Shiny app, or for that matter with any productionized setup. A... Continue reading: Shiny ducks: connecting to MotherDuck from Shiny http://dlvr.it/T8wSR8

Evaluation of the UEFA Euro 2024 group stage forecast

A look back on the group stage of the UEFA Euro 2024 to check whether our ensemble machine learning forecasts were any good... How surprising was the group stage? This week the group stage of the UEFA Euro 2024 was conclu... Continue reading: Evaluation of the UEFA Euro 2024 group stage forecast http://dlvr.it/T8wSMH

How to Execute VBA Code in Excel via R using RDCOMClient

Introduction Hey everyone, Today, I want to share a neat way to bridge the gap between R and Excel using VBA. Specifically, we’ll look at how to run VBA code in Excel directly from R. This can be incredibly useful if you’re looking to automate ... Continue reading: How to Execute VBA Code in Excel via R using RDCOMClient http://dlvr.it/T8vXVs

A timeline of R’s first 30 years

August 2023 marked the thirtieth anniversary of the first public release of the R programming language. To celebrate this, and to show how far the language has evolved across those three decades, the timeline below shows some landmark events, p... Continue reading: A timeline of R’s first 30 years http://dlvr.it/T8srgV

Exploring Random Walks and Brownian Motions with healthyR.ts

Introduction In the world of time series analysis, Random Walks, Brownian Motion, and Geometric Brownian Motion are fundamental concepts used in various fields, including finance, physics, and biology. Today, we’ll explore these concepts using ... Continue reading: Exploring Random Walks and Brownian Motions with healthyR.ts http://dlvr.it/T8rv1n

Interactive Map Filter in Shiny

Introduction Recently, I participated in Posit’s 2024 Table Contest. For my submission, which you can view here, I included a leaflet map that acts as a filter in Shiny. This is a cool, dashboard-like feature similar to what you might find in P... Continue reading: Interactive Map Filter in Shiny http://dlvr.it/T8rtdN

How to load and save vector data in R

Contents: Introduction, Vector loading, Shapefile (.shp), GeoPackage (.gpkg), GeoJSON (.geojson), Vector saving, Synopsis, Supplement, Zipped shapefile (.shz), Virtual File Systems, SQL preselection ... Continue reading: How to load and save vector data in R http://dlvr.it/T8rtBZ

Updates to the Big Book of R

The Big Book of R provides a comprehensive and ever-growing overview of a broad selection of R programming books. It was created and is maintained by Oscar Baruffa. The collection began with approximately 100 books and, with the help of contributions from the R community, has subsequently expanded to over 400. The ... Continue reading: Updates to the Big Book of R http://dlvr.it/T8rsrz

Celebrating a Decade of EARL: Join us in Brighton, UK for EARL 2024!

See the full agenda and book your tickets today for EARL 2024 and key in RBTBZO at the checkout for a limited (while stocks last) 10% saving on your tickets as a special for R bloggers readers. The Enterprise Applications of the R Language (EARL) Conference is a cross-sector tech conference focusing ... Continue reading: Celebrating a Decade of EARL: Join us in Brighton, UK for EARL 2024! http://dlvr.it/T8q9Wj

Election

French and European flags – CC-BY-NC-ND by Ibrahim Ajaja / World Bank. Election maps are hard: Land doesn’t vote, people do! So most maps made on these occasions (generally choropleths) are possibly misleading. Figure 1: Car... Continue reading: Election http://dlvr.it/T8GBDX

A fresh new look for R-universe!

A whole new frontend! As you may have noticed, we have given the WebUI for R-universe a big refresh. This is the biggest UX overhaul since the beginning of the project. The old “dashboard” had become a bit convoluted over the years as fea... Continue reading: A fresh new look for R-universe! http://dlvr.it/T8G9yt

VBA Code to Check if a Sheet Exists

Introduction In today’s post we are going to go over VBA code to check if a sheet exists and then we are going to call that function from R using the RDCOMClient package. This can be useful when you need to perform certain actions based on the ... Continue reading: VBA Code to Check if a Sheet Exists http://dlvr.it/T8BNrp

Keith Karani Wachira: Leading the Dekut R Community in Kenya and Innovating with R 

Keith Karani Wachira, the Dekut R Community organizer based in Nyeri, Kenya, was recently interviewed by the R Consortium and shared his journey in the R community, which began in... The post Keith Karani Wachira: Leading the Dekut R Community in Kenya and Innovating with R appeared first on R ... Continue reading: Keith Karani Wachira: Leading the Dekut R Community in Kenya and Innovating with R http://dlvr.it/T88hWT

Meta-Analysis in R workshop

Join our workshop on Meta-Analysis in R, which is a part of our workshops for Ukraine series! Here’s some more info: Title: Meta-Analysis in R. Date: Thursday, July 11th, 18:00 – 20:00 CEST (Rome, Berlin, Paris timezone). Speaker: Matthew B. Jané is a graduate student in quantitative psychology at the University of ... Continue reading: Meta-Analysis in R workshop http://dlvr.it/T88hNY

Extracting Numbers from Strings in R

Introduction Hello! Today, we’ll jump into something I think is a pretty neat task in data processing: extracting numbers from strings. We’ll explore three different methods using base R, the stringr package, and the stringi package. Each metho... Continue reading: Extracting Numbers from Strings in R http://dlvr.it/T88hCT

Hack your way to a good Git history

I’ve now explained on this blog why it’s important to have small, informative Git commits and how I’ve realized that polishing history can happen in a second phase of work in a branch. However, I’ve more or less glossed over ho... Continue reading: Hack your way to a good Git history http://dlvr.it/T88CXV

Introduction to My Content Series

Introduction Hello Everyone, I’m excited to kick off a new content series dedicated to reviewing and exploring the R packages I’ve developed. Over the coming weeks, I’ll be diving into the details, features, and practical applications of each ... Continue reading: Introduction to My Content Series http://dlvr.it/T871xC

shiny.gosling is Now on Bioconductor – An Open-Source Software for Bioinformatics

Inclusion of our shiny.gosling package on Bioconductor is a three-fold milestone for us. First, it gives our developers the recognition they deserve. Second, it strengthens our position as a leading voice in bioinformatics software development. And finally, it amplifies our visibility and credibility within the scientific community. For you, ... Continue reading: shiny.gosling is Now on Bioconductor – An Open-Source Software for Bioinformatics http://dlvr.it/T85KS0

Forecasting the UEFA Euro 2024 with a machine learning ensemble

Probabilistic forecasts for the UEFA Euro 2024 are obtained by using a hybrid model that combines data from four advanced statistical models. The favorite is France, followed by England and host Germany. Football fans arou... Continue reading: Forecasting the UEFA Euro 2024 with a machine learning ensemble http://dlvr.it/T83cJG

New Paper on Data Privacy

Readers who are interested in the Data Privacy field may find our new paper (Perry, Matloff, Tendick) of interest, https://tdp.cat/issues21/tdp.a478a22.pdf…. There we introduce a new method that we call RWN, Randomization within Neighborhoods. We present a bit of supporting theory and do some ... Continue reading: New Paper on Data Privacy http://dlvr.it/T82ScV

Why you shouldn’t use boxplots

Box plots are a very common tool in data visualization to show how your data is distributed. But they have a crucial flaw. Let’s find out what that flaw is. And if you’re interested in the video version of this blog post, you can find it here: ... Continue reading: Why you shouldn’t use boxplots http://dlvr.it/T81fpM

The Hitchhiker’s Guide to Linear Models

About The Hitchhiker’s Guide to Linear Models can be downloaded for free (or for a suggested price of 10 USD) from Leanpub. For every exercise I did my best to connect the specific statistical concepts with R code, and every time I use linear alg... Continue reading: The Hitchhiker’s Guide to Linear Models http://dlvr.it/T81H2r

How to Check if a Character is in a String in R

Introduction When working with text data in R, one common task is to check if a character or substring is present within a larger string. R offers multiple ways to accomplish this, ranging from base R functions to packages like stringr and stri... Continue reading: How to Check if a Character is in a String in R http://dlvr.it/T81GrW

Matrices with fixed row and column sums

Given two vectors \(p\) and \(q\) of non-negative integer numbers, denote by \(A(p, q)\) the number of matrices with non-negative integer entries whose row sum and column sum respectively are \(p\... Continue reading: Matrices with fixed row and column sums http://dlvr.it/T81Gb1

LogAnalyzer: An Easy-to-Use Log Monitoring Tool for R/Shiny Applications

Posit Connect is often the de facto platform to deploy R/Shiny applications, Python applications, and Quarto-Shiny documents. And when something goes wrong, we immediately go and visit the logs. But often the logs are walls of text that we have to scour through and when you have been coding ... Continue reading: LogAnalyzer: An Easy-to-Use Log Monitoring Tool for R/Shiny Applications http://dlvr.it/T7yG4P

Public Pinball Machines per Capita: A new global indicator

There are tons of well-known global indicators. We’ve all heard of gross domestic product, life expectancy, rate of literacy, etc. But, ever since I discovered pinballmap.com, possibly the world’s most comprehensive database of public pinball locations, I’ve been thinking about a potential new global indicator: Public ... Continue reading: Public Pinball Machines per Capita: A new global indicator http://dlvr.it/T7yFVc

Introduction of My Content Series

Introduction Hello, everyone! I’m excited to introduce a new content series that will be shared across multiple platforms, including my blog, LinkedIn, Mastodon, and Telegram. This series is designed to provide you with valuable insights, pract... Continue reading: Introduction of My Content Series http://dlvr.it/T7xfPZ

From scripts to package. Developing dendroNetwork and learning with rOpenSci

Developing dendroNetwork as a package was not a goal from the beginning, but looking back, I think that it should have been. I wish someone had suggested making a package to me much earlier. Why? Because of many things, but mostly: reproducibility and... Continue reading: From scripts to package. Developing dendroNetwork and learning with rOpenSci http://dlvr.it/T7xf7t

Install PostGIS on Red Hat Enterprise Linux (RHEL) 8

Motivation I was ready to copy a table with a geometry column from R to PostgreSQL, and I saw this error that wasn’t there yesterday: Error: Failed to fetch row : ERROR: type "geometry" does not exist at character 108 I ran sudo -i -u postgres; ... Continue reading: Install PostGIS on Red Hat Enterprise Linux (RHEL) 8 http://dlvr.it/T7w78G

R One Billion Row Challenge: Is R Viable Option for Analyzing Huge Datasets?

Note: Thank you Kamil Żyła for providing guidance and expertise in writing R code for this article. R, being single-threaded in nature, isn’t the fastest programming language out there. You have options when it comes to parallelism, but these often don’t reduce the runtime as much as ... Continue reading: R One Billion Row Challenge: Is R Viable Option for Analyzing Huge Datasets? http://dlvr.it/T7vkdR

The Sanctuary: Stats and data from {survivoR}

I wanted a space to throw all my tables and charts made using the {survivoR} R package into, so I […] The post The Sanctuary: Stats and data from {survivoR} appeared first on Dan Oehm | Gradient Descending. Continue reading: The Sanctuary: Stats and data from {survivoR} http://dlvr.it/T7tQ3R

How to Split a Character String and Get the First Element in R

Introduction Hello, R community! Today, we’re jumping into a common yet powerful task in data manipulation: splitting character strings and extracting the first element. We’ll explore how to accomplish this in base R, as well as using the strin... Continue reading: How to Split a Character String and Get the First Element in R http://dlvr.it/T7t3sj

Two Roads Diverged

Two roads diverged in a wood and I, I took the one less traveled by, and that has made all the difference. A little-known historical tidbit is that Robert Frost’s The Road Less Traveled - so often cited as a celebration of individuality and difficult choices - was in ... Continue reading: Two Roads Diverged http://dlvr.it/T7t3WY

Unveiling New Tools in the TidyDensity Arsenal: Distribution Parameter Wrangling

Introduction Greetings, fellow data enthusiasts! Today, we’re thrilled to unveil a fresh wave of functionalities in the ever-evolving TidyDensity package. Buckle up, as we delve into the realm of distribution statistics! This update brings a bo... Continue reading: Unveiling New Tools in the TidyDensity Arsenal: Distribution Parameter Wrangling http://dlvr.it/T7rN5D

PowerQuery Puzzle solved with R

#187–188 Puzzles. Author: ExcelBI. All files (xlsx with puzzle and R with solution) for each and every puzzle are available on my GitHub. Enjoy. Puzzle #187: There are two ways of showing data and it really depends on what is the purpose of certain presentatio... Continue reading: PowerQuery Puzzle solved with R http://dlvr.it/T7q6H3

Coloured text in {ggplot2}: {ggtext} vs {marquee}

When you use colour to denote the values of a variable in a visualisation, it’s very common to add a legend showing how the colours map to different values. If you create your charts using {ggplot2}, a legend is added automatically when you add c... Continue reading: Coloured text in {ggplot2}: {ggtext} vs {marquee} http://dlvr.it/T7q60P

An Overview of the New Parameter Estimate Functions in the TidyDensity Package

Introduction Hello, R enthusiasts! I’m excited to share some fantastic updates to the TidyDensity package. These updates introduce a suite of parameter estimate functions designed to make your data analysis more efficient and insightful. Whethe... Continue reading: An Overview of the New Parameter Estimate Functions in the TidyDensity Package http://dlvr.it/T7p3hN

Polytomous Latent Class Analysis and Regression in R workshop

Join our workshop on Polytomous Latent Class Analysis and Regression in R, which is a part of our workshops for Ukraine series! Here’s some more info: Title: Polytomous Latent Class Analysis and Regression in R. Date: Wednesday, July 3rd, 18:00 – 20:00 CEST (Rome, Berlin, Paris timezone). Speaker: Lana Bojanić is a ... Continue reading: Polytomous Latent Class Analysis and Regression in R workshop http://dlvr.it/T7nMcC

Why you need small, informative Git commits

“Make small Git commits with informative messages” is a piece of advice we hear a lot when learning Git. That’s why we might want to sometimes rewrite history in a branch. In this post, I’d like to underline three main (😉) reaso... Continue reading: Why you need small, informative Git commits http://dlvr.it/T7n29m

pRotectionism: The way to compete in 2024

Have you ever... spent hours, maybe even days, tackling a tricky coding problem at work, only to discover a better, cleaner, and faster open-source alternative? UGH! There’s got to be a better way! You’re in luck! Inspired by the populist policies of r... Continue reading: pRotectionism: The way to compete in 2024 http://dlvr.it/T7lb53

Shapefile must… be replaced

The ESRI Shapefile format is possibly the most popular data format for storing spatial vector data. It is well-established, widely supported, and can be used in most GIS software. At the same time, it has many shortcomings and limitations: it is ... Continue reading: Shapefile must… be replaced http://dlvr.it/T7lZxT

Simple custom colour palettes with R ggplot graphs

A simple yet effective way to set your colour palette in R using the ggplot library. After that, we can start “chaining” ggplot graphs: scatter plot, barchart / histogram, boxplot, 3.14 chart. Finally, custom colours! As always, the complete code is… Continue reading: Simple custom colour palettes with R ggplot graphs http://dlvr.it/T7kVs0

RStudio Shortcuts and Settings

RStudio is the premier way to write R code. It is very user-friendly as it’s super easy to set up. But it also has a lot of pretty advanced features. In today’s blog post I’ll give you a list of my favorite RStudio tricks and shortcuts. If ... Continue reading: RStudio Shortcuts and Settings http://dlvr.it/T7jRpV

Easy data cleaning with the janitor package

The janitor package contains only a small number of functions, but they are nevertheless surprisingly convenient to use. I never really fully appreciated its functionality until I took a look into the documentation. So let’s dive into this pack... Continue reading: Easy data cleaning with the janitor package http://dlvr.it/T7jRcl

Creating R tutorial worksheets (with and without solutions) using Quarto

When I’m teaching R, I often have exercise questions for students to work through during our workshop sessions. So I need to create a tutorial worksheet. Although we often go through solutions to the exercises in the workshop, I also want to crea... Continue reading: Creating R tutorial worksheets (with and without solutions) using Quarto http://dlvr.it/T7hmSk