المشاركات

عرض المشاركات من فبراير, 2023

Distribution generalization in causal inference

صورة
Distribution generalization in causal inference Monday, March 20th, 2023, 7:00 PT / 10:00 ET / 15:00 CET 1st joint webinar of the IMS New Researchers Group, Young Data Science Researcher Seminar Zürich and the YoungStatS Project. When & Where: ... Continue reading: Distribution generalization in causal inference http://dlvr.it/Sk7PRh

Kaggle Playground Series – Tidymodels

صورة
Hello readers, we are entering another Kaggle playground competition, so get your Yorkshire tea ready and enjoy the process of joining. This month the competition I entered is this one https://www.kaggle.com/competitions/playground-series-s3e7It’seiew It’s looks like looks are canncellations from hotels and spoiler ... Continue reading: Kaggle Playground Series – Tidymodels http://dlvr.it/Sk6W1K

Save space in faceted plots

صورة
Faceting1 is probably the most distinctive feature that defined the early success and wide adoption of ggplot2. Small-multiples are often a great dataviz choice.2 But one common problem is when your panels for the subsets of data requite vastly diffe... Continue reading: Save space in faceted plots http://dlvr.it/Sk5hTC

Code snippets – regular expressions

صورة
Inspired by conversations on the NHS-R Slack where code answers are lost over time (it’s not a paid account), and Continue reading: Code snippets – regular expressions http://dlvr.it/Sk5hMJ

Descubrir y aprender todo lo que hay que saber sobre los paquetes de R utilizando r-universe

صورة
Encontrar la herramienta adecuada para el trabajo Lo más difícil de usar R con eficacia es encontrar los mejores paquetes para el problema que intentas resolver. Creo que esto es incluso, más importante que dominar el lenguaje en sí. Cosa que irás ad... Continue reading: Descubrir y aprender todo lo que hay que saber sobre los paquetes de R utilizando r-universe http://dlvr.it/Sk3Hln

Diophantine riddle

The weekly riddle from The Riddler is to find solutions to the Diophantine equation c³-c=b²+4 (when b and c are positive integers). First, forget about ChatGPT since it states this is a Pell equation. With a wrong argument. Second, when running a basic R code, using as.double ... Continue reading: Diophantine riddle http://dlvr.it/Sk34kt

Asymptotic Statistics in Non-Sparse Networks

صورة
Exchangeable arrays have been studied since the late 70’s (Aldous (1983), Kallenberg (2005)). Eagleson and Weber (1978) and Silverman (1976) establish Strong Law of Large Numbers and Central Limit Theorems for such arrays. Because non-sparse networks and multiway clustering are related to exchangeable arrays, they have received recent attention in statistics and econometrics (Davezies, ... Continue reading: Asymptotic Statistics in Non-Sparse Networks http://dlvr.it/Sk2K9Z

These drinking glasses are too short!

صورة
These drinking glasses are too short! Some of my reinsurance and math teacher friends may remember that when I am out of town and having an adult beverage with friends, I have been known to stare at the drinking glass and say something like, "I don'... Continue reading: These drinking glasses are too short! http://dlvr.it/Sk11Gt

Transformations for compositional data by @ellis2013nz

Motivation In engaging with this Twitter thread four months ago, I discovered that there was a whole set of statistical methods that I knew nothing about - transforming data that is in the form of a simplex. Common examples of this sort of data would ... Continue reading: Transformations for compositional data by @ellis2013nz http://dlvr.it/Sk0qBd

Mapping a square to a circle

صورة
Denoting by \(F(z \,|\, m)\) the incomplete elliptic function of first kind, the function \[ \varphi(z) = -\sqrt{i} \, F\bigl(i \sinh^{-1}(\sqrt{i} \, z) \,|\, -1 \bigr). \] is a conformal mappi... Continue reading: Mapping a square to a circle http://dlvr.it/Sk0F7B

Data Preppers with {healthyR.ai}

Introduction There are many different methods that one can choose from in order to model their data. This brings with it a fundamental issue of how to prepare your data for the specified algorithm. With the [{healthyR.ai}] package there are many... Continue reading: Data Preppers with {healthyR.ai} http://dlvr.it/SjzD3M

NYED Data Explorer Shows 15 Years of Charter School Success

صورة
NYED Data Explorer filtered for “All Students” ELA Aggregated Annual Test Scores Introduction Three years ago, in the course of building personal projects in R using public data from Connecticut, I wrote How Does Stamford Charter School for Exce... Continue reading: NYED Data Explorer Shows 15 Years of Charter School Success http://dlvr.it/SjwK6Y

rOpenSci Champions Program Kick off

صورة
The Champions Program got off to a great start in 2023! We’re happy to report on the first couple of months in our first run of the rOpenSci Champions Program. In September 2022, we launched the program, advertising for both mentors and mentees... Continue reading: rOpenSci Champions Program Kick off http://dlvr.it/SjsD8n

Puntapié inicial de nuestro programa de campeonas y campeones

صورة
¡El programa de campeones arrancó con todo este 2023! Nos complace informar sobre los dos primeros meses de nuestro primer programa de campeones y campeonas. En septiembre de 2022 hicimos el lanzamiento del programa, anunciando el llamado a aplicar a... Continue reading: Puntapié inicial de nuestro programa de campeonas y campeones http://dlvr.it/Sjp71V

Pivoting in tidyr and data.table

We all need to pivot data at some point, so these are just some notes for my own benefit really, because gather and spread are no longer in favour within tidyr. I tended to only ever need gather, and nearly always relied on the same key and ... Continue reading: Pivoting in tidyr and data.table http://dlvr.it/Sjk05j

A Gentle and Applied Introduction to Rcpp workshop

Learn how to use Rcpp package, while contributing to charity! Join our workshop on A Gentle and Applied Introduction to Rcpp to improve your skills which is a part of our workshops for Ukraine series.  Here’s some more info:  Title: A Gentle and Applied Introduction to Rcpp Date: Thursday, ... Continue reading: A Gentle and Applied Introduction to Rcpp workshop http://dlvr.it/Sjf7Rg

Working with image data in R workshop

Learn how to work with image data in R! Join our workshop on working with image data in R which is a part of our workshops for Ukraine series.  Here’s some more info:  Title: Working with image data in R Date: Thursday, March 23rd, 15:00 – 17:00 CET (Rome, Berlin, Paris timezone) ... Continue reading: Working with image data in R workshop http://dlvr.it/Sjf7DJ

rOpenSci News Digest, February 2023

Dear rOpenSci friends, it’s time for our monthly news roundup! You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci! rOpenSci HQ R-universe improvements! We have changed the preferred git repo name where you host your packages.json registry for ... Continue reading: rOpenSci News Digest, February 2023 http://dlvr.it/Sjc5fQ

Stop Using Fractional Logit

صورة
This post focuses on one of the more curious models in contemporary statistics, a specification for proportions that is either called fractional logit or quasi-Binomial. While in most cases in statistics, the evaluation of a model necessarily invol... Continue reading: Stop Using Fractional Logit http://dlvr.it/SjYPc6

Moving Average Plots with {healthyR.ts}

صورة
Introduction Are you interested in visualizing time series data in a clear and concise way? The R package {healthyR.ts} provides a variety of tools for time series analysis and visualization, including the ts_ma_plot() function. The ts_ma_plot()... Continue reading: Moving Average Plots with {healthyR.ts} http://dlvr.it/SjVSQv

Spreadsheets and robust backends: a love story?

صورة
The source of every data science project is a dataset or even multiple. In general, scientists prefer to share data using a spreadsheet. This allows to quickly explore, enter and modify data. Software developers on the other hand, prefer to build a... Continue reading: Spreadsheets and robust backends: a love story? http://dlvr.it/SjRXKF

Off to CRAN! {tidyAML}

Introduction Are you tired of spending hours tuning and testing different machine learning models for your regression or classification problems? The new R package {tidyAML} is here to simplify the process for you! tidyAML is a simple interface ... Continue reading: Off to CRAN! {tidyAML} http://dlvr.it/SjNnHn

Options to install R on a Raspberry Pi and other ARM systems

Install from the Operating System’s repositories Compile R from source Install R using rig Install R from the R4Pi Project This started out as a section in some of my other posts but as installation options started to pile up it began to take too much space to ... Continue reading: Options to install R on a Raspberry Pi and other ARM systems http://dlvr.it/SjKpZZ

From Bits to Words: A Tale Computing and Communication Languages

A ten-year-old kid was beginning his high school journey when he had to learn a third language. He was already learning two languages: his mother tongue Hindi and the common-speak English. But neither of them had prepared him for Sanskrit. Continue reading: From Bits to Words: A Tale Computing and Communication Languages http://dlvr.it/SjHpQd

tikzDevice v0.12.4

Yesterday tikzDevice version 0.12.4 made it unto CRAN and is now propagating to the mirrors. The tikzDevice package provides a graphics output device for R that records plots in a LaTeX-friendly format. The device transforms plotting commands issued b... Continue reading: tikzDevice v0.12.4 http://dlvr.it/SjFs3L

Creating and Predicting Fast Regression Parsnip Models with {tidyAML}

Introduction I am almost ready for a first release of my R package {tidyAML}. The purpose of this is to act as a way of quickly generating models using the parsnip package and keeping things inside of the tidymodels framework allowing users to s... Continue reading: Creating and Predicting Fast Regression Parsnip Models with {tidyAML} http://dlvr.it/SjCFfC

The Mandelbulb in R

صورة
In this post, I provide some R code which generates a mesh of the Mandelbulb, a well-known 3D fractal. The Mandelbulb is an isosurface, and I use the rmarchingcubes package to get a mesh of ... Continue reading: The Mandelbulb in R http://dlvr.it/Sj8XbS

What Does It Mean to Maintain a Package?

Part of what we aim to do at rOpenSci is nurture a community of package maintainers who help each other. In addition to support during package maintenance, we also want to support maintainers who wish to move on. Situations can change, and there may c... Continue reading: What Does It Mean to Maintain a Package? http://dlvr.it/Sj5pT1

New preferred repo name for r-universe registries

This post is part of a series of technotes about r-universe, a new umbrella project by rOpenSci under which we experiment with various ideas for improving publication and discovery of research software in R. As the project evolves, we will post update... Continue reading: New preferred repo name for r-universe registries http://dlvr.it/Sj4lB9

Passing a function from R to C++

The algebraicMesh function of my package cgalMeshes takes as inputs a trivariate polynomial \(P(x,y,z)\), a number \(\ell\) and some other parameters, and it returns a mesh of the isosurfa... Continue reading: Passing a function from R to C++ http://dlvr.it/Sj31rb

Data from the file drawer: Remembering case-sensitive and case-insensitive words

صورة
I’ve dug up an old, never published, dataset that I collected back in 2013. This dataset fairly cleanly shows that it’s harder to remember words correctly if you also have to remember the case of the letters. That is, if the shown word is B... Continue reading: Data from the file drawer: Remembering case-sensitive and case-insensitive words http://dlvr.it/Sj1ZTr

Cricket Weighted Batting Average in R

صورة
Hello, I hope you have your Yorkshire tea ready as today I am going to be exploring weighted averages using R. I used the code above to generate the table of the top 15 players by batting average in the 2022 county championship. Now the whole point of this blog is to ... Continue reading: Cricket Weighted Batting Average in R http://dlvr.it/ShzNxf

Fixing my broken VSCode setup for R

Here's how I set up the correct user settings and keybindings for using R in VSCode Continue reading: Fixing my broken VSCode setup for R http://dlvr.it/ShyKWT

Better and enhanced method of estimating Mallow’s Cp

Introduction In statistics, Mallows's Cp, named for Colin Lingwood Mallows, an English statistician, is used to assess the fit of a regression model that has been estimated using ordinary least squares. Models with a Mallows' Cp value near P+1 (i.e. ... Continue reading: Better and enhanced method of estimating Mallow’s Cp http://dlvr.it/ShxZFG

The Argument Matcher: A Function for Selecting the Right Arguments {tidyAML}

Introduction I am working on finishing up a few things with my new R package {tidyAML} before I release it to CRAN. One of those things is the ability of a user to build a model using a command that might be something like generate_model(). One ... Continue reading: The Argument Matcher: A Function for Selecting the Right Arguments {tidyAML} http://dlvr.it/ShwQYd

Simultaneous Optimization of Several Response Variables

صورة
All the code shown below is available in the repository of this publication: Simultaneous Optimization of Several Response Variables. The problem of optimizing several response variables When we perform an experiment in the laboratory it is usual t... Continue reading: Simultaneous Optimization of Several Response Variables http://dlvr.it/ShsrbB

Attributes in R Functions: An Overview

صورة
Introduction R is a powerful programming language that is widely used for data analysis, visualization, and machine learning. One of the features of R that makes it versatile and flexible is the ability to assign attributes to functions. Attribu... Continue reading: Attributes in R Functions: An Overview http://dlvr.it/ShrKY3

Dynamite for Causal Inference from Panel Data using Dynamic Multivariate Panel Models

صورة
Introduction Panel data contains measurements from multiple subjects measured over multiple time points. Such data can be encountered in many social science applications such as when analysing register data or cohort studies (for example). Often the ... Continue reading: Dynamite for Causal Inference from Panel Data using Dynamic Multivariate Panel Models http://dlvr.it/ShmTFd