Posts

Showing posts from December 2022

New Statistics Tutorial

I’ve recently completed fastStat, https://github.com/matloff/fastStat, a quick introduction to statistics for those who’ve had a calculus-based probability course. Many such people later need to do statistics, and this will give them quick access. It is modeled after my R tutorial, https://github.com/matloff/... Continue reading: New Statistics Tutorial http://dlvr.it/Sg9yl7

Using Emacs for R

Easy Emacs: To start using R, or almost anything else, in Emacs, you basically need to know 3 things: 1) how to move in Emacs, meaning understanding what is what and learning a few key commands; 2) what the configuration file is and how to use it; and 3) H... Continue reading: Using Emacs for R http://dlvr.it/Sg9KRr

Load Multiple Packages in R

The post Load Multiple Packages in R appeared first on Data Science Tutorials. What do you have to lose? Check out Data Science Tutorials here. Load Multiple Packages in R: The following example demonstrates how to apply this syn... Continue reading: Load Multiple Packages in R http://dlvr.it/Sg6Zs2

Bayesian workflow: Prior determination, predictive checks and sensitivity analyses

This post presents a code-through of a Bayesian workflow in R, which can be reproduced using the materials at https://osf.io/gt5uf. The content is closely based on Bernabeu (2022), which was in turn based on lots of other references. In addition to those, you may wish to consider ... Continue reading: Bayesian workflow: Prior determination, predictive checks and sensitivity analyses http://dlvr.it/Sg4Xkb

Plotting two-way interactions from mixed-effects models using ten or six bins

Whereas the direction of main effects can be interpreted from the sign of the estimate, the interpretation of interaction effects often requires plots. This task is facilitated by the R package sjPlot (Lüdecke, 2022). In Bernabeu (2022), the sjPlot ... Continue reading: Plotting two-way interactions from mixed-effects models using ten or six bins http://dlvr.it/Sg1z3N

Twitcher II: tweet frequency and top tweets

Another post looking at Twitter data in R. It follows this one and this one. I wanted to look again at my tweeting frequency over the 12 years on Twitter, but this time do it in a calendar view. Something like a GitHub commit calendar would be perfect. I have used ... Continue reading: Twitcher II: tweet frequency and top tweets http://dlvr.it/SfyktT

Were there too many unlikely results at the FIFA World Cup 2022 in Qatar?

FIFA World Cup 2022 in Qatar saw many surprising results. In fact, too many – some would argue. From the unbelievable loss of Argentina to Saudi Arabia at the very beginning of the group stage, via the loss of the magnificent Brazil to Cameroon ... Continue reading: Were there too many unlikely results at the FIFA World Cup 2022 in Qatar? http://dlvr.it/SfxKYd

Computing Win-Probability of T20 matches

I am late to the ‘Win probability’ computation for T20 matches, but managed to jump on to this bus with this post. Win Probability analysis and computation have been around for some time and are used in baseball, NFL, soccer, hockey and others. On T20 cricket, the following posts from ... Continue reading: Computing Win-Probability of T20 matches http://dlvr.it/SfvzS7

What is survival analysis? Examples by hand and in R

Contents: Introduction; What is survival analysis?; Why do we need special methods for survival analysis?; Common functions in survival analysis; Survival function; Cumulative hazard function; Hazard function; Estimation; By hand; In R; Hypothesis testing; Log-ran... Continue reading: What is survival analysis? Examples by hand and in R http://dlvr.it/Sft3pC

R-Ladies Cologne – Our first year in the books!

Thank you all for a fantastic year at R-Ladies Cologne! We had a series of great events which gave us the chance to meet so many wonderful and kindhearted people across the globe. This is one thing that always makes me so thankful to be part of the com... Continue reading: R-Ladies Cologne – Our first year in the books! http://dlvr.it/SfqTW3

Export in Bananen in Tonnen von 1994-2005 (Banana exports in tonnes from 1994-2005)

A friend who doesn’t use the Tidyverse sent me this very nice plot. My first intuition to obtain the data for this unidentified plot was to go to FAO, and it was there! I went to FAO Stat, filtered the countries and years seen in the plot and I ... Continue reading: Export in Bananen in Tonnen von 1994-2005 (Banana exports in tonnes from 1994-2005) http://dlvr.it/SfqTFD

Code longevity of the R programming language

I’ve been working on a way to evaluate how well old R code runs on the current version of R, and am now ready to share some results. It all started with a tweet. The problem is that you have to find old code lying around. Some people have found ... Continue reading: Code longevity of the R programming language http://dlvr.it/SfmZFL

{drawCell} – R package and Shiny app for drawing cell structures

Are you a life sciences educator looking to engage students with interactivity or a student needing to draw and label cells? Are you a researcher looking to offload the tedium of data visualization? There’s an R solution for you: drawCell! This tool provides a convenient, engaging solution for educators ... Continue reading: {drawCell} – R package and Shiny app for drawing cell structures http://dlvr.it/SfjZ6Y

Hierarchical clustering, cutting the tree and colouring the tree leaves based on sample classes

Unsupervised machine learning methods such as hierarchical clustering allow us to discover the trends and patterns of similarity within the data. Here I demonstrate, using test data, how to apply hierarchical clustering to the columns of a test da... Continue reading: Hierarchical clustering, cutting the tree and colouring the tree leaves based on sample classes http://dlvr.it/SffVQk

demographeR’s notes

2022: No matching items. 2021: No matching items. 2020: No matching items. 2019: No matching items. 2018: No matching items. 2017: February 5, 2017 ... Continue reading: demographeR’s notes http://dlvr.it/SfbSzw

Still young, ChatGPT surprises with its abilities

While the internet is flooded with mentions of what ChatGPT failed to do, what it can do is surprisingly advanced. Below are two such examples. This is not an argument that validates (or invalidates) the abilities of the AI. Couple of de... Continue reading: Still young, ChatGPT surprises with its abilities http://dlvr.it/SfbSly

R packages for visualising spatial data

Throughout November I took part in the #30DayMapChallenge - a daily mapping, cartography, and data visualization challenge aimed at the spatial community. You can read my recap of the challenge in the blog I wrote last week, if you’re intereste... Continue reading: R packages for visualising spatial data http://dlvr.it/SfYHmP

Google Analytics in R: Review of 2022

Contents: Introduction; Prerequisites; Analytics; Page views over time; Page views per month and year; Top performing pages; Page views by country; Page views per day of week; Thank you note. Introduction: It is almost the end of the year, which means it is tim... Continue reading: Google Analytics in R: Review of 2022 http://dlvr.it/SfYHWh

SQL Databases for Students and Educators

Publicly accessible databases often impose query limits or require registration. I’ve decided to host several light/medium-size databases using PostgreSQL at databases.pacha.dev. If you need access to the online DBs (i.e., not configuring your local copy... Continue reading: SQL Databases for Students and Educators http://dlvr.it/SfWBDn

Gosling: Interactive Genomics Charts in R Shiny

Visualizing genomics data is not as straightforward as you would think. The whole industry is extremely complex and suffers from a major problem – specialization creep. There are so many niches in genomics, meaning one library or service can’t cover the entire spectrum of genomics data visualization needs. That is, ... Continue reading: Gosling: Interactive Genomics Charts in R Shiny http://dlvr.it/SfSKfq

{Proverbs} Now Has Support for ESV Translation

By: Brad Lindblad. The proverbs R package now has support for the incredibly popular ESV Bible translation. View your daily proverb in this fluent and literal translation from the comfort of your R session. You do h... Continue reading: {Proverbs} Now Has Support for ESV Translation http://dlvr.it/SfPh6G

Progress on R-spatial evolution, Dec 2022

Contents: Package dependency tables; sp evolution status; Using spdep as a testbed; Deprecations in retiring packages; Script analysis from The Dataverse Project. Summary: This is the second report on the R-spatial evolution project. The... Continue reading: Progress on R-spatial evolution, Dec 2022 http://dlvr.it/SfMcmY

Advent of 2022, Day 14 – Registering the models

In the series of Azure Machine Learning posts: An important asset is “Models” in the navigation bar. This feature allows you to work with different model types: custom, MLflow, and Triton. What you do here is register a model… Continue reading: Advent of 2022, Day 14 – Registering the models http://dlvr.it/SfLcYs

2022-03 Offsetting Lines and Polygons in ‘grid’

This document describes new functions in the ‘gridGeometry’ package for R that generate offset regions and Minkowski Sums for lines and polygons. Jack Wong and ... Continue reading: 2022-03 Offsetting Lines and Polygons in ‘grid’ http://dlvr.it/SfKp9v

Touching the 3rd Rail of Data Science: “R or Python?”

I’ve been seeing a lot of hot takes on whether one should do data science in R or in Python. I’ll comment generally on the topic, and then add my own myopic gear-head micro benchmark. I’ll jump in: If learning the language is the big step: then ... Continue reading: Touching the 3rd Rail of Data Science: “R or Python?” http://dlvr.it/SfKXdf

Advent of 2022, Day 13 – Automated ML

In the series of Azure Machine Learning posts: Automated ML is a no-code automated machine learning task. It iterates over many combinations of algorithms and hyperparameters in order to find the best model for your dataset and your prediction variable(s)… Continue reading: Advent of 2022, Day 13 – Automated ML http://dlvr.it/SfJTrs

Modeling the secular trend in a stepped-wedge design

Recently I started a discussion about modeling secular trends using flexible models in the context of cluster randomized trials. I’ve been motivated by a trial I am involved with that is using a stepped-wedge study design. The initial post focused o... Continue reading: Modeling the secular trend in a stepped-wedge design http://dlvr.it/SfGytL

Hillshade, colors and marginal plots with tidyterra (II)

Add marginal plots to a SpatRaster map on ggplot2 Continue reading: Hillshade, colors and marginal plots with tidyterra (II) http://dlvr.it/SfG6sd

Redesigning Dashboards with Shiny and Rhino: World Bank’s Carbon Pricing

The modern revolution of big data has led to a boom in business intelligence. But data is only as valuable as the insights it provides. That means the true value of data is in exposing hidden insights. But if the way you share your data is slow, unscalable, or poorly ... Continue reading: Redesigning Dashboards with Shiny and Rhino: World Bank’s Carbon Pricing http://dlvr.it/SfCPfL

MIDI madness with ChatGPT: the AI-powered tunes that will make you laugh, cry, and dance

ChatGPT seems to be taking the world by storm. This version of the GPT3 language model, which is somehow optimised for chat, dominates my Mastodon feed and has inspired countless articles and discussions. A decent chunk of the discourse has been about how the outputs of the models sound very ... Continue reading: MIDI madness with ChatGPT: the AI-powered tunes that will make you laugh, cry, and dance http://dlvr.it/SfCBpk

Installing Quarto-CLI on Linux-arm64 systems

Contents: Testing Platform; Installing Dependencies; Installing Quarto. Quarto is Posit’s new open-source scientific and technical publishing system built on Pandoc that allows you to weave together narrative text and code to produce high-quality outputs including reports, presentations, websites, and more. It’s the successor of the highly successful, but R ... Continue reading: Installing Quarto-CLI on Linux-arm64 systems http://dlvr.it/Sf7n3G

The effect of Childhood Education on Wealth: Modeling with Bayesian Additive Regression Trees (BART)

Recently, I read a tweet reporting that children who grow up in poor conditions earn much less as adults than those who grow up in better conditions. I believe one of the best ways to check that is to compare childhood education participation rates with wealth in related countries; in order to do ... Continue reading: The effect of Childhood Education on Wealth: Modeling with Bayesian Additive Regression Trees (BART) http://dlvr.it/Sf4zHz

How to test the significance of a mediation effect

The post How to test the significance of a mediation effect appeared first on Data Science Tutorials. What do you have to lose? Check out Data Science Tutorials here. How to test the significance of a mediation effect? How to perform a Sobel test in R is covered ... Continue reading: How to test the significance of a mediation effect http://dlvr.it/Sf21w7

How to Save ggplot2 Plots in a targets Workflow?

I really enjoy using targets for all of my data analysis projects, especially because it helps me structure all of the projects nicely in the same folder. For targets projects, I often produce several figures using ggplot2. However, there are no formal recommendations for saving ggplot2 objects (as opposed to ... Continue reading: How to Save ggplot2 Plots in a targets Workflow? http://dlvr.it/SdyyZz

Just How Good Is ChatGPT in Data Science?

Many of you may have heard of ChatGPT, a dazzling new AI tool. We are hearing lots of gushing praise for the tool. Well, how well does it do in data science contexts? I tried a few queries here, and found interesting results. I first requested, “Write an R function ... Continue reading: Just How Good Is ChatGPT in Data Science? http://dlvr.it/SdvwkD

Comparison of Partition Around Medoid R programming Implementations

Back in September 2016 I implemented the ClusterR package. One of the algorithms included in ClusterR was the ‘Partition Around Medoids’ (Cluster_Medoids) algorithm which was based on the paper “Anja Struyf, Mia Hubert, Peter J. Rousseeuw, (F... Continue reading: Comparison of Partition Around Medoid R programming Implementations http://dlvr.it/SdrtZM

Gaussian Process Regression for FEA Designed Experiments – Building the Basics in R

A Google search for ‘Gaussian Process Regression’ returns some intimidating material for a non-statistician. After filtering away the obscure stuff I’ll never understand and digging around within the code that makes GPR happen, I’m proud to say that I feel I’ve gotten my arms around the basics ... Continue reading: Gaussian Process Regression for FEA Designed Experiments – Building the Basics in R http://dlvr.it/SdpjTC

A Community Gathering in Oxford for Learning about R and Networking Opportunities

The Oxford R useR Group organizers are leading the R meetup effort in Oxford, UK. The group has over 600 subscribers and brings together about 20 R enthusiasts monthly in... The post A Community Gathering in Oxford for Learning about R and Networking Opportunities appeared first on R Consortium. Continue reading: A Community Gathering in Oxford for Learning about R and Networking Opportunities http://dlvr.it/SdmcmJ

The coefficient of determination: is it the R-squared or r-squared?

We often use the coefficient of determination as a swift ‘measure’ of goodness of fit for our regression models. Unfortunately, there is no unique symbol for such a coefficient, and both \(R^2\) and \(r^2\) are used in the literature, almost interchangea... Continue reading: The coefficient of determination: is it the R-squared or r-squared? http://dlvr.it/SdjmBn

How to perform TBATS Model in R

The post How to perform TBATS Model in R appeared first on Data Science Tutorials. What do you have to lose? Check out Data Science Tutorials here. How to perform TBATS Model in R? The term “TBATS” refers to a well-liked time series forecasting technique and stands ... Continue reading: How to perform TBATS Model in R http://dlvr.it/SdfhGk