Posts

Showing posts from July 2022

Some takeaways from rstudio::conf 2022

Last week I participated in the annual RStudio conference, which took place in Washington DC. As always, a great experience, with a lot of hex stickers. (Me at rstudio::conf 2022, in front of a wall of hex stickers) Here are some of my takea... Continue reading: Some takeaways from rstudio::conf 2022 http://dlvr.it/SVqwkh

Soccer Analytics Tutorial: Scraping EPL Data using R (2022 Update)

This is an update to last year's web scraping tutorial. I focus on getting data from my beloved fbref.com for Manchester City, the English Premier League 2021-22 champions. This guide uses R programming. Continue reading: Soccer Analytics Tutorial: Scraping EPL Data using R (2022 Update) http://dlvr.it/SVqRsG

Access and map the latest monkeypox case data in R

The ongoing monkeypox outbreak in many countries outside of the endemic regions in Africa has drawn significant attention over the past months. Government agencies as well as academic and industry teams have released open-access databases to track c... Continue reading: Access and map the latest monkeypox case data in R http://dlvr.it/SVqBtL

Best Books for Data Analytics

The post Best Books for Data Analytics appeared first on finnstats. If you are interested in learning more about data science, you can find more articles at finnstats. Are you seeking the best books about data analytics? If so, your quest is over. We’... Continue reading: Best Books for Data Analytics http://dlvr.it/SVpYxx

Ice Cream for R Programmers

What ice cream is, and why. This post, perhaps contrary to the headline, will not be about eating ice cream. It will, however, be about something else we all do, and do too often even though we shouldn’t: debugging with a pri... Continue reading: Ice Cream for R Programmers http://dlvr.it/SVn1s6

Survivor Confessionals Data: Dataset showcase for {survivoR}

Confessionals loosely represent a player’s screen time where they talk strategy and replay events. It is an imperfect measure but […] The post Survivor Confessionals Data: Dataset showcase for {survivoR} appeared first on Dan Oehm | Gradient Descending. Continue reading: Survivor Confessionals Data: Dataset showcase for {survivoR} http://dlvr.it/SVmj8W

June 2022: “Top 40” New CRAN Packages

One hundred eighty-nine new packages made it to CRAN in June. Here are my “Top 40” selections in eleven categories: Computational Methods, Data, Ecology, Genomics, Machine Learning, Mathematics, Medicine, Statistics, Time Series, Utilities, and Visualizations. Computational Methods itp v1.2.0: Implements the interpolate, truncate, project root-finding algorithm developed by Oliveira & Takahashi (2021). The ... Continue reading: June 2022: “Top 40” New CRAN Packages http://dlvr.it/SVlDKL

Comparing Plastic Pollution: Modeling with Tidymodels and Variable Importance

Plastic waste is one of the dominant sources of environmental pollution in the modern industrial era. The most dangerous part of this kind of pollution is mismanaged plastic waste, which eventually ends up in the ocean. Our World in Data, ... Continue reading: Comparing Plastic Pollution: Modeling with Tidymodels and Variable Importance http://dlvr.it/SVkybN

How to extract a time series subset in R?

The post How to extract a time series subset in R? appeared first on finnstats. This article will teach you h... Continue reading: How to extract a time series subset in R? http://dlvr.it/SVkdBP
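
As a rough illustration of the topic (not necessarily the approach the linked article takes), base R's window() can cut a slice out of a ts object. The series name and values below are made up for the sketch.

```r
# Hypothetical monthly series: Jan 2020 to Dec 2021, values 1..24
monthly <- ts(1:24, start = c(2020, 1), frequency = 12)

# Keep only June 2020 through March 2021
subset_ts <- window(monthly, start = c(2020, 6), end = c(2021, 3))

print(subset_ts)
```

For a monthly series, window() takes start and end as c(year, month) pairs, so the slice is chosen by date rather than by index position.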

Bioinformatics Analysis of Omics Data with the Shell & R

Summary: In this tutorial, participants will learn about biological data analysis with R and Unix/Linux tools. We begin with an introduction to bioinformatics and omics data analysis, and conclude with a walkthrough of a simple bioinformatics workflow for aligning transcriptomic sequences with genomic data. Questions: What is bioinformatics? What ... Continue reading: Bioinformatics Analysis of Omics Data with the Shell & R http://dlvr.it/SVhPq6

How to Set Axis Limits in ggplot2?

The post How to Set Axis Limits in ggplot2? appeared first on finnstats. In ggplot2, you will often need to set the axis limits of a plot. ... Continue reading: How to Set Axis Limits in ggplot2? http://dlvr.it/SVgrjd

An introductory workshop in Shiny, August 3rd to 5th

This workshop aims to introduce people with basic R knowledge to developing interactive web applications with the Shiny framework. The course consists of a one-hour session, where we will demonstrate basic UI, reactive UI, CSS personalization and dashb... Continue reading: An introductory workshop in Shiny, August 3rd to 5th http://dlvr.it/SVgrfw

Announcing Quarto, a new scientific and technical publishing system

Today we’re excited to announce Quarto, a new open-source scientific and technical publishing system. Quarto is the next generation of R Markdown, and has been re-built from the ground up to support more languages and environments, as well as to ... Continue reading: Announcing Quarto, a new scientific and technical publishing system http://dlvr.it/SVgSnk

Posit – Why Rstudio is changing its name

RStudio has officially announced a name change from RStudio to Posit. Why did RStudio change its name to Posit? For the past few years, Posit (formerly RStudio) has been shifting from R-exclusive tooling to a language-agnostic ecosystem. Much to our enjoyment, we’ve seen the RStudio IDE grow to ... Continue reading: Posit – Why Rstudio is changing its name http://dlvr.it/SVdsSs

How to Use the sweep Function in R?

The post How to Use the sweep Function in R? appeared first on finnstats. The sweep() function in R allows you to carry out various ... Continue reading: How to Use the sweep Function in R? http://dlvr.it/SVd4rL
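
To make the teaser concrete, here is a small sketch of two common uses of base R's sweep(): centering columns and turning rows into shares. The matrix is made up for illustration and is not taken from the linked post.

```r
# Hypothetical 2 x 3 matrix, filled column by column
m <- matrix(1:6, nrow = 2)

# Subtract each column's mean from that column (FUN defaults to "-")
centered <- sweep(m, 2, colMeans(m))

# Divide each row by its row total, so every row sums to 1
row_share <- sweep(m, 1, rowSums(m), "/")

print(centered)
print(row_share)
```

The second argument selects the margin (1 for rows, 2 for columns), and the third supplies one statistic per row or column to be swept out.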

RStudio is becoming Posit

Today we are very excited to announce that RStudio has a new name: Posit. This is a big change, and below we’ll talk about exactly why we are doing this and what it means. But first—why Posit? Posit is a real word that means to put forth an... Continue reading: RStudio is becoming Posit http://dlvr.it/SVcVKX

How to Set Up Quarto with Docker, Part 1: Static Content

Learn how to work with Quarto inside a Docker container so that you can render and serve HTML documents and projects with ease. Continue reading: How to Set Up Quarto with Docker, Part 1: Static Content http://dlvr.it/SVc6wm

Programming a simple minimax chess engine in R

Why write a chess engine in R? Chess has always fascinated me. This is a game with simple rules that takes a lot of practice to master. No information is hidden from the players: the game state is clearly visible on the board, but strategie... Continue reading: Programming a simple minimax chess engine in R http://dlvr.it/SVYXsn

How much has COVID cost the NYC Subway system in “lost fares”?

With things in NYC beginning to return to normal after two years of COVID, I found myself thinking about how much money was lost in Subway fares during the 2+ years when people were working from home. Seeing an opportunity to mess around with some forecasting packages, I set out to ... Continue reading: How much has COVID cost the NYC Subway system in “lost fares”? http://dlvr.it/STrVhX

How to do Conditional Mutate in R?

The post How to do Conditional Mutate in R? appeared first on Data Science Tutorials. It’s common to want to add a new variable to an existing data frame based on a condition. The mutate() and case_when() functions from the dplyr ... Continue reading: How to do Conditional Mutate in R? http://dlvr.it/STr4lm
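
A minimal sketch of the pattern the teaser describes, assuming the dplyr package is installed. The data frame, column names, and cut-offs are invented for illustration and do not come from the linked post.

```r
library(dplyr)

# Hypothetical data: one row per player
scores <- data.frame(
  player = c("a", "b", "c"),
  points = c(5, 15, 25)
)

# Add a new column whose value depends on a condition;
# case_when() checks the conditions in order and uses the first match
scores <- scores |>
  mutate(level = case_when(
    points < 10 ~ "low",
    points < 20 ~ "mid",
    TRUE        ~ "high"
  ))

print(scores)
```

Because the conditions are evaluated top to bottom, the final `TRUE ~ "high"` line acts as the fallback for rows that match nothing earlier.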

How to write a function in R and apply it to a data frame using map functions from {purrr}

Introduction: In this post, the European Flights data set is used to illustrate defining a function in R and applying it to a data frame using map functions from {purrr}. The full source for this blog post is available on GitHub. Setup: Loading the R l... Continue reading: How to write a function in R and apply it to a data frame using map functions from {purrr} http://dlvr.it/STpYzh

Indy UseR Growing as New Businesses like Insurance and Pharmaceuticals Move into Indianapolis

Shankar Vaidyaraman and Derrick Kearney sit down with R Consortium to talk about how the Indy UseR Group coped with the pandemic. They started moving towards online events early, how... The post Indy UseR Growing as New Businesses like Insurance and Pharmaceuticals Move into Indianapolis appeared first on R Consortium. Continue reading: Indy UseR Growing as New Businesses like Insurance and Pharmaceuticals Move into Indianapolis http://dlvr.it/STnMvf

rOpenSci Code of Conduct Update

We are pleased to announce the release of Version 2.4 of our Code of Conduct with changes based on the feedback of our community. What’s new? Greater detail about acceptable and unacceptable behaviors in online settings. First translation of the tex... Continue reading: rOpenSci Code of Conduct Update http://dlvr.it/STmtr1

Random Forest Machine Learning Introduction

The post Random Forest Machine Learning Introduction appeared first on Data Science Tutorials. We frequently use non-linear approaches to model the relationship between a set of predictor variables and a response variable when that relationship is extremely complex. Classification and regression trees, often known ... Continue reading: Random Forest Machine Learning Introduction http://dlvr.it/STlr9Z

Input R Shiny: Shiny Input Examples with shiny.fluent

Creating a beautiful, professional, and user-approved UI plays a vital role in the success of any project. Applications developed using Shiny are no exception. The Shiny package comes with its own collection of inputs available for the user interface. But sometimes you need a little extra. A dash of the ... Continue reading: Input R Shiny: Shiny Input Examples with shiny.fluent http://dlvr.it/STk6nk

shapviz goes H2O

The "shapviz" package now plays well together with H2O. Continue reading: shapviz goes H2O http://dlvr.it/STjGNm

How to Use Mutate function in R

The post How to Use Mutate function in R appeared first on Data Science Tutorials. This article demonstrates how to add additional variables to a data frame using the mutate() function in R. ... Continue reading: How to Use Mutate function in R http://dlvr.it/SThGrn

Combine project management and data analysis using Tiddlywiki and RMarkdown

I have been looking for a simple and flexible tool to manage projects, take notes, and track results of data analysis with R, e.g. EverNote, OneNote, Trac, JIRA, github, RMarkdown, bookdown, etc. Recently, Tiddlywiki and Projectify popped up from rando... Continue reading: Combine project management and data analysis using Tiddlywiki and RMarkdown http://dlvr.it/SThGqK

Multi-state survival modeling of a Jira issues snapshot

Work items in a formal development process progress through a series of stages, e.g., starting at Open, perhaps moving to Withdrawn or Merged with another item, eventually reaching Development, and finishing at Done (with a few being Reopened, i.e., moving back to the start of the process). This ... Continue reading: Multi-state survival modeling of a Jira issues snapshot http://dlvr.it/STgg2l

The Ascribed Advantage

How much more do men earn doing the same job as women? Continue reading: The Ascribed Advantage http://dlvr.it/STfLd0

Convert Multiple Columns to Numeric in R

The post Convert Multiple Columns to Numeric in R appeared first on Data Science Tutorials. Using the dplyr package, you can convert multiple columns to numeric using the following techniques. The examples that follow demonstrate each technique in action. ... Continue reading: Convert Multiple Columns to Numeric in R http://dlvr.it/STdRxF

Rcpp 1.0.9 on CRAN: Regular Updates

The Rcpp team is pleased to announce the newest release 1.0.9 of Rcpp which hit CRAN late yesterday, and has been uploaded to Debian as well. Windows and macOS builds should appear at CRAN in the next few days, as will builds in different Linux distr... Continue reading: Rcpp 1.0.9 on CRAN: Regular Updates http://dlvr.it/STcfyT

Colorful R Plots with Wes Anderson Palettes – Pirate Ships

Overview Adding color to your plots is a great way to make them more visually appealing and informative. Not to mention the fun you can have playing with color palettes that have been made for ggplot2, like the Wes Anderson palette in the wesanderson R package by karthik. In this ... Continue reading: Colorful R Plots with Wes Anderson Palettes – Pirate Ships http://dlvr.it/STb1M2

Containerizing Interactive R Markdown Documents

R Markdown is a reproducible authoring format supporting dozens of static and dynamic output formats. Let's review why and how you should containerize Rmd files. Continue reading: Containerizing Interactive R Markdown Documents http://dlvr.it/STb1HT

Select the First Row by Group in R

The post Select the First Row by Group in R appeared first on Data Science Tutorials. When using the dplyr package in R, you may often want to select the first row in each group. To do this, use the simple syntax shown ... Continue reading: Select the First Row by Group in R http://dlvr.it/STYfkN

How to Create a Frequency Table by Group in R?

The post How to Create a Frequency Table by Group in R? appeared first on Data Science Tutorials. To produce a frequency table by group in R, use the following functions from the dplyr package. ... Continue reading: How to Create a Frequency Table by Group in R? http://dlvr.it/STYffR

How to Calculate Relative Frequencies in R?

The post How to Calculate Relative Frequencies in R? appeared first on Data Science Tutorials. In R, you will often need to calculate the relative frequencies (proportions) of values in one or more columns of a data frame. ... Continue reading: How to Calculate Relative Frequencies in R? http://dlvr.it/STWVvg
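
For orientation, a base R sketch of the same idea. The linked post may well use dplyr instead, and the data frame and column name here are hypothetical.

```r
# Hypothetical data frame with one categorical column
df <- data.frame(team = c("A", "A", "B", "C", "C", "C"))

# Absolute counts per value
counts <- table(df$team)

# Relative frequencies: each count divided by the total
rel_freq <- prop.table(counts)

print(rel_freq)
```

prop.table() divides each count by the total, so the resulting proportions always sum to 1.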

How many languages do we need to learn about responsible machine learning? useR! 2022 Conference

It might seem that we don’t have much choice, because the most popular languages in data science are R and Python or, if you prefer, Python and R. But today we are not talking about these languages! During the useR! 2022 conference, we can meet with Engli... Continue reading: How many languages do we need to learn about responsible machine learning? useR! 2022 Conference http://dlvr.it/STW0ZS

One-sample Wilcoxon test in R

In a previous article, we showed how to do a two-sample Wilcoxon test in R. Remember that there are actually two versions of this test: The Mann-Whitney-W... Continue reading: One-sample Wilcoxon test in R http://dlvr.it/STW0Tq
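
As a quick sketch of what a one-sample Wilcoxon (signed-rank) test looks like in base R: the sample values and the hypothesized location mu = 1 below are invented for illustration and are not from the linked article.

```r
# Hypothetical sample of 9 measurements
x <- c(1.83, 0.50, 1.62, 2.48, 1.68, 1.88, 1.55, 3.06, 1.30)

# Two-sided test of whether the distribution of x is
# symmetric around the hypothesized location mu = 1
res <- wilcox.test(x, mu = 1)

print(res)
```

Passing a single sample together with mu tells wilcox.test() to run the one-sample signed-rank test rather than the two-sample version.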

How I Use Stories to Share Data at Meetings

Ryan Estrellado is a writer and educator. He is the author of the book The K–12 Educator’s Data Guidebook: Reimagining Practical Data Use in Schools and a co-author of Data Science in Education Using R. Ryan tells inspiring stories about the realit... Continue reading: How I Use Stories to Share Data at Meetings http://dlvr.it/STT2lf

Get tickets for An introductory course in Shiny on July 11th & 13th or 12th & 14th at 30 USD

This course aims to introduce people with basic R knowledge to developing interactive web applications using the Shiny framework. The course consists of two days, with a one-hour session per day, where we will discuss topics s... Continue reading: Get tickets for An introductory course in Shiny on July 11th & 13th or 12th & 14th at 30 USD http://dlvr.it/STSkQw

Readable code with base R (part 2)

Combine startsWith and endsWith with R’s pipe: In my first post, startsWith and endsWith were presented. In combination with R’s pipe operator, we can improve the readability even further. w startsWith("H... Continue reading: Readable code with base R (part 2) http://dlvr.it/STSK22
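
A minimal sketch of the idea in the teaser above, combining startsWith() and endsWith() with R's native pipe (available from R 4.1). The vector `words` and its values are made up for illustration, since the original example is cut off.

```r
# Hypothetical character vector
words <- c("Hello", "World", "Help", "Yellow")

# Each call returns a logical vector, one element per word
starts_h <- words |> startsWith("H")
ends_o   <- words |> endsWith("o")

# Keep the words that satisfy both conditions
matches <- words[starts_h & ends_o]

print(matches)
```

Written this way, each line reads left to right: take the words, then check how they start or end.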

Reshaping data frames using pivot functions from {tidyr} and tally from {dplyr}

Introduction: In this post, the San Francisco Rentals data set is used to demonstrate data reshaping in R. This involves changing the number of columns and rows in a data frame to fit a given use case. A data frame is made taller or narrower by decr... Continue reading: Reshaping data frames using pivot functions from {tidyr} and tally from {dplyr} http://dlvr.it/STQ5gC

RStudio Recap From the Appsilon Shiny Conference

In April 2022, our Full Service partner Appsilon hosted the first-ever Shiny Conference. The conference comprised three days of free, online Shiny content ranging from tips and tricks from the experts, to fascinating community case studies, to examples... Continue reading: RStudio Recap From the Appsilon Shiny Conference http://dlvr.it/STPqk5

Get tickets for An introductory course in Shiny on July 6th and 8th at 10 USD

The remaining tickets for An introductory course in Shiny on July 6th and 8th are for sale at a discounted price (10 USD). Check https://www.buymeacoffee.com/pacha/e/77320. Continue reading: Get tickets for An introductory course in Shiny on July 6th and 8th at 10 USD http://dlvr.it/STPqgY

Virtual Events Opened New Horizons for R-Ladies Dallas

R-Consortium talked to Dr. Sydeaka Watson of the Dallas, Texas chapter of R-Ladies Global about turning the challenges of the pandemic into opportunities for the group. She shared the initial... The post Virtual Events Opened New Horizons for R-Ladies Dallas appeared first on R Consortium. Continue reading: Virtual Events Opened New Horizons for R-Ladies Dallas http://dlvr.it/STPWJV

Bayes Rules! [book review]

Bayes Rules! is a new introductory textbook on Applied Bayesian Model(l)ing, written by Alicia Johnson (Macalester College), Miles Ott (Johnson & Johnson), and Mine Dogucu (University of California Irvine). Textbook sent to me by CRC Press for review. It is available (free) online as a website and has a ... Continue reading: Bayes Rules! [book review] http://dlvr.it/STLjC1

Probabilistic forecasting for the UEFA Women’s Euro 2022

Using a consensus model based on quoted bookmakers' odds, winning probabilities for all competing teams in the UEFA Women's Euro are obtained: The favorite is Spain, followed by host England, France, and the Netherlands as the defending champ... Continue reading: Probabilistic forecasting for the UEFA Women’s Euro 2022 http://dlvr.it/STJlgt

Nelson-Siegel-Svensson Yield Curve model using R code

This post introduces the Nelson-Siegel-Svensson (NSS) yield curve model, which is an extension of the Nelson-Siegel (NS) model with an additional curvature factor. It aims to fit longer-term maturities well. Nelson-Siegel-Svensson model Svens... Continue reading: Nelson-Siegel-Svensson Yield Curve model using R code http://dlvr.it/STHsjb

Slight inconsistency between forcats’ fct_lump_min and fct_lump_prop

I recently noticed a slight inconsistency between the forcats package’s fct_lump_min and fct_lump_prop functions. (I’m working with v0.5.1, which is the latest version at the time of writing.) These functions lump levels that meet certain criteria into an “other” … Continue reading: Slight inconsistency between forcats’ fct_lump_min and fct_lump_prop http://dlvr.it/STGD7P

Model-Based Causal Forests for Heterogeneous Treatment Effects

A new arXiv paper investigates which building blocks of random forests, especially causal forests and model-based forests, make them work for heterogeneous treatment effect estimation, both in randomized trials and observational studies. ... Continue reading: Model-Based Causal Forests for Heterogeneous Treatment Effects http://dlvr.it/STDgzc

A Major Contribution to Learning R

Prominent statistician Frank Harrell has come out with a radically new R tutorial, rflow. The name is short for “R workflow,” but I call it “R in a box”: everything one needs for beginning serious usage of R, starting from little or no background. By serious usage I mean real ... Continue reading: A Major Contribution to Learning R http://dlvr.it/STCb5t

Evaluating GitHub Activity for Contributors

Say you have a bug report or feature request to make to a package. How can you use information on GitHub to manage your expectations (will there be a quick fix) and actions (should you go ahead and fork the repository)? In this post, we shall go over ... Continue reading: Evaluating GitHub Activity for Contributors http://dlvr.it/ST9R9D

An introductory course in Shiny – July sessions

I’m offering this course for learning Shiny, at a 50% discount for 48 hours. This course aims to introduce people with basic R knowledge to developing interactive web applications using the Shiny framework. The course consists of two da... Continue reading: An introductory course in Shiny – July sessions http://dlvr.it/ST90M3