Posts

Showing posts from January 2022

What song lyric sentiment analysis doesn’t tell you

This is an exercise in trying out the rolldown package. Continue reading: What song lyric sentiment analysis doesn’t tell you http://dlvr.it/SJ8qwh

#RObservations #24: Using Tesseract-OCR to Scan Bank Documents and Extract Relevant Data

Introduction: I have clearly been out of the loop because I have only recently learned about the tesseract library in R. If I had known about it earlier I would have written about it much sooner! The tesseract library is a package which has bindings to the Tesseract-OCR engine: a powerful ... Continue reading: #RObservations #24: Using Tesseract-OCR to Scan Bank Documents and Extract Relevant Data http://dlvr.it/SJ8G7r

Murcia R Users Group (UMUR) in Spain Didn’t Let the Pandemic Break its Momentum

R Consortium recently talked with Aurora González Vidal of UMUR Asociación de Usuarios de R Murcia (Also on Twitter). She covered the historic involvement of Murcia in the evolution of... The post Murcia R Users Group (UMUR) in Spain Didn’t Let the Pandemic Break its Momentum appeared ... Continue reading: Murcia R Users Group (UMUR) in Spain Didn’t Let the Pandemic Break its Momentum http://dlvr.it/SJ7qsd

A Blend of Package Build Failures

The rOpenSci R-universe is a bit special as, compared to other R-universes, it builds docs for all the packages in our suite. Looking at the dashboard helps us identify failures in building the packages as well as in building the pkgdown websites. We ... Continue reading: A Blend of Package Build Failures http://dlvr.it/SJ6pmT

Trying to reproduce a map of the greater Paris area from APUR with ggplot2, a moment for coloring

In 2016, APUR, the Paris urbanism agency, published a map that described the new Greater Paris area. At the time I was interested in reproducing this map in R with official data/shapefiles and ggplot2. A moment for coloring. I put this code ... Continue reading: Trying to reproduce a map of the greater Paris area from APUR with ggplot2, a moment for coloring http://dlvr.it/SJ5KqF

SQL for Data Science Beginners Guide

The post SQL for Data Science Beginners Guide appeared first on finnstats. SQL, or Structured Query Language, can be used by data science professionals to retrieve, manipulate and ... Continue reading: SQL for Data Science Beginners Guide http://dlvr.it/SJ4Sgz

Announcing the Summer Institute in Computational Social Science (Covenant University, Nigeria)

SICSS-Nigeria is accepting applications to participate in the two-week-long Summer Institute in Computational Social Science that will take place on June 19-29, 2022, at Covenant University (Nigeria). SICSS-Nigeria will bring together postgraduate students, early-career academics and researchers, as well as junior officers in statistical offices, government ministries, and agencies. Application/registration ... Continue reading: Announcing the Summer Institute in Computational Social Science (Covenant University, Nigeria) http://dlvr.it/SJ2tFF

How to Win the RStudio Shiny Contest

This is a guest post from Marcin Dubel, a 2021 Shiny Contest Grand Prize winner and Software Engineer at Appsilon, a Full Service RStudio Partner. RStudio’s 4th annual Shiny Contest is just around the corner. As a winner of last year’s contest with S... Continue reading: How to Win the RStudio Shiny Contest http://dlvr.it/SJ2ZcD

Hopf torus with dynamic colors

In a recent post I explained how to decorate a surface with moving colors with the Python library PyVista. Here I present this method for the R package rgl. I will take a Hopf... Continue reading: Hopf torus with dynamic colors http://dlvr.it/SJ1t1P

Call for Abstracts: Appsilon Shiny Conference 2022

Appsilon invites you to submit presentation abstracts for the all-virtual Appsilon Shiny Conference on April 27-29, 2022. All talks relating to R Shiny are eligible for consideration. These include talks on a Shiny app/dashboard you’ve created, packages you have developed, use-cases of Shiny in your business or research projects, ... Continue reading: Call for Abstracts: Appsilon Shiny Conference 2022 http://dlvr.it/SJ03DR

‘gifski’ as a bash command using R

The gifski command-line utility is a great tool to make a GIF animation from a series of png files. At my work I’m using a laptop with Windows 10 and I don’t have admin rights. I don’t know how to install... Continue reading: ‘gifski’ as a bash command using R http://dlvr.it/SHzbjB

Making a package from base R files

John C. Nash, retired professor, Telfer School of Management, University of Ottawa; Arkajyoti Bhattacharjee, Department of Mathematics and Statistics, Indian Institute of Technology, Kanpur; based on communications from Duncan Murdoch. 11/06/2021, revised 19/01/2022. Background: This article tries to explain an approach to developing alternative versions of functions which are in the distributed ... Continue reading: Making a package from base R files http://dlvr.it/SHxTLn

Happy birthday easystats! A retrospective

Happy birthday easystats! Two years ago, which feels like yesterday, we celebrated the easystats project’s first anniversary. Wow, those were simpler times! One could travel for pleasure, party with dozens of people and have a face-to-face conver... Continue reading: Happy birthday easystats! A retrospective http://dlvr.it/SHv9Qg

DataCamp Competition – Was a website redesign successful

“🧑If first you don’t succeed, try two or more times so that your failure is statistically significant” 📖 Problem Statement An early-stage start up in Germany has been working on a website redesign of their landing page. The team believes a ne... Continue reading: DataCamp Competition – Was a website redesign successful http://dlvr.it/SHt28r

RTutor: Gasoline Taxes and Consumer Behavior

How do consumers react to an increase in gasoline taxes? How much will they drive less? Will they buy more fuel efficient cars? Do tax increases have a stronger impact than gasoline price increases from other sources, like higher oil prices? In their ... Continue reading: RTutor: Gasoline Taxes and Consumer Behavior http://dlvr.it/SH7B4F

Shinywordle: A shiny app to solve the game Worldle and the power of regular expressions

I’ve created an app, Shinywordle. The use of regular expressions (regex) to solve the game is interesting. As an applied statistician I can’t consider myself a regex expert, but these have helped me a lot when working with non-structured data such ... Continue reading: Shinywordle: A shiny app to solve the game Worldle and the power of regular expressions http://dlvr.it/SH7B2f
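
To make the regex idea concrete, here is a minimal sketch in Python (the Shinywordle app itself is written in R, and this is not its code). The word list, the clues, and the helper name `filter_candidates` are all invented for the illustration: position 1 is assumed to be `s`, `a` is assumed to be in the word but not in position 2, and `e` and `t` are assumed to be absent.

```python
import re

# Hypothetical candidate list; a real solver would load a full dictionary.
words = ["shark", "snail", "stone", "salad", "sugar", "solar", "spray"]

# Clues assumed for this example:
#   - position 1 is 's' (green)
#   - 'a' is in the word but not in position 2 (yellow)
#   - 'e' and 't' are not in the word (grey)
pattern = re.compile(
    r"^(?=.*a)"      # lookahead: the word contains 'a' somewhere
    r"(?!.*[et])"    # negative lookahead: no 'e' or 't' anywhere
    r"s[^a]..."      # 's' first, anything but 'a' second, then three more letters
    r"$"
)


def filter_candidates(candidates, regex):
    """Return the words that satisfy every clue encoded in the regex."""
    return [w for w in candidates if regex.match(w)]


remaining = filter_candidates(words, pattern)
print(remaining)
```

Lookaheads keep each clue in its own piece of the pattern, so a new clue can usually be added as one more fragment rather than by rewriting the whole expression.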

Clipping an isosurface to a ball, and more

We will first show how to clip an isosurface to a ball with R, and then, more generally, how to clip a surface to an arbitrary region. In the last part we show how to achieve the same with Python. ... Continue reading: Clipping an isosurface to a ball, and more http://dlvr.it/SH67Mt

Universal estimation with Maximum Mean Discrepancy (MMD)

This is an updated version of a blog post on RIKEN AIP Approximate Bayesian Inference team webpage: https://team-approx-bayes.github.io/blog/mmd/ INTRODUCTION A very old and yet very exciting problem in statistics is the definition of a universal estimator \(\hat{\theta}\). An estimation procedure that would work all ... Continue reading: Universal estimation with Maximum Mean Discrepancy (MMD) http://dlvr.it/SH3hnP

{emayili} HTML Messages with Images

No two email clients are equal. Nowhere is this more true than in the way that they treat images in HTML messages. Some clients are fairly permissive. Thunderbird, for example, will happily display images in an HTML message if the images are included in any of the following ways: a ... Continue reading: {emayili} HTML Messages with Images http://dlvr.it/SH38D4

rOpenSci 2021 Code of Conduct Transparency Report

The rOpenSci community is supported by our Code of Conduct with a clear description of unacceptable behaviors, instructions on how to make a report, and information on how reports are handled. We, the Code of Conduct Committee, are responsible for rec... Continue reading: rOpenSci 2021 Code of Conduct Transparency Report http://dlvr.it/SH2chk

Salt Lake City R User Group Looks to Meld In-person and Online Activities

Salt Lake City R User Group noticed that their users were missing something during their online meetings. R Consortium talks to Julia Silge about how they planned on mixing online... The post Salt Lake City R User Group Looks to Meld In-person and Online Activities appeared first on R Consortium. Continue reading: Salt Lake City R User Group Looks to Meld In-person and Online Activities http://dlvr.it/SH1G8T

Handling Categorical Data in R – Part 2

This is part 2 of a series on “Handling Categorical Data in R”, where we are learning to read, store, summarize, visualize & manipulate categorical data. In part 1 of this series, we understood what categorical data is, how R stores it using fa... Continue reading: Handling Categorical Data in R – Part 2 http://dlvr.it/SH0n0k

How To Connect R Shiny to Postgres Database – The Definite Guide

Managing database connections can be messy at times. It’s always easier to read and write to local CSV files. That doesn’t mean it’s the right thing to do, as most production environments have data stored in one or multiple databases. As a data professional, you must know ... Continue reading: How To Connect R Shiny to Postgres Database – The Definite Guide http://dlvr.it/SGxnmY

Index Names and lapply Function in R

The post Index Names and lapply Function in R appeared first on finnstats. This post will show you how to use list indices in R’s ... Continue reading: Index Names and lapply Function in R http://dlvr.it/SGwmh5

Gather on the rOpenSci Forum

Do you have an account on the rOpenSci forum? As underlined in our contributing guide, our forum is where we encourage Q&A and exploration of ideas on a wide range of topics. Compared to our Slack semi-open workspace, the forum is entirely open... Continue reading: Gather on the rOpenSci Forum http://dlvr.it/SGvhNS

How to use the dollar sign ($) in R

The post How to use the dollar sign ($) in R appeared first on finnstats. You’ll learn how to use the $ operator in the ... Continue reading: How to use the dollar sign ($) in R http://dlvr.it/SGtglK

Wordle Words and Expected Value

Like many people, I’ve gotten sucked into wordle. For those unfamiliar with the game, you are tasked with identifying a five-letter word. You input a guess (which must be a five-letter word) and are told whether each letter in your guess is: not... Continue reading: Wordle Words and Expected Value http://dlvr.it/SGrmRZ

blind monty hall

As I was waiting for my boat on a French Guiana beach last week, I thought back about a recent riddle from The Riddler where an item does a random walk over a sequence of N integers. Behind doors. The player opens a door at the same rate as the ... Continue reading: blind monty hall http://dlvr.it/SGqYfr

A new way to discover R Programming books!

With over 250 books (and counting) in the Big Book of R collection, you could easily overlook a lot of useful ones. Sometimes you see a book and think “Hey that’ll be pretty useful if I ever work on [subject]” but when that time comes you’ve completely forgotten about ... Continue reading: A new way to discover R Programming books! http://dlvr.it/SGqJj0

Finally understanding what “statistical significance” and p-values mean: A simple example (with R code)

One day I realized that I finally really understood what “statistical significance” means (p < .01). I had probably heard the term hundreds of times by then. If you are still struggling with the concept, I hope it doesn’t take you this long and perhaps this post can be of help. ... Continue reading: Finally understanding what “statistical significance” and p-values mean: A simple example (with R code) http://dlvr.it/SGpMxh

Using bayesian optimisation to tune a XGBOOST model in R

My first post in 2022! A very happy new year to anyone reading this. 😄 I was looking for a simple and effective way to tune xgboost models in R and came across this package called ParBayesianOptimization. Here’s a quick tutorial on how to use it ... Continue reading: Using bayesian optimisation to tune a XGBOOST model in R http://dlvr.it/SGn1Lk

Future Improvements During 2021

Continue reading: Future Improvements During 2021 http://dlvr.it/SGl5SN

Handling Categorical Data in R – Part 1

This is part 1 of a series on “Handling Categorical Data in R.” Almost every data science project involves working with categorical data, and we should know how to read, store, summarize, visualize & manipulate such data. Working with categor... Continue reading: Handling Categorical Data in R – Part 1 http://dlvr.it/SGjMY0

Little useless-useful R functions – Mastermind board game for R

Playing a simple guessing game with R. It’s called the Mastermind game! This game was originally created for two people, but the R version will be in single-player mode, for when an R developer or R data scientist needs a break. The gameplay… Continue reading: Little useless-useful R functions – Mastermind board game for R http://dlvr.it/SGhMBW

How renv restores packages from r-universe for reproducibility or production

This post is part of a series of technotes about r-universe, a new umbrella project by rOpenSci under which we experiment with various ideas for improving publication and discovery of research software in R. As the project evolves, we will post update... Continue reading: How renv restores packages from r-universe for reproducibility or production http://dlvr.it/SGg2K5

Reconciling the Gaussian and Whittle Likelihood with an application to estimation in the frequency domain

Overview Suppose \(\{X_t: t\in \mathbb{Z}\}\) is a second order stationary time series where \(c(r) = \text{cov}(X_{t+r},X_t)\) and \(f(\omega) = \sum_{r\in\mathbb{Z}}c(r)e^{ir\omega}\) are the corresponding autocovariance and spectral density fun... Continue reading: Reconciling the Gaussian and Whittle Likelihood with an application to estimation in the frequency domain http://dlvr.it/SGfbYt

Top 7 Best R Shiny Books and Courses That Are Completely Free

So, you want to become an R Shiny Developer? 2022 is the year to do it. Learning a new language, library, or framework can be stressful – even expensive at times! That’s why we decided to share the 7 best R Shiny books and courses you can follow from the comfort of ... Continue reading: Top 7 Best R Shiny Books and Courses That Are Completely Free http://dlvr.it/SGf60T

Google Season of Docs with R: useR! Information Board

“Google Season of Docs (GSoD) provides support for open source projects to improve their documentation and gives professional technical writers an opportunity to gain experience in open source.” (Source: Program website) Continue reading: Google Season of Docs with R: useR! Information Board http://dlvr.it/SGbqWl

MANOVA in R – How To Implement and Interpret One-Way MANOVA

The R programming language packs a rich set of statistical functions. It makes it easy to do any kind of statistical test, including the analysis of variance. Today you’ll learn all about MANOVA in R, and apply it to a real dataset. We’ll start with the theory and ... Continue reading: MANOVA in R – How To Implement and Interpret One-Way MANOVA http://dlvr.it/SGbMPc

R Studio with great new feature – multiple code panes

With the October 2021 version of R Studio (2021.09.1 Preview), a great and, in my personal opinion, long-awaited feature is now available: multiple windows or panes for viewing R code. On the R Studio home page,… Continue reading: R Studio with great new feature – multiple code panes http://dlvr.it/SGZrhC

Interview with Oscar Baruffa, Creator of the Big Book of R

Welcome to the new year! If you’re itching to improve your R skills in 2022, we have the resource for you. We’re excited to share the Big Book of R. “Your last-ever bookmark”, the Big Book of R is an impressive collection of R-related books from a variety of ... Continue reading: Interview with Oscar Baruffa, Creator of the Big Book of R http://dlvr.it/SGY57D

Starting 2022 Off With A Fairly Complex {ggplot2} Recreation Plot

The New York Times had a [tragic] story on Covid deaths today and one of their plots really stuck with me for how well it told that part of the story. NOTE: The red panel highlights are off a bit as I manually typed the data in (I only did ... Continue reading: Starting 2022 Off With A Fairly Complex {ggplot2} Recreation Plot http://dlvr.it/SGX1kV

Announcing mlr3spatial

We are happy to announce that mlr3spatial has been released on CRAN in November 2021. mlr3spatial simplifies the handling of spatial objects in the mlr3 ecosystem. Before mlr3spatial, the user had to extract tabular data from spatial objects to tra... Continue reading: Announcing mlr3spatial http://dlvr.it/SGWV87

Using Arrow with Shiny

This post is an adaptation of Using databases with Shiny. Shiny apps are R’s answer to building interface-driven applications that help expose important data, metrics, algorithms, and more to end-users. However, the more interesting work that y... Continue reading: Using Arrow with Shiny http://dlvr.it/SGVysJ

Skeptical Bayesian priors might help minimize skepticism about subgroup analyses

Over the past couple of years, I have been working with an amazing group of investigators as part of the CONTAIN trial to study whether COVID-19 convalescent plasma (CCP) can improve the clinical status of patients hospitalized with COVID-19 and requiring noninvasive supplemental oxygen. This was a multi-site study in ... Continue reading: Skeptical Bayesian priors might help minimize skepticism about subgroup analyses http://dlvr.it/SGV33Y

SurvCART: Constructing Survival Tree in R

[Author: Madan G. Kundu] In practice, survival times may be influenced by a combination of baseline variables. Survival trees (i.e., trees with survival data) offer a relatively flexible approach to understanding the effects of covariates, including their interaction, on survival times when the functional form of the association is ... Continue reading: SurvCART: Constructing Survival Tree in R http://dlvr.it/SGTSYZ

How to perform Eta Squared in R

The post How to perform Eta Squared in R appeared first on finnstats. Eta squared is a commonly used effect size metric in ANOVA models. It is calculated as follows: ... Continue reading: How to perform Eta Squared in R http://dlvr.it/SGRpG7
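
For reference, the usual textbook definition of eta squared in a one-way ANOVA is the between-group sum of squares divided by the total sum of squares. The sketch below computes it directly in Python rather than R; the function name `eta_squared` and the three small groups are made up for the example, and the post itself may present the formula differently.

```python
def eta_squared(groups):
    """Eta squared for a one-way design: SS_between / SS_total.

    `groups` is a list of lists, one list of observations per group.
    """
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)

    # Total variability of every observation around the grand mean.
    ss_total = sum((x - grand_mean) ** 2 for x in all_values)

    # Variability of the group means around the grand mean,
    # weighted by the number of observations in each group.
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups
    )
    return ss_between / ss_total


# Hypothetical data: three groups of three observations each.
example_groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
print(round(eta_squared(example_groups), 4))  # 0.8125
```

Here the group means (2, 3 and 6) differ a lot relative to the spread inside each group, so most of the total variance (26 of 32 units) is attributed to group membership.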

Binary image classification using Keras in R: Using CT scans to predict patients with Covid

Here I illustrate how to train a CNN with Keras in R to predict from patients' CT scans those who will develop severe illness from Covid. Motivation Michael Blum tweeted about the STOIC2021 - COVID-19 AI challenge. The main goal of this challenge is to... Continue reading: Binary image classification using Keras in R: Using CT scans to predict patients with Covid http://dlvr.it/SGPb62

How to perform the Sobel test in R

The post How to perform the Sobel test in R appeared first on finnstats. This tutorial will show you how to perform a Sobel ... Continue reading: How to perform the Sobel test in R http://dlvr.it/SGNkcp

How to install (and update!) R and RStudio

One of the first steps to learning R is to have it downloaded and installed on your computer. In this post I’ll show you how to do that and how to download and install RStudio—a key tool for using R, and how I do all my work and ... Continue reading: How to install (and update!) R and RStudio http://dlvr.it/SGNLGr

Chi-Square Goodness of fit formula in R

Visit finnstats for the most up-to-date information on Data Science, employment, and tutorials. Chi-square ... Continue reading: Chi-Square Goodness of fit formula in R http://dlvr.it/SGMZC4