Posts

Showing posts from March 2022

New(ish) paper: Share the code, not just the data: A case study of the reproducibility of JML articles published under the open data policy

Here's an important new paper led by Dr. Anna Laurinavichyute on the reproducibility of published analyses. This paper was commissioned by the editor in chief of the Journal of Memory and Language, Kathy Rastle. Title: Share the code, not just ... Continue reading: New(ish) paper: Share the code, not just the data: A case study of the reproducibility of JML articles published under the open data policy http://dlvr.it/SMkg5V

What is a horizon chart?

A horizon chart is a compact version of an area chart. In the words of Jonathan Schwabish (Reference 1, page 164), it is … an area chart that is sliced into equal horizontal intervals and collapsed down into single bands, … Continue reading: What is a horizon chart? http://dlvr.it/SMjZyB
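
The slicing that Schwabish describes can be sketched in a few lines. The Python snippet below is only an illustration and is not taken from the linked post: the function name `horizon_bands` and the sample values are made up here, and it assumes the series is non-negative. Each band keeps the part of the series that falls inside its interval, measured from the bottom of that interval.

```python
def horizon_bands(values, n_bands):
    """Slice a non-negative series into n_bands equal horizontal intervals.

    Returns one list per band. Each list holds, for every point, the height
    of the series inside that band (between 0 and the band height).
    """
    if n_bands < 1:
        raise ValueError("n_bands must be at least 1")
    top = max(values)
    if top <= 0:
        # Nothing to slice: every band is empty.
        return [[0.0] * len(values) for _ in range(n_bands)]
    band_height = top / n_bands
    layers = []
    for k in range(n_bands):
        lower = k * band_height  # bottom edge of band k
        # Clip each value to the interval [lower, lower + band_height],
        # then shift it down so the band starts at zero.
        layers.append([min(max(v - lower, 0.0), band_height) for v in values])
    return layers


if __name__ == "__main__":
    for layer in horizon_bands([0, 1, 2, 3, 4, 6], 3):
        print(layer)
```

Drawing these layers on top of each other on a shared baseline, in progressively darker shades, gives the collapsed single-band look; summing the layers point by point recovers the original series.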

Track Shiny App User Activity With the RStudio Connect Server API

Data scientists spend a lot of time creating apps, dashboards, and reports. All of this effort is often hampered by siloed workflows between coworkers and across teams, which leads to delays in presenting your insights to stakeholders and clients. After all that time and effort, are you even sure what ... Continue reading: Track Shiny App User Activity With the RStudio Connect Server API http://dlvr.it/SMgZNt

scikit-learn models in R with reticulate

I have tried to venture into Python several times over the years. The language itself seems simple enough to learn but as someone who has only ever used R (and a bit of Stata), there were two things that held me back: I never really found an IDE th... Continue reading: scikit-learn models in R with reticulate http://dlvr.it/SMc3Fg

rsnps 0.5.0: New ncbi_snp_query() Features

TL;DR rsnps is a package that enables the retrieval of single nucleotide polymorphism (SNP) data from the NCBI’s dbSNP database and openSNP by providing wrappers for the APIs. Single nucleotide polymorphisms represent differences at one specifi... Continue reading: rsnps 0.5.0: New ncbi_snp_query() Features http://dlvr.it/SMbdvl

Using R to detect the pressure wave from the 2022 Hunga Tonga eruption in personal weather station data

It seems like an age ago, but in fact it was only mid-January 2022 when this happened: The answers are yes and yes again. One excellent source of weather station data is the Weather Underground. They used to have an API which could be accessed through an R package, rwunderground. This ... Continue reading: Using R to detect the pressure wave from the 2022 Hunga Tonga eruption in personal weather station data http://dlvr.it/SMb798

Shiny Wordle Word Journey

A couple of weeks ago, Winston Chang showed how to create a Wordle app in Shiny in a four-part video series. Go watch the video series to see how he did it! Also, follow along on the first steps to start with Shiny in this tutorial. In our house, we... Continue reading: Shiny Wordle Word Journey http://dlvr.it/SMXZnJ

Nuclear Threat Projection with Neural Network Time Series Forecasting

Unfortunately, we have been going through tough times recently because of the ongoing Russian invasion of Ukraine. As Putin has been backed into a corner by sanctions and losses in the field, he has become more dangerous. He has even threatened to use nuclear weapons if necessary. Because of the nuclear ... Continue reading: Nuclear Threat Projection with Neural Network Time Series Forecasting http://dlvr.it/SMW8z3

An R alternative to pairs for -omics QC

The Problem: I've got a couple of problems with the commonly used “pairs” plot in R for quality control in -omics data. (1) It's not that space-efficient since it only uses half the ... Continue reading: An R alternative to pairs for -omics QC http://dlvr.it/SMVc0W

RObservations #27: Canadian Prime Minister’s Dataset (my “first” Kaggle submission)

No. | Name | Political Party | Term Start | Term End
1 (1 of 2) | John A. Macdonald | Liberal-Conservative | 1867-07-01 | 1873-11-05
2 | Alexander Mackenzie | Liberal | 1873-11-07 | 1878-10-08
1 (2 of 2) | John A. Macdonald | Liberal-Conservative | 1878-10-17 | 1891-06-06
3 | John Abbott | Liberal-Conservative | 1891-06-16 | 1891-11-24
4 | John Thompson | Liberal-Conservative | 1892-12-05 | 1894-12-12
5 | Mackenzie Bowell | Conservative | […]
Continue reading: RObservations #27: Canadian Prime Minister’s Dataset (my “first” Kaggle submission) http://dlvr.it/SMV6qC

Kruskal-Wallis test, or the nonparametric version of the ANOVA

In a previous article, we showed how to do an ANOVA in R to compare three or more groups. Remember that, as for ... Continue reading: Kruskal-Wallis test, or the nonparametric version of the ANOVA http://dlvr.it/SMRGgt

Some R Conferences for 2022

The 2022 R Conference season is already underway. Here is a list of upcoming conferences that we know about. If we have missed your conference, please write to us with the details. We will update our list as we receive more information. April (27 - 29) on-line and free - Appsilon Shiny Conference ... Continue reading: Some R Conferences for 2022 http://dlvr.it/SLmfJn

Curating Your Data Science Content on RStudio Connect

RStudio Connect is RStudio’s publishing platform that hosts data science content created in R or Python, such as R Markdown documents, Shiny apps, Jupyter Notebooks, and more. As you publish to RStudio Connect, you will want your audience to have... Continue reading: Curating Your Data Science Content on RStudio Connect http://dlvr.it/SLm13b

Dashboards in R Shiny

Data can be transformative when loaded through business intelligence software for strategic decision-making. The insights generated empower businesses to improve their processes, initiatives, and innovations. But can we improve the way business decision-makers access these insights? Enter the world of dashboards in R Shiny and see how RStudio and open ... Continue reading: Dashboards in R Shiny http://dlvr.it/SLlYq5

The E8 root polytope

The E8 root polytope, its vertices and its edges. The E8 root polytope, also known as the \(4_{21}\) polytope, is an 8-dimensional polytope. The Cartesian coordinates of its vertices ar... Continue reading: The E8 root polytope http://dlvr.it/SLhVB2

Confidence Intervals Explained

The post Confidence Intervals Explained appeared first on finnstats. Continue reading: Confidence Intervals Explained http://dlvr.it/SLh3xP

A zsh Helper Script For Updating macOS RStudio Daily Electron + Quarto CLI Installs

RStudio’s macOS Electron build is coming along quite nicely and is blazing fast on Apple Silicon. I like to install the dailies, well, daily!; and, of late, RStudio and Quarto are joined at the hip. As a result, I regularly found myself having to manually update Quarto CLI right ... Continue reading: A zsh Helper Script For Updating macOS RStudio Daily Electron + Quarto CLI Installs http://dlvr.it/SLd9gg

repoRter.nih: a convenient R interface to the NIH RePORTER Project API

The US National Institutes of Health (NIH) received funding of approximately $42 billion in fiscal year 2022; $31 billion (72%) of this was awarded by the NIH in the form of research grant funding to hospitals, medical colleges, non-profits, businesses, and other organizations based in the U.S. and abroad.[https://nexus.od.... Continue reading: repoRter.nih: a convenient R interface to the NIH RePORTER Project API http://dlvr.it/SLct5q

Predictive Analytics Models in R

The post Predictive Analytics Models in R appeared first on finnstats. Continue reading: Predictive Analytics Models in R http://dlvr.it/SLcXt1

Markov Chain Introduction in R

The post Markov Chain Introduction in R appeared first on finnstats. Continue reading: Markov Chain Introduction in R http://dlvr.it/SLbsFm

Capture errors, warnings and messages

In my last video I tried to add a feature to my {loud} package (more info here) and I succeeded. But in succeeding I realised that I would need to write a bit more code than I expected. To make a long story short: it is possible to capture... Continue reading: Capture errors, warnings and messages http://dlvr.it/SLbKqz

Monte Carlo Analysis in R

The post Monte Carlo Analysis in R appeared first on finnstats. Continue reading: Monte Carlo Analysis in R http://dlvr.it/SLZBsr

Stock Market Predictions Next Week

The post Stock Market Predictions Next Week appeared first on finnstats. Continue reading: Stock Market Predictions Next Week http://dlvr.it/SLYSgb

A prerelease version of Jupyter Notebooks and unleashing features in JupyterLab

Jupyter Notebook also offers developer or prerelease versions. What you need to do is simply run: And with this prerelease version of the Jupyter notebook, you have in addition several options to enhance your… Continue reading: A prerelease version of Jupyter Notebooks and unleashing features in JupyterLab http://dlvr.it/SLY7nR

Dashboard Framework Part 2: Running Shiny in AWS Fargate with CDK

In the previous post we outlined the architecture of a dashboard framework to run dashboards based on multiple technologies including Shiny and Flask in production. We will now show how... Continue reading: Dashboard Framework Part 2: Running Shiny in AWS Fargate with CDK http://dlvr.it/SLWtJm

Something to note when using the merge function in R

Base R has a merge function which does join operations on data frames. As the documentation says, the function [merges] two data frames by common columns or row names, or do other versions of database join operations. One thing that I … Continue reading: Something to note when using the merge function in R http://dlvr.it/SLVqBF

Self-documenting plots in ggplot2

When I am showing off a plotting technique in ggplot2, I sometimes like to include the R code that produced the plot as part of the plot. Here is an example I made to demonstrate the debug parameter in element_text(): library(ggplot2) self_document( ... Continue reading: Self-documenting plots in ggplot2 http://dlvr.it/SLTWbX

Data Challenges for R Users

Today, I want to write about “data challenges”, where participants partake in a series of prompts designed for a variety of skill sets and levels. These challenges are opportunities to practice programming skills, work on algorithm... Continue reading: Data Challenges for R Users http://dlvr.it/SLTDfC

simplevis: new & improved!

simplevis version 6.2.0 has arrived with tonnes of new features. So, simplevis, if you haven’t heard of it, is a package of ggplot2 and leaflet wrapper functions. It aims to make visualisation easier on the brain, so you can save your thinking for ... Continue reading: simplevis: new & improved! http://dlvr.it/SLSQDL

Checking the inputs of your R functions

Are you, like we were, tired of filling your functions with argument checking code that sometimes ends up being longer than the core of the function itself? Are you trying to find the most efficient approach to check inputs easily and without f... Continue reading: Checking the inputs of your R functions http://dlvr.it/SLRvSq

Imputing missing values in R

The post Imputing missing values in R appeared first on finnstats. Continue reading: Imputing missing values in R http://dlvr.it/SLNvZ1

Creating a Dashboard Framework with AWS (Part 1)

R-Shiny is an excellent framework to create interactive dashboards for data scientists with no extensive web development experience. Similar technologies in other languages include Flask, Dash or St... Continue reading: Creating a Dashboard Framework with AWS (Part 1) http://dlvr.it/SLMqKg

BensstatsTalks#3: 5 Tips for Landing a Data Professional Role

Disclaimer: This was originally written on my Medium blog here, so the formatting is a little different from my usual style. If you just got started or have been working a while in a data role, the jargon thrown around can sometimes get overwhelming with all the things to need ... Continue reading: BensstatsTalks#3: 5 Tips for Landing a Data Professional Role http://dlvr.it/SLMHHs

Getting to know Julia

I thought I’d try Julia out and see how far I could get with nothing but Google on my side. Continue reading: Getting to know Julia http://dlvr.it/SLLqjM

Complete tutorial on using ‘apply’ functions in R

Today I’m going to talk about a useful family of functions that allows you to repetitively perform a specified function (e.g., sum(), mean()) across a vector, list, matrix, or data frame. For those of you familiar with ‘for’ loops, th... Continue reading: Complete tutorial on using ‘apply’ functions in R http://dlvr.it/SLKRjB

Bootstraps & Bandings

Are the distinct residential property bands of 3 decades ago becoming less so? Over the years, urban properties have been added to and divided up. And two streets of equal attractiveness, and with equivalently-banded properties, may have diverged as neighbourhoods evolved. Whilst properties can and do move to higher or lower ... Continue reading: Bootstraps & Bandings http://dlvr.it/SLJvNL

rstudio::conf(2022) is open for registration!

rstudio::conf, the conference for all things R and RStudio, will take place July 25-28 in National Harbor, DC! As usual, we’ll have two days of workshops followed by two days of talks. If you’ve attended before and already know you want to ... Continue reading: rstudio::conf(2022) is open for registration! http://dlvr.it/SLGgJX

How to Calculate a Cumulative Average in R

The post How to Calculate a Cumulative Average in R appeared first on finnstats. Continue reading: How to Calculate a Cumulative Average in R http://dlvr.it/SLGJVX

Some thoughts about the use of cloud services and web APIs in social science research

In recent weeks I’ve collaborated on the online book APIs for social scientists and added two chapters: a chapter … Continue reading: Some thoughts about the use of cloud services and web APIs in social science research http://dlvr.it/SLDmxQ

Convert data frame to array in R

The post Convert data frame to array in R appeared first on finnstats. Continue reading: Convert data frame to array in R http://dlvr.it/SLCn85

Retweet Network Analysis in Cryptocurrencies

In this tutorial, we will run a simple network analysis on retweets that contain the hashtag “#Crypto” by taking into ... Continue reading: Retweet Network Analysis in Cryptocurrencies http://dlvr.it/SLBwFw

WAFEDA (Web App For Exploratory Data Analysis): Built on RShiny

WAFEDA sounds weird, right? I couldn’t come up with a better name. This web app is meant to perform basic exploratory data analysis in the browser, courtesy of R Shiny. The app accepts only CSV and TSV files and is able to plot various visualizations depend... Continue reading: WAFEDA (Web App For Exploratory Data Analysis): Built on RShiny http://dlvr.it/SLBDjK

Conditional RNN in keras (R) to deal with static features

Conditional RNN is one of the possible solutions if we’d like to make use of static features in time series forecasting. For example, we want to build a model, which can handle multiple time series with many different characteristics. It can be a mode... Continue reading: Conditional RNN in keras (R) to deal with static features http://dlvr.it/SL8yyz

How to Get Twitter Data using R

In a previous post, we showed how to get Twitter data using Python. In this tutorial, we will show you ... Continue reading: How to Get Twitter Data using R http://dlvr.it/SL7Sy0

Version 0.12.2 of NIMBLE released, including an important bug fix for some models using Bayesian nonparametrics with the dCRP distribution

We’ve released the newest version of NIMBLE on CRAN and on our website. NIMBLE is a system for building and sharing analysis methods for statistical models, especially for hierarchical models and computationally-intensive methods (such as MCMC and SMC). Version 0.12.2 is a bug fix release. In particular, this release fixes ... Continue reading: Version 0.12.2 of NIMBLE released, including an important bug fix for some models using Bayesian nonparametrics with the dCRP distribution http://dlvr.it/SL6S93

Random Number Generator with Random Package

The post Random Number Generator with Random Package appeared first on finnstats. Continue reading: Random Number Generator with Random Package http://dlvr.it/SL5TG1

How to use Fonts and Icons in ggplot

For some reason, using other than the default font in plots has been a major problem for me in R. Supposedly, one can use the extrafont package to manage all of that but I found it too cumbersome. Instead, I found out that the showtext package can make my life ... Continue reading: How to use Fonts and Icons in ggplot http://dlvr.it/SL4zD2

Reviewing my First Shiny Project (1/n) – Buttons

My first serious Shiny project is finished: 4517 lines of code (without comments)! Now I’m taking the time to go through the code again and reflect. What has turned out well and can be continued in the future? What are problem areas and need to be rewo... Continue reading: Reviewing my First Shiny Project (1/n) – Buttons http://dlvr.it/SL2hJr

WAFEDA (Web App For Exploratory Data Analysis): Built on RShiny

WAFEDA sounds weird, right? I couldn’t come up with a better name. This web app is meant to perform basic exploratory data analysis in the browser, courtesy of R Shiny. The app accepts only CSV and TSV files and is able to plot various visualizations de... Continue reading: WAFEDA (Web App For Exploratory Data Analysis): Built on RShiny http://dlvr.it/SL1K3x

How to use pipes to clean up your R code

I’ve talked a little bit about pipes (written as %>%) in a past blog post, but they’re so important in R that I thought they deserved their own post. In this tutorial, I’m going to give an explanation of what pipes are and when the... Continue reading: How to use pipes to clean up your R code http://dlvr.it/SKy9HQ

Best AI Courses Online-Free

The post Best AI Courses Online-Free appeared first on finnstats. Continue reading: Best AI Courses Online-Free http://dlvr.it/SKxdKB

RStudio Community Monthly Events Roundup – March 2022

Welcome to RStudio Community Monthly Events Roundup, where we update you on upcoming events happening at RStudio this month. Missed the great talks and presentations from last month? Find them listed under ICYMI: Fe... Continue reading: RStudio Community Monthly Events Roundup – March 2022 http://dlvr.it/SKv4zH

Interaction Plot in R: How to Visualize Interaction Effect Between Variables

By far the easiest way to detect and interpret the interaction between two factor variables is by drawing an interaction plot in R. It displays the fitted values of the response variable on the Y-axis and the values of the first factor on the X-axis. The second factor is represented through ... Continue reading: Interaction Plot in R: How to Visualize Interaction Effect Between Variables http://dlvr.it/SKtgtP
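
To make the layout of such a plot concrete, here is a rough Python sketch of the numbers behind it. It is not code from the linked post: the function name `interaction_means` and the sample rows are invented for illustration, and it assumes the plotted values are the mean response in each combination of the two factors (which is what a model with a full interaction term would fit). Each key of the result corresponds to one level of the second factor, and its values are the Y-axis heights at each level of the first factor.

```python
from collections import defaultdict


def interaction_means(rows):
    """Compute the mean response for every (first factor, second factor) cell.

    rows: iterable of (factor_a, factor_b, response) tuples.
    Returns {factor_b_level: {factor_a_level: mean_response}}.
    """
    # For each cell keep a running [sum, count] pair.
    cells = defaultdict(lambda: defaultdict(lambda: [0.0, 0]))
    for a, b, y in rows:
        cell = cells[b][a]
        cell[0] += y
        cell[1] += 1
    return {
        b: {a: total / count for a, (total, count) in by_a.items()}
        for b, by_a in cells.items()
    }


if __name__ == "__main__":
    # Hypothetical data: dose (first factor), group (second factor), response.
    sample = [
        ("low", "control", 1.0),
        ("low", "control", 3.0),
        ("high", "control", 4.0),
        ("low", "treated", 2.0),
        ("high", "treated", 8.0),
    ]
    print(interaction_means(sample))
```

If the values for the levels of the second factor rise or fall by different amounts across the first factor, as they do in this made-up sample, the plotted lines are not parallel, which is the usual visual hint of an interaction.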

The modified stereographic projection

Some of my 3D animations start with a 4D object (such as a polytope) and I project it to the three-dimensional space with a stereographic projection. For example, the hyperbolic gircope. For this animatio... Continue reading: The modified stereographic projection http://dlvr.it/SKrdwy