المشاركات

عرض المشاركات من أبريل, 2022

Separating Code from Presentation in Jupyter Notebooks

صورة
One of the great conveniences of performing a data science style analysis using Jupyter is that Jupyter notebooks are literate containers that combine code, text, results, and graphs. This is also one of the pain points in working with Jupyter notebooks with partners or with source control. That is: Jupyter […] Continue reading: Separating Code from Presentation in Jupyter Notebooks http://dlvr.it/SPYLcr

Using R and {paws} to populate DynamoDB tables

R and AWS DynamoDB In recent weeks we’ve been using AWS services for a few our projects. One of the things that came up was to populate DynamoDB tables with data from R. Of course, we didn’t have to do this from R, but most of our data ... Continue reading: Using R and {paws} to populate DynamoDB tables http://dlvr.it/SPXblv

Diffify

صورة
You know that sinking feeling that you get when you’re months into a big project and you log in one day and nothing works? Turns out something has updated and things have been removed that y... Continue reading: Diffify http://dlvr.it/SPVkPV

Military Expenditure Estimation with RJAGS Simulation

صورة
The side effect of the ongoing Russian occupation was that it stimulated countries’ intentions to increase their military budgets. Last month following the invasion, Germany announced a budget of 100 billion € for the restructuring of the army and announced that it would spend %2 of its GDP every year. Of course, it ... Continue reading: Military Expenditure Estimation with RJAGS Simulation http://dlvr.it/SPVkP6

Little useless-useful R functions – Animating datasets

صورة
I firmly believe that animation and transition between different data states can give end-users much better insights and understanding of the data, than a single table with data points or correlation metrics. With help of ggplot, gganimate, you can quickly…Read more › Continue reading: Little useless-useful R functions – Animating datasets http://dlvr.it/SPVQmp

Don’t Lose your HEAD over Default Branches

Read on for a little trick to always get the default branch… While git has no “default” branch, online code platforms such as GitHub need to define a default for the branch that someone sees when they look at code online. While the ... Continue reading: Don’t Lose your HEAD over Default Branches http://dlvr.it/SPV2ql

March: “Top 40” New CRAN Packages

صورة
Two hundred and six new packages stuck to CRAN in March. Here are my “Top 40” selections in thirteen categories: Computational Methods, Data, Finance, Game Theory, Genomics, Machine Learning, Medicine, Networks, Science, Statistics, Time Series, Utilities, and Visualization. Computational Methods RCDT v1.1.0: Provides functions to perform 2D Delaunay triangulation, constrained or ... Continue reading: March: “Top 40” New CRAN Packages http://dlvr.it/SPRRlp

How to reshape your data in R for analysis

صورة
One of the toughest parts of data analysis is preparing your data to be analyzed. We often have to deal with problems like NAs, typos, and data that are formatted incorrectly. In this blog post, I’m specifically going to help you with that last o... Continue reading: How to reshape your data in R for analysis http://dlvr.it/SPR7M6

rstudio::conf(2022) Workshops

We are so excited for rstudio::conf! To start off the conference in July, we have an amazing line-up of workshops. There’s a session for you wherever you are on your data science journey. Get inspired and learn something new. You can find all wor... Continue reading: rstudio::conf(2022) Workshops http://dlvr.it/SPR75g

How to Deploy RStudio Connect into Local Kubernetes Cluster

صورة
RStudio Connect is a publishing platform that lets you and your team share R Shiny applications, R Markdown reports, APIs, dashboards, Jupyter Notebook, and interactive Python content in one convenient place. Today we’ll show you how to deploy RStudio Connect to the local Kubernetes cluster with ease. Following along ... Continue reading: How to Deploy RStudio Connect into Local Kubernetes Cluster http://dlvr.it/SPR72y

TidyTuesday and Barplots with Images

صورة
This week, the Kaggle Hidden Gems Competition is featured in a special crossover event with the fantastic TidyTuesday project to further strengthen the connections between the R & Kaggle communities. The TidyTuesday participants are invited to expand their beautifully crafted data visuals into narrative notebooks on the Kaggle platform, and receive ... Continue reading: TidyTuesday and Barplots with Images http://dlvr.it/SPQjXL

RStudio Community Table Gallery

Tables are an excellent way to organize your data, whether the medium is an R Markdown document, poster, or a Shiny app. We’ve collected many community contributions that shine in this regard and they are showcased in the new RStudio Community Ta... Continue reading: RStudio Community Table Gallery http://dlvr.it/SPJQpm

Using Shiny to Create an Academic Poster

TL;DR Run shiny::runGitHub('jbryer/ShinyPoster') see an example poster. Go to https://github.com/jbryer/ShinyPoster to download the template Introduction In the past several years academic conferences have begun to display poster presentations el... Continue reading: Using Shiny to Create an Academic Poster http://dlvr.it/SPJQnr

Bayesian analyses made easy: GLMMs in R package brms

صورة
Here I illustrate how to fit GLMMs with the R package brms, and compare to Jags and lme4. Motivation I regularly give a course on Bayesian statistics with R for non-specialists. To illustrate the course, we analyse data with generalized linear, often m... Continue reading: Bayesian analyses made easy: GLMMs in R package brms http://dlvr.it/SNpHpj

Text Analysis of Job Descriptions for Data Scientists, Data Engineers, Machine Learning Engineers and Data Analysts

صورة
Introduction In the previous post, the intrepid Jesse Blum and I analyzed metadata from over 6,500 job descriptions for data roles in seven European countries. In this post, we’ll apply text analysis to those job postings to better understand the technologies and skills that employers are looking for in data ... Continue reading: Text Analysis of Job Descriptions for Data Scientists, Data Engineers, Machine Learning Engineers and Data Analysts http://dlvr.it/SNnRqc

Linking R and Python to retrieve financial data and plot a candlestick

Are you interested in guest posting? Publish at DataScience+ via your RStudio editor.CategoryData ManagementTagsFinanceggplot2investpyR ProgrammingVisualizationI am way more experienced with R than with Python and prefer to code in this language when possible. This applies, especially when it is about visualizations. Plotly and ggplot2 are fantastic packages that ... Continue reading: Linking R and Python to retrieve financial data and plot a candlestick http://dlvr.it/SNm7m5

WTF is Kubernetes and Should I Care as R User?

صورة
I’m going to give you a high overview of kubernetes and how you can make your R work shine in kubernetes. Are you, an R-user in a company that uses kubernetes? building R applications (models that do predictions, shiny applications, APIs)? curi... Continue reading: WTF is Kubernetes and Should I Care as R User? http://dlvr.it/SNkBrw

Keeping Those SSH Keys Safe

صورة
I came across a neat site that uses a Golang wasm function called from javascript on the page to help you see if your GitHub public SSH keys are “safe”. What does “safe” mean? This is what the function checks for (via that site): Recommended key sizes are as follows: ... Continue reading: Keeping Those SSH Keys Safe http://dlvr.it/SNjh2m

Simple R merge method and how to compare it with T-SQL

صورة
Merge statement in R language is a powerful, simple, straightforward method for joining data frames. Nevertheless, it also serves with some neat features that give R users fast data wrangling. I will be comparing this feature with T-SQL language, to…Read more › Continue reading: Simple R merge method and how to compare it with T-SQL http://dlvr.it/SNjDdn

Ever wondered how the probability of the null hypothesis being true changes given a significant result?

TRIGGER WARNING: These simulations might fundamentally shake your belief system. USE WITH CARE. In a recently accepted paper in the open access journal Quantitative Methods for Psychology that Daniel Schad led, we discuss how, using Bayes' rule, one ca... Continue reading: Ever wondered how the probability of the null hypothesis being true changes given a significant result? http://dlvr.it/SNhczN

Trends in Back Pain

Brodeur, Abel, Andrew E Clark, Sarah Fleche, and Nattavudh Powdthavee. 2021. “COVID-19, Lockdowns and Well-Being: Evidence from Google Trends.” Journal of Public Economics 193: 104346. Ciaffi, Jacopo, Riccardo Meliconi, Maria Paola Landini, Luana Mancarella, Veronica Brusi, Cesare Faldini,... Continue reading: Trends in Back Pain http://dlvr.it/SNdY6L

Billionaires and Taxes

“Wealth inequality is increasing!”, “Rich people don’t pay enough taxes”, “THEY aren’t doing enough” — we have all heard these lines at some point in our life. Salaried people protesting against super-high taxes, college students protesting on behalf of everyone, and, of course, Bernie Sanders. How much do the ... Continue reading: Billionaires and Taxes http://dlvr.it/SNcLqn

R-Ladies Cologne joins the family

صورة
R-Ladies Cologne is a new R-Ladies chapter which was founded in September 2021. It joins R-Ladies' mission to promote and support gender equality in the field. Together with Luciana and Gabe, we have hosted three events so far. Kicking it off, we had a... Continue reading: R-Ladies Cologne joins the family http://dlvr.it/SNbPDS

How to collect dataviz from Twitter into your note-taking system

Intro It is mid-April and the #30daychartchallenge is well on its way. One glace at the hashtag’s Twitter feed suffices to realize that there are great contributions. That’s a perfect opportunity to collect data viz examples for future inspirations. Ideally, I can scroll through Twitter and with a ... Continue reading: How to collect dataviz from Twitter into your note-taking system http://dlvr.it/SNZvnz

Selection bias and observed values

صورة
I’m a man of a certain age and friend set, which means that I play a tabletop pen-and-paper roleplaying game called Pathfinder (similar to Dungeons and Dragons). You have a character with different values for strength, dexterity, constitution, charisma... Continue reading: Selection bias and observed values http://dlvr.it/SNY0vG

R-spatial evolution: retirement of rgdal, rgeos and maptools

Why this blog? Why rgeos and rgdal will retire Packages depending on rgeos and rgdal The Plan Packages depending on sp and raster [view raw Rmd] Summary: Packages rgdal, rgeos and maptools will retire by the end of 2023 . We describe wher... Continue reading: R-spatial evolution: retirement of rgdal, rgeos and maptools http://dlvr.it/SNT7tF

Windows update: JAGS 4.3.1 is released

JAGS version 4.3.1 is now available from Sourceforge. This is a patch update with a singular purpose: to allow a new Windows binary to be built using the Rtools42 toolchain. JAGS is built with the same compiler as R on … Continue reading → Continue reading: Windows update: JAGS 4.3.1 is released http://dlvr.it/SNSpbZ

The sftime Package

صورة
Which gap does the sftime package fill? A motivating example Geometrical operations and subsetting Plotting Another motivating example: earthquake events The sftime class Object structure Conversion from... Continue reading: The sftime Package http://dlvr.it/SNSPx3

Pulling Twitter Engagements Using the v2 API as Well as rtweet

This is a follow-up to a short post I wrote on R Access to Twitter’s v2 API. In this post I’ll walk through a few more examples of pulling data from twitter using a mix of Twitter’s v2 API as well as the {rtweet} package1. I’ll ... Continue reading: Pulling Twitter Engagements Using the v2 API as Well as rtweet http://dlvr.it/SNPx0H

RStudio Connect Python Minimum Version Update

What Administrators and Publishers should know The March 2022 RStudio Connect release removes support for Python 2 and updates the minimum version supported to Python 3.5. Why now? Python 2.7 has reached end of life (EOL) maintenance status. On Janua... Continue reading: RStudio Connect Python Minimum Version Update http://dlvr.it/SNPJng

The first draft of “Forecasting and Analytics with ADAM”

After working on this for more than a year, I have finally prepared the first draft of my online monograph “Forecasting and Analytics with ADAM“. This is a monograph on the model that unites ETS, ARIMA and regression and introduces advanced features in univariate modelling, including: ETS in a new ... Continue reading: The first draft of “Forecasting and Analytics with ADAM” http://dlvr.it/SNNx6B

RObservations #29 – Classifying and Filtering Coordinates By Using the sf Library

صورة
Introduction Geo-spatial analysis and visualizations is a powerful tool for providing insight bringing an idea or a result in a more tangible manner. Oftentimes, we are only interested in a specific points or we wan to classify the data we have by a larger location it belongs to. In this ... Continue reading: RObservations #29 – Classifying and Filtering Coordinates By Using the sf Library http://dlvr.it/SNLTqP

Indexing Iterations with set_names()

As mentioned last time, we often want to build up a data frame iteratively. The map() family of functions in purrr can help with this. Here I’ll show a handy pattern for keeping track of what you’ve added to the data frame you’re making. The map_dfr() ... Continue reading: Indexing Iterations with set_names() http://dlvr.it/SNKyPB

A Fresh Lick of Paint

A couple of years ago I moved house from Wordpress to Blogdown. It’s a less stressful life and I plan to stay. Hugo Academic served me well, but sometimes you just need a fresh coat of paint. Continue reading: A Fresh Lick of Paint http://dlvr.it/SNH2dB

Iterating on the GSS

Let’s say we’re working with the General Social Survey. We’re interested in repeatedly fitting some model each year to see how some predictor changes over time. For example, the GSS has a longstanding question named fefam, where respondents are asked to give their opinion on the following ... Continue reading: Iterating on the GSS http://dlvr.it/SNFhXN

Upcoming R conferences (2022)

صورة
The daffodils are out, the sun is shining(ish), spring is in the air! And that means that it’s time to start planning for the conference season. This year we’ll be treated to a lot more in person ... Continue reading: Upcoming R conferences (2022) http://dlvr.it/SNFRT3

Australian GP FP2

صورة
Hello, this is a new series which I will be analysing the data from Friday practice at the Grand Prix. The focus is going to be on strategy so key to a Formula 1 race. This is the first version of this analysis, lookout for more information on future races Tyre ... Continue reading: Australian GP FP2 http://dlvr.it/SNDlZS

Mark Your Calendar for the Appsilon Shiny Conference

We’re excited to team up with our Full Service Partner, Appsilon, as they host their first virtual Shiny conference. What should I expect? Three days of free, online Shiny content ranging from tips and tricks from the experts, to fascinating community ... Continue reading: Mark Your Calendar for the Appsilon Shiny Conference http://dlvr.it/SNBJV0

SFTP in R on a Mac

I am working on a project where I need to upload PDFs generated from Rmarkdown to a SFTP server. The sftp R package is a nice wrapper to the RCurl package for handling SFTP access. But to my surprise, SFTP support is not included on Macs by default through the ... Continue reading: SFTP in R on a Mac http://dlvr.it/SN9f92

Make robust, modular dashboards with golem and graveler

Photo courtesy of Pixabay This is a guest post from Alan Carlson at Snap Finance. As the Tech Lead for the Business Intelligence (BI) team, Alan's primary focus at Snap is researching, creating, and maintaining methods that help the rest of Snap’s BI ... Continue reading: Make robust, modular dashboards with golem and graveler http://dlvr.it/SN9F8R

How to make a boxplot in R

صورة
In this tutorial, I’m going to show you how to plot and customize boxplots (also known as box and whisker plots). Boxplots are a common type of graph that allow you to look at the relationships between a continuous variable and various categorica... Continue reading: How to make a boxplot in R http://dlvr.it/SN5fFl

Paper: ‘Semi-Markov modeling for cancer insurance’

صورة
I am happy to announce that our paper entitled “Semi-Markov modeling for cancer insurance” has been accepted for publication in the European Actuarial Journal. Advancements in medicine and biostatistics have already resulted in a better access to ... Continue reading: Paper: ‘Semi-Markov modeling for cancer insurance’ http://dlvr.it/SN586n

ISC Call for Proposals – Now Accepting Applications

صورة
The first 2022 call for proposals for R Consortium Infrastructure Steering (ISC) grants is now open and will be accepting proposals through May 1, 2022. This year, the R Consortium has... The post ISC Call for Proposals – Now Accepting Applications appeared first on R Consortium. Continue reading: ISC Call for Proposals – Now Accepting Applications http://dlvr.it/SN35PK

R Shiny in Government – Top 7 Dashboards You Should See

صورة
R Shiny allows developers to write production-ready dashboards in no time, and for any use case. It’s not rare to find a vast number of dashboards in business-related domains, such as sales and marketing, but the possibilities don’t end there. Today we bring you seven curated examples of ... Continue reading: R Shiny in Government – Top 7 Dashboards You Should See http://dlvr.it/SN21tt

Teaching Data Science in the Cloud

صورة
Photo by Chris Montgomery on Unsplash Data science and programming languages like R and Python are some of the most in-demand skills in the world. RStudio Cloud is a simple but powerful solution for teaching and learning analytics at scale. RStudio Clo... Continue reading: Teaching Data Science in the Cloud http://dlvr.it/SMz8lD

News from ESGtoolkit, ycinterextra, and nnetsauce

صورة
News from R packages ESGtoolkit, ycinterextra, and Python/R package nnetsauce Continue reading: News from ESGtoolkit, ycinterextra, and nnetsauce http://dlvr.it/SMylqW

Update On ESC/POS Package Work

صورة
The morning before work was super productive and there is a nigh-complete DSL for ESC/POS commands along with the ability to just print {ggplot2}/{grid} object. I changed the package name to {escpos} since it is no longer just plot object focused, and the DSL looks a bit like ... Continue reading: Update On ESC/POS Package Work http://dlvr.it/SMyJmr

{emayili} Encrypted Email with Mailfence

صورة
In the previous post I ran through the process of setting up a Mailfence account for encrypting emails. In this post I show how Mailfence can be used with the {emayili} package for sending encrypted email from R. Continue reading: {emayili} Encrypted Email with Mailfence http://dlvr.it/SMwfB9

Turning {ggplot2} Into a PoS (Point-of-Sale) System

صورة
At the end of March, I caught a fleeting tweet that showcased an Epson thermal receipt printer generating a new “ticket” whenever a new GitHub issue was filed on a repository. @aschmelyun documents it well in this blog post. It’s a pretty cool hack, self-contained on a Pi Zero. ... Continue reading: Turning {ggplot2} Into a PoS (Point-of-Sale) System http://dlvr.it/SMv1HG

Greatly Revised Edition of Tidyverse Skeptic

As a longtime R user and someone with a passionate interest in how people learn, I continue to be greatly concerned about the use of the Tidyverse in teaching noncoder learners of R. Accordingly, I have now thoroughly revised my Tidyverse Skeptic essay. It is greatly reorganized with focus on ... Continue reading: Greatly Revised Edition of Tidyverse Skeptic http://dlvr.it/SMtJlg

DuckDB: Quacking SQL

"R Python" continued... DuckDB: Quacking SQL Continue reading: DuckDB: Quacking SQL http://dlvr.it/SMrKTZ

The {chronicler} package, an implementation of the logger monad in R

صورة
Back in February I discussed a package I was working on which allowed users to add logging to function calls. I named the package {loudly} but decided to rename it to {chronicler}. I have been working on it for the past few weeks, and I think that a CRAN release ... Continue reading: The {chronicler} package, an implementation of the logger monad in R http://dlvr.it/SMq036

RStudio Community Monthly Events Roundup – April 2022

Photo by Nick Morrison on Unsplash Welcome to RStudio Community Monthly Events Roundup, where we update you on upcoming virtual events happening at RStudio this month. Missed the great talks and presentations from last month? Find them listed under I... Continue reading: RStudio Community Monthly Events Roundup – April 2022 http://dlvr.it/SMpl2h