المشاركات

tabulapdf: Extract Tables from PDF Documents

Motivation I had to extract multiple tables from PDF files and do some data analysis in R. I found that updating tabulizer (now retired from CRAN) to use a Java version newer than Java 8 (deprecated) was worth it to complete this task. tabulapdf ... Continue reading: tabulapdf: Extract Tables from PDF Documents http://dlvr.it/T6DgjH

Sketchy waffle charts in R

صورة
Waffle charts are a common way to visualise counts or percentages of categorical data. There are already several excellent ways of creating waffle charts in R - including approaches using {ggplot2} or {waffle}. This blog post uses neither of those. Instead, it describes a somewhat back-to-basics approach of simply drawing ... Continue reading: Sketchy waffle charts in R http://dlvr.it/T6B3WY

Guest Post: Introducing the polyglotr package

صورة
Announcing the polyglotr package polyglotr The package polyglotr is tool for language translation within the R programming environment. This package stands out for its ability to integrate with a variety of free translation services, making... Continue reading: Guest Post: Introducing the polyglotr package http://dlvr.it/T69gTk

Evenly Spaced Month Charts

صورة
I recently noticed that ggplot2 spaces date axes literally even when grouped by month. I’ve been using ggplot2 extensively for years and I don’t remember noticing before, so this is not really a big deal, but now that I know it bugs me a lot. Take a... Continue reading: Evenly Spaced Month Charts http://dlvr.it/T68JFS

Office365 AddIns for R (Part III)

صورة
A while back, I introduced the __ExcelRAddIn__ ([Office365 AddIns for R (Part I)]( https://adam-gladstone.github.io/r-project/Office365AddIns-for-R-part-I/)). This is an Office365 AddIn that allows you to evaluate an R-script from within Excel and use the results. This blog-post describes some of the recent updates to the ExcelRAddIn. Continue reading: Office365 AddIns for R (Part III) http://dlvr.it/T67k16

Backtesting

صورة
The key to successful backtesting is to ensure that you only use the data that were available at the time of the prediction. No “future” data can be included in the model training set, otherwise the model will suffer from look-ahead bias (having unrealistic access to future data). Continue reading: Backtesting http://dlvr.it/T67jmS

{emayili} Support for Mailtrap

صورة
The {emayili} package has adapters which make it simple to send email via a variety of services. For example, it caters specifically for ZeptoMail, MailerSend, Mailfence and Sendinblue. The latest version of {emayili}, 0.8.0 published on 23 April 2024, adds an an adapter for Mailtrap. Continue reading: {emayili} Support for Mailtrap http://dlvr.it/T64PcZ