Posts

Statistical computing on a shoestring: Stan in the Azure cloud using Cloud-init

Automating deployment of an inexpensive Linux R/Stan development environment in the Azure public cloud.

The Super League: Tournament Blood Bowl online

Yet another Blood Bowl post! This one is to introduce the Super League, the ultimate tournament-style Blood Bowl available online.

Visualizing Blood Bowl Tournament rosters

Yet another Blood Bowl post! This one is to warm up to the upcoming World Cup, analyzing variation in Tournament Roster choices.

Classifying Blood Bowl teams using clustered heatmaps

Restate my assumptions: If you graph the numbers of any system, patterns emerge. In this post we’ll use clustered heatmaps to graph the numbers from the Blood Bowl Fantasy football game, and see what patterns emerge!

Optimal performance with Random Forests: does feature selection beat tuning?

This blog post demonstrates that the presence of irrelevant variables can reduce the performance of the Random Forest algorithm (as implemented in R by ranger()). The solution is either to tune one of the algorithm’s parameters, OR to remove irrelevant features using a procedure called Recursive Feature Elimination (RFE).

Nufflytics: Analyzing Blood Bowl matches from FUMBBL using Python

This blogpost is about Blood Bowl, a boardgame I started playing last year. The idea of this blog post is to showcase some possible analyses that can be done on the FUMBBL match data I’ve compiled.

OpenJDK and IcedTea: Java Web Start Forensics on Ubuntu

To play Blood Bowl online on FUMBBL.com, a Java client is used that works with Java Web Start. On my Ubuntu linux system, open source versions of java and java web start (openJDK and IcedTea) take care of this. This post describes my suffering caused by the client not working anymore after a Ubuntu software update, and might be helpful for others encountering the same issues.

Construction of a public dataset of Blood Bowl matches played on FUMBBL.com

This blogpost is about Blood Bowl, a boardgame I started playing last year. The goal of this blog post is to use Python API and HTML scraping to fetch Blood Bowl match outcome data from FUMBBL.com, and to create a structured dataset ready for analysis and visualization.

Using R to analyse the Roche Antigen Rapid Test: How accurate is it?

This blog post is about the Roche Rapid Antigen Test Nasal. How accurate is it? I tracked down the data mentioned in the kit’s leaflet, discuss the whole measurement process and used R to reproduce the sensitivity and specificity of the test.

Writing scientific papers using Rstudio and Zotero

This blog post describes a sequence of 9 steps to set up a reproducible workflow for scientific writing based on open-source tooling. It boils down to writing the manuscript in Rmarkdown, and using a set of auxiliary tools to manage citations and output to Word to share with collaborators and to prepare the final document for submission to the journal.