📊

Data Scientist (with R)

Newsletters sent once a week, unsubscribe anytime.

Published 16th September 2025

📦 New R packages, CRAN news, and community updates

I Vibe Coded an R Package (jcarroll​.com​.au). Describes building an R package querying WaniKani API, visualizing kanji similarity, with Claude Code AI, caching, tests, and a Shiny app

Our September issue is out now! (methodsblog​.com). DeepDiveR for palaeodiversity estimation; field heatwave simulation; synergyfinder in ecotoxicology; ML in marine imagery; biases in home range tracking; airborne eDNA samplers

Help Us Design a New Git(Hub) Organizational Dashboard (ropensci​.org). ROpenSci designs an organizational dashboard to map contributors, repositories, and maintenance needs across epiverse, reconhub, and epiverse-trace on GitHub

R Weekly 2025-W38 R6 Interfaces, ggplot2 (rweekly​.org). R6 Interfaces for Backend and ggplot2 4.0.0; insights from R consortium updates and new packages

The ‘Tanakh’ R package (pacha​.dev). R package Tanakh provides tidy, verse-level Hebrew Bible access with English/texts sourced from Chabad.org using RSelenium and purrr

Boost Your Career with Jumping Rivers Free Monthly Webinars – Next Session on 18th September (jumpingrivers​.com). Free monthly webinar series focused on R, Python, Shiny, and Posit with automated reporting via Quarto and Litmus insights

wbstats is back on CRAN (pacha​.dev). Wbstats returns to CRAN after policy-aligned updates, including mocked tests with vcr and documentation tweaks

🎓 Teaching, reporting, and open science with/around R

The world’s laziest peer reviewer (reeserichardson​.blog). Dataset on peer review reports from six BMC journals reveals a review mill and calls for reform in peer review transparency

Paper in a prompt creating a first draft in one prompt (seascapemodels​.org). Prompts using Roo Code with Claude Sonnet 4.0 to draft a marine science paper, including data analysis and web-sourced references

Recreating APA Manual Table 7.1 in R with apa7 (wjschne​.github​.io). Recreating APA Table 7.1 in R using apa7, flextable, and tidyverse for reproducible APA-style tables

Recreating APA Manual Table 7.2 in R with apa7 (wjschne​.github​.io). Recreating APA Table 7.2 in R using apa7, flextable, ftExtra, and tidyverse with custom formatting

Justify your answer (argmin​.net). Three views of optimal decisions—gambling odds, likelihood ratios, and random-phenomena—unifying decision rules in ML, detection theory, and betting

Day 4 of Teaching Gov 50 at Harvard (causalinf​.substack​.com). Teaching potential outcomes through concrete narratives, spreadsheets, and Rebecca Thornton’s HIV study; integrating R, dplyr, ggplot2, and hands-on coding

Closing my tabs (Sep 12, 2025) (blog​.stephenturner​.us). Curated AI in academia, biosecurity, CRISPR horses, bioinformatics, bioacoustics, and tools like Podman, R/Quarto, Shiny, and DeepMind Perch 2.0

🧑‍💻 Positron, IDEs, and reproducible R development

Positron Assistant: GitHub Copilot and Claude-Powered Agentic Coding in R (gettinggeneticsdone​.blogspot​.com). Positron Assistant brings GitHub Copilot inline code and Claude-powered chat/agent mode to R in Positron for AI-assisted coding and package development

Repost: Make your development environment portable and reproducible (gettinggeneticsdone​.blogspot​.com). Strategies for portable, reproducible dev environments using dotfiles, scripts, VS Code dev containers, and selective tools across VMs and clouds

Make your development environment portable and reproducible (blog​.stephenturner​.us). Portable, reproducible dev environments using dotfiles, VM setup scripts, R verse packages, and VS Code dev containers

Positron for Product Owners (posit​.co). Positron enhances product ownership workflows with GitHub integration, Quarto planning, testing, multi-project management, and Python/R package management in a VSCode-like IDE

What’s New in Posit Package Manager: September 2025 (posit​.co). Posit Package Manager September 2025 adds SSO (OIDC), ARM64 Linux support, new Linux R repo URLs, and fine-grained CVE-based vulnerability blocking

2025-09-12 AI Newsletter (posit​.co). Posit introduces Positron editor for Python, R; covers RStudio/Jupyter/VS Code management, package repositories, Shiny sharing, Databricks integration, and AI/news updates including Anthropic settlement and agentic browsers

Deploy to Connect Cloud from Positron and VS Code (posit​.co). Deploy Python and R content to Connect Cloud from Positron or VS Code using the Posit Publisher extension

🎨 ggplot2, palettes, and R visualisation workflows

Tidy RNA sequencing visualization (cosminribo​.eu). Explores RNA-seq visualization with BigWig/WIG conversion, Gviz, tidyCoverage, ggtranscript, and patchwork for annotated PDFs

Améliorer la lisibilité des étiquettes de vos graphiques avec le package ggrepel (delladata​.fr). Utilise ggrepel pour repositionner et lisser les étiquettes dans ggplot2 avec geom_text_repel et geom_label_repel, et compare avec geom_text et geom_label

Linear cosine palettes (blog​.djnavarro​.net). Generative art in R: linear cosine palettes using a, b, c, d bases and a t parameter

Introducing The Art of Visualization with ggplot2 (nrennie​.rbind​.io). Nicola Rennie presents The Art of Visualization with ggplot2, a data-visualization case-study book with end-to-end R workflows and TidyTuesday-derived examples

ggplot2 4.0.0 (tidyverse​.org). ggplot2 4.0.0 release notes covering S7 migration, themes, scales, discrete positions, labeling, and new helpers

Recipe Complexity Correlates with Higher Ratings (stevenponce​.netlify​.app). Heat map analysis links recipe complexity (ingredient count and cooking time) to higher AllRecipes ratings using R tidyverse, tidytuesdayR, viridis, and ggplot2

Little useless-useful R functions – Markov babbler (tomaztsql​.wordpress​.com). R snippet builds Markov-like word transitions, tokenization, and ggplot visualization of word networks in R

📚 Academic Research

Memshare: Memory Sharing for Multicore Computation in R with an Application to Feature Selection by Mutual Information using PDE (arxiv:cs). Memshare enables shared memory multicore computation in R with ALTREP, showing 2x speedups and PDE-based mutual information feature selection

Network Meta-Analysis of survival outcomes with non-proportional hazards using flexible M-splines (arxiv:stat). Flexible NMA of survival with non-proportional hazards via M-splines and weighted random walk priors in multinma

👋 Before you go

I've got a big favor to ask - keeping Blaze running isn't expensive, but it does all add up, so I'm asking readers like you to help, if you can.
That's why I'm launching a Patreon page!. Nothing flashy, just a way for folks who find value in these newsletters to chip in a little each month. In return, you'll get:

  • Real say in how Blaze evolves — vote on new topics, features, topic curation ideas
  • First dibs on merch (details still cooking)
  • That warm fuzzy feeling knowing you're supporting something that saves you time and keeps you plugged into great tech writing

If you are getting value from blaze, checking this out would mean the world. And if you can't contribute, no worries—the newsletters keep coming either way, and you can follow along on patreon for free.
Thanks for reading and being part of this nerdy corner of the internet. All the best - Alastair.

You may also like

About Data Scientist (with R)

Our Data Scientist newsletter covers the latest developments, packages, techniques, and insights in R programming and data science. Each week, we curate the most important content from your favourite R blogs so you don't have to spend hours searching.

Whether you're a beginner or expert in data science with R, our newsletter provides valuable information to keep you informed.

Subscribe now to join thousands of professionals who receive our weekly updates!