Data Scientist (with R): 23rd September 2025
Published 23rd September 2025
📰 R community news, events, and Posit updates
Weekly recap (Sep 19, 2025) (blog.stephenturner.us). AI-generated genomes, biosafety, mirror life, AI in teaching, CRAN defense, Posit news, and OpenAI ChatGPT usage insights
Generative AI and R workshops in Hobart Australia (seascapemodels.org). Two-day in-person R and AI workshop in Hobart (Nov 11–12, 2025) covering AI-assisted coding, GLMs in R, with Chris Brown and Anthony Richardson
R Weekly 2025-W39 Parse RMarkdown, Vibe Code, Quality Control (rweekly.org). Weekly news in the R Community: Parse RMarkdown/Quarto, Vibe Code, quality control, and package updates
posit::conf(2025) - Atlanta, USA (ropensci.org). ROpenSci posit::conf(2025) in Atlanta features lightning talks and sessions on R ecosystem, Positron from RStudio, targets, LLM productivity, and community building
Positron at posit::conf(2025) Roundup (posit.co). Positron at posit::conf(2025) Roundup highlights Positron launch, Python/R integration, centralized management, package repositories, and Shiny apps via Posit platform
🛠️ Data engineering, cleaning, and reproducibility in R
Jumping in the Ducklake with nothing but R on (discindo.org). Explores using DuckLake with R and DuckDB to implement medallion-style pipelines (bronze/silver/gold), writing data, lineage tracking, and visualization
Research note on the Radical Right Research Robot (kai-arzheimer.com). Autonomous Radical Right Research Robot (RRRR) in R, posting references on radical right literature and shifting to Mastodon/Bluesky amid Twitter changes
All the Ways to Programmatically Edit or Parse R Markdown / Quarto Documents (ropensci.org). Overview of R Markdown/Quarto parsing and editing in R, highlighting tinkr, md4r, Pandoc, parseqmd, parsermd, lightparser, string utilities, and frontmatter handling
Easily clean up messy databases with fuzzy matching in R (storybench.org). Fuzzy matching in R with stringdist to clean messy session names on an IRE 2025 schedule
Bug affecting neonUtilities in latest RStudio version on Windows (neonscience.org). Windows users of neonUtilities advised not to update RStudio 2025.09 due to downloader package conflict
🎨 Visualization, ggplot2, and publication-ready tables in R
Aesthetics Evaluation Control in ggplot (jmsallan.netlify.app). A practical guide to controlling aesthetic evaluation in ggplot2 using after_stat and related stats
How many cars are there in Madison? (haraldkliems.netlify.app). Car ownership in Madison rises with households; ACS data visualized in R
Recreating APA Manual Table 7.12 in R with apa7 (wjschne.github.io). Recreating APA Table 7.12 in R using apa7, flextable, ftExtra, and tidyverse with simulated data and aov analyses
Recreating APA Manual Table 7.7 in R with apa7 (wjschne.github.io). Recreating APA Table 7.7 in R using apa7, flextable, ftExtra, tidyverse, and chi-square calculations
🗺️ Spatial data, mapping, and geospatial workflows in R
Artpack 0.2.0 (thetidytrekker.com). Geospatial point_in_polygon with sf, group tools, seq_bounce, resizer, set_brightness, set_saturation, and ggplot2 visualizations for artpack 0.2.0
Phoenician colonization (r.iresmi.net). Reconstructs Phoenician colonization data via CSVs, tidyverse in R, parzer, sf, and leaflet
Drop #713 (2025-09-19): AVAST ME HEARTIES! (dailydrop.hrbrmstr.dev). Using R and DuckDB to access and process Maritime Safety Information and piracy data from ASAM/GeoJSON sources
Connecting the dots with R (aliceinstatisticsland.wordpress.com). Live stream of Ihaka lecture on spatial statistics, spatstat, ppm(), and spatial modelling with R
🧠 Modeling packages, explainability, and applied analyses in R
Frequently Asked Questions (metafor-project.org). Overview of metafor package, validation, funding, citing, comparisons with other software, and technical details on I2/H2, R2, prediction intervals, transformations, and Mantel-Haenszel results
Replication Forensics: A Learning Experience for Students (svmiller.com). Replication forensics in R: exploring Benoit 1996 data, WEEDE.ASC handling, and converting to modern formats with dplyr, read_table, and read_file
Key improvements in shapviz and kernelshap (lorentzen.ch). Shapviz and kernelshap updates with GLM and XGBoost SHAP explanations and interactions
Chess Dreams and Breakthroughs: A Global Perspective (stevenponce.netlify.app). Global chess ratings analysis reveals patterns in activity, breakthroughs, and federations with FIDE data
📈 Statistical inference, probability, and causal analysis with R
T test in R (codingthepast.com). T test in R: perform t.test, bootstrap approach, Titanic data, and bootstrap-based p-values in R
Some notes on probability judgement (blog.djnavarro.net). Calibrations of human probability judgments using YouGov data; explains 21% trans figure via Tversky–Kahneman heuristics and simple error models in R
Causal Inference in R (lucymcgowan.com). Causal diagrams, propensity scores, and inverse probability weighting in R for causal questions using tidyverse tools
Generating Synthetic Data with R-vine Copulas using esgtoolkit in R (thierrymoudiki.github.io). Tutorial on generating synthetic data with R-vine copulas using esgtoolkit in R and RVineModel fitting
Week 3, days 5 and 6, plus a story about whale watching (causalinf.substack.com). Harvard lecturer blogs about classroom experiences, causal inference lessons, local Boston trip, motion sickness on a boat, and personal growth
Analysis of Sales Shift in Retail with Causal Impact: A Case Study at Carrefour (towardsdatascience.com). Causal Impact analyzes sales shift after product unavailability using Bayesian structural time-series with covariates and a synthetic control in Carrefour case study
Estimating rare proportions (statschat.org.nz). Estimating rare proportions and biases in small vs large proportions in surveys, with references to Navarro and Gelman
📚 Academic Research
Efficient and Accessible Discrete Choice Experiments: The DCEtool Package for R (arxiv:econ). DCEtool: an R package with a Shiny interface for efficient, accessible discrete choice design, decoding, and analysis
👋 Before you go
I've got a big favor to ask - keeping Blaze running isn't expensive, but it does all add up, so I'm asking readers like you to help, if you can.
That's why I'm launching a Patreon page!. Nothing flashy, just a way for folks who find value in these newsletters to chip in a little each month. In return, you'll get:
- Real say in how Blaze evolves — vote on new topics, features, topic curation ideas
- First dibs on merch (details still cooking)
- That warm fuzzy feeling knowing you're supporting something that saves you time and keeps you plugged into great tech writing
If you are getting value from blaze, checking this out would mean the world. And if you can't contribute, no worries—the newsletters keep coming either way, and you can follow along on patreon for free.
Thanks for reading and being part of this nerdy corner of the internet. All the best - Alastair.
You may also like
About Data Scientist (with R)
Our Data Scientist newsletter covers the latest developments, packages, techniques, and insights in R programming and data science. Each week, we curate the most important content from your favourite R blogs so you don't have to spend hours searching.
Whether you're a beginner or expert in data science with R, our newsletter provides valuable information to keep you informed.
Subscribe now to join thousands of professionals who receive our weekly updates!