Data Scientist (with R): 20th May 2025
📢 Community & Announcements
15 New Books added to Big Book of R (oscarbaruffa.com, 2025-05-18). New additions to the Big Book of R include 6 English and 9 Portuguese titles covering ggplot2, forensic metascience, low-compute machine learning, DuckDB recipes, regression analysis, and Bayesian methods with hidden Markov models
Call For Talks (rinpharma.com, 2025-05-18). The call for talks for Virtual R/Pharma 2025 is now open, seeking engaging presentations on R in pharma, with submissions due by June 30. Interested speakers can receive free coaching
Bridges and Communities. My Journey as an rOpenSci Champion (ropensci.org, 2025-05-15). My journey as an rOpenSci Champion involved developing the R package ARcenso for accessing Argentina's census data, using tools like GitHub Actions and benefiting from mentorship and community support
Registration for posit::conf(2025)’s virtual experience is now open (posit.co, 2025-05-15). Registration for posit::conf(2025) is live, featuring virtual access to keynotes, talks, and community connections from September 16-18, highlighting tools like RStudio, Jupyter, and package management for Python and R
Drop #655 (2025-05-19): Just Another Manic Monday (dailydrop.hrbrmstr.dev, 2025-05-19). The latest Drop discusses 'rv' for R dependency management, and 'Jetrelay,' a high-performance relay for Bluesky's jetstream feed, emphasizing solution-focused questioning to improve team productivity
🎨 R Visualizations
spuriouscorrelations: An R package to show examples about spurious correlations (pacha.dev, 2025-05-17). The spuriouscorrelations R package demonstrates examples of false correlations, such as the link between pool drownings and Nicholas Cage films, using tools like dplyr and ggplot2 for data analysis and visualization
Animated population pyramids for the Pacific by @ellis2013nz (freerangestats.info, 2025-05-17). This post describes creating animated population pyramids for Pacific island countries from 1950 to 2050 using R, demonstrating the process with tidyverse and rsdmx libraries
Interactive charts in R and beyond (nrennie.rbind.io, 2025-05-15). Explore various methods for creating interactive charts utilizing R, tooltips, dropdown menus, and other tools, all browser-based without requiring a server, presented by data visualization specialist Nicola Rennie
🛠️ R Programming Tutorials
Mock Them All: Simulate to Better Test with testthat (rtask.thinkr.fr, 2025-05-15). Learn to use R's testthat package for mocking user interactions and simulating external resource testing, avoiding real API calls and time-consuming dialog boxes while enhancing automation in unit testing
How to use GitHub Codespaces to simplify your Quarto workshops (quarto.org, 2025-05-19). Use GitHub Codespaces to streamline Quarto workshops with a pre-configured cloud environment, enabling immediate onboarding, reproducible workflows, and consistent tools across participants' devices, simplifying local setup challenges
Introducing dockViewR 0.1.0: a layout manager for R and Shiny. (cynkra.com, 2025-05-15). dockViewR 0.1.0 is a new R and Shiny layout manager allowing customizable grid layouts, dynamic panel management, and support for multiple themes. It integrates with Shiny and R Markdown for enhanced user experience
How to Rename Columns in data.table in R (With Examples) (marsja.se, 2025-05-13). Learn how to rename columns in data.table using setnames() in R, including techniques for renaming by name, by position, and multiple columns simultaneously, enhancing your data organization and cleaning processes
Writing a book with Quarto (blog.stephenturner.us, 2025-05-19). Turn RMarkdown documents into a book with Quarto in under an hour. Utilize features like cross-referencing, interactive code blocks, and publish options for PDF and EPUB formats
🔬 Academic & Methodological
When good pseudorandom numbers go bad (blog.djnavarro.net, 2025-05-17). Multivariate normal sampling can yield unpredictable results due to floating point issues; even with set.seed() in R, reproducing results across machines can be challenging for practitioners
From Complete Separation To Maximum Likelihood Estimation in Logistic Regresion: A Note To Myself (kenkoonwong.com, 2025-05-17). Logistic regression can face issues with complete separation, leading to extreme standard errors. Understanding maximum likelihood estimation through calculus concepts like the chain and quotient rules is crucial for accurate modeling
Notes on DEseq2 design (onetipperday.blogspot.com, 2025-05-13). Notes on DEseq2 design highlight the importance of modeling subjects as random effects while applying likelihood ratio tests. Key design formulas include factors like age, sex, group, and time
A bounded continuous character evolution model with absorbing boundaries for phytools (blog.phytools.org, 2025-05-13). A bounded continuous character evolution model is introduced for phylogenetic analysis, featuring absorbing boundaries and utilizing the discretized diffusion approximation for simulating evolutionary processes and fitting multi-rate models
Tracking animals with particles (methodsblog.com, 2025-05-15). Research on tracking the Critically Endangered flapper skate in Scotland employs acoustic telemetry and advanced statistical models, including particle filters and smoothing techniques, to estimate animal locations and inform conservation efforts
Spatial Resampling in Predictive Modelling (jmsallan.netlify.app, 2025-05-15). Explore spatial resampling in predictive modeling using tidymodels and spatialsample, demonstrated through the cat_adoption dataset to predict rescue cat outcomes based on geographic attributes
Cost-Effectiveness Analysis (rworks.dev, 2025-05-19). Cost-effectiveness analysis is demonstrated using Bayesian models in JAGS, evaluating prophylactic antibiotics for cesarean sections, highlighting deterministic and stochastic decision frameworks, ICER calculations, and relevant healthcare economic concepts
You may also like
About Data Scientist (with R)
Our Data Scientist newsletter covers the latest developments, packages, techniques, and insights in R programming and data science. Each week, we curate the most important content from your favourite R blogs so you don't have to spend hours searching.
Whether you're a beginner or expert in data science with R, our newsletter provides valuable information to keep you informed.
Subscribe now to join thousands of professionals who receive our weekly updates!