๐Ÿ“Š

Data Scientist (with R): 27th May 2025

Newsletters sent once a week, unsubscribe anytime.

Published 27th May 2025

๐Ÿ“š Learning, Community & Technical Topics

Sensitivity to C math library and mingw-w64 v12 - part 2 (blog.r-project.org, 2025-05-21). Differences in C math library results on Windows with mingw-w64 v12 affect R packages; subnormal values and precision variances were observed on various Windows versions and configurations, impacting tests and numerical accuracy

data.table vs.ย base vs.ย dplyr (arelbundock.com, 2025-05-23). A comprehensive comparison between data manipulation operations in R using data.table, base R, and dplyr, demonstrating syntax for filtering, sorting, and advanced operations like joins and data reshaping

The 80/20 Guide to R You Wish You Read Years Ago (borkar.substack.com, 2025-05-22). This guide emphasizes small habits in R programming that lead to improved workflows, covering tools like Tidyverse, data.table, DuckDB, and concepts such as vectorization, functional programming, and parallelization

Forwards To Offer R Package Development Workshops Online (forwards.github.io, 2025-05-21). Forwards is offering online workshops on R package development, covering topics like structure, documentation, dependency management, error checking, and distribution on platforms such as CRAN and GitHub, starting June 2025

rOpenSci News Digest, May 2025 (ropensci.org, 2025-05-26). rOpenSci's May 2025 digest highlights calls for maintainers for pkgcheck, new software packages like forcis and promoutils, and updates on the Champions Program and Software Peer Review process

๐Ÿ”ง R Package Development & New Tools

Refactoring code with flir (etiennebacher.com, 2025-05-23). flir is a fast tool for detecting and automatically fixing bad practices in R code, enhancing readability and performance, akin to an extension of lintr but with broader capabilities in code refactoring

New data cleaning/preperation functions for the Diel.Niche R package (masonfidino.com, 2025-05-22). New data cleaning functions for the Diel.Niche R package include trim.time(), make.diel.bin.list(), and bin.diel.times(), aiding in the analysis of species diel phenotypes using camera trap data

mirai 2.3.0 (shikokuchuo.net, 2025-05-23). Mirai 2.3.0 enhances asynchronous evaluation in R with improved timeout management, serialization, network discovery, and support for HPC environments, cementing its role in modern computing and integration with tools like Shiny and purrr

A semi-threshold trait evolution model in phytools (blog.phytools.org, 2025-05-23). The semi-threshold trait evolution model in phytools uses a discretized diffusion approximation to analyze traits based on hidden continuous properties, showcased using the fitsemiThresh function for trait fitting and visualization

Introducing cheetahR: A Lightning-Fast HTML Table widget for R (cynkra.com, 2025-05-23). CheetahR is a high-performance HTML table widget for R designed for rapid rendering and customization, leveraging Cheetah Grid's capabilities for interactive data exploration and seamless Shiny integration

๐Ÿ’ป Web Development & Publishing

Repost: Writing a book with Quarto (gettinggeneticsdone.blogspot.com, 2025-05-23). Explore Quarto for publishing, a successor to RMarkdown enabling multi-format outputs like HTML and PDF. Learn how it transformed course materials into a polished e-book with interactive code blocks and various document types

Security blind spots in Shiny: why your app is more vulnerable than you think (rtask.thinkr.fr, 2025-05-21). Shiny developers face security vulnerabilities like XSS, command injection, and SQL injection, revealing potential risks in user input handling, code evaluation, and database queries, emphasizing the need for caution and protective measures

Shiny in Production 2025: Workshops (jumpingrivers.com, 2025-05-20). Shiny in Production 2025 offers workshops on end-to-end testing with Playwright, asynchronous Shiny, mapping with leaflet, and UI design using Figma, enabling hands-on learning for all skill levels

๐Ÿ“Š Data Visualization & Communication

Little useless-useful R functions โ€“ Absurd bias DAG with useless mental shortcuts (tomaztsql.wordpress.com, 2025-05-25). Explore cognitive biases using R to create a Directed Acyclic Graph (DAG) with Python tools like igraph and ggraph, visualizing irrational connections influenced by psychological effects and mental shortcuts

Custom PowerPoints Using {officer} (jumpingrivers.com, 2025-05-22). Utilize the officer R package for customizable PowerPoint presentations beyond Quartoโ€™s limits, enabling complete control over layouts and content insertion, from text to data visualizations using R scripts

A step-by-step chart makeover (nrennie.rbind.io, 2025-05-22). Nicola Rennie details a step-by-step process to improve a poorly designed chart, focusing on color choices, axis handling, and data limitations to effectively visualize income inequality across countries for 2013 and 2023

Engaging and effective data visualisations (nrennie.rbind.io, 2025-05-27). Nicola Rennie's talk highlights the importance of data visualisation, offering guidelines for better chart-making, showcasing the Royal Statistical Society's guide on best practices, and discussing examples of effective and ineffective visualisations

๐Ÿ“ˆ Statistical Modeling & Research

Random Effects Regression: A Mathematical Exposition (bas-m.netlify.app, 2025-05-23). An exposition on random effects regression explores its mathematical underpinnings, comparing it to OLS and fixed effects, emphasizing intra-class correlation coefficient, GLS estimation, and the relationship between random effects and fixed effects

easily computed marginal likelihoods for multivariate mixture models using the THAMES estimator (xianblog.wordpress.com, 2025-05-24). THAMES Monte Carlo method is adapted for marginal likelihoods in multivariate mixture models, addressing issues like label switching and computational complexity while critiquing existing methodologies and exploring the implications of model misspecification

Five essential models for data scientists in finance (posit.co, 2025-05-20). Explore essential models for data scientists in finance, utilizing tools like RStudio, Jupyter, and VS Code, while managing packages and sharing insights through Shiny applications and scalable repositories

Estimating Product-Level Price Elasticities Using Hierarchical Bayesian (towardsdatascience.com, 2025-05-23). Hierarchical Bayesian modeling estimates product-level price elasticities, allowing for granular demand analysis across categories, utilizing techniques like log-linear regression and Bayesian updating for enhanced predictive strength and minimization of standard errors

A Bayesian Approach for Inference on Mixed Graphical Models (arxiv:stat, 2025-05-21). A Bayesian pairwise graphical model estimates conditional independencies for mixed data, utilizing spike-and-slab priors and MCMC algorithms. Analysis reveals changes in associations from adolescents' eating disorder treatments

You may also like

About Data Scientist (with R)

Our Data Scientist newsletter covers the latest developments, packages, techniques, and insights in R programming and data science. Each week, we curate the most important content from your favourite R blogs so you don't have to spend hours searching.

Whether you're a beginner or expert in data science with R, our newsletter provides valuable information to keep you informed.

Subscribe now to join thousands of professionals who receive our weekly updates!