
2025-06-16 10:00:01
Tidy Modeling with R: #rstats #machinelearning
Tidy Modeling with R: #rstats #machinelearning
We're making hand-made buttons/pins for the @… booth at @…
Any ideas for good motifs anyone?
#rstats
Are you interested in how dependency-heavy your (or another) package is and why? #rstats
Beside the {report} package (yesterdays note) there are more tools in the easystats collection. #rstats
Add some swag to your ggplots, with fontawesome symbols and colors: #rstats
Using fonts in R graphics can be tricky at times. {showtext} aims to make it easier: #rstats
Extract tables from pdfs with {tabulapdf} #rstats #datasciece
Automatically describe data and models as text using the {report} package. #rstats
Sometimes (often) one ends up needing to run older versions of R using older versions of packages. Evercran might be just the tool to help with that: #RStats
Get 9-30x speed doing areal-weighted interpolation with my new {𝐝𝐮𝐜𝐤𝐬𝐟} #rstats package compared to {sf}/{areal}. Experimental, but tested against both {areal} and {sf}. https://github.com/e-kotov/ducksf . Despite…
Need some data to test a plot idea or algorithm? On #rstats #synthetic…
There are frameworks like {golem} and {rhino} to make shiny development more robust, but I like the concept of {shinytest2} in providing a testing framework for pure shiny. https://rstudio.github.io/shinytest2/index.html
{nplyr} has helper functions to work on nested dataframes: #rstats #datascience
Polars is a lightning fast DataFrame library/in-memory query engine with parallel execution and cache efficiency. And now you can use is with the tidyverse syntax: #rstats
Here's an ad that Google Opal created for #rstats #spanishoddata https://ropenspain.github.io/spanishod
Im using case_when() quite a lot, case_match() is new to me: #rstats
Here's an ad that Google Opal created for #rstats #spanishoddata https://ropenspain.github.io/spanishod
Highlight a certain aspect of your data in ggplot: #rstats
{piggyback} makes it easier to attach large files (e.g. input data) to code in github repos: #rstats
There are many situations were you need access to different R versions: rig is a way to manage them #rstats
If you use Quarto to make presentations for a professional setting, it is important to choose the right theme, e.g. #rstats
Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case https://cran.r-project.org/web/packages/metaDigitise/vignettes/metaDigitise.html might be the rescue.
TidyX: screencasts explaining different aspects of the R language and the coding process. #rstats
Not sure any longer which libraries your script actually needs? #rstats
📝🗃️ 𝗿𝗱𝗼𝗰𝗱𝘂𝗺𝗽: Dump ‘R’ Package Source, Documentation, and Vignettes into One File for use in LLMs #rstats #LLM is on CRAN https://www.ekotov.pro/rdocdum…
Find the best contrast between one color and a list of options, e.g. for labels in geom_tile: {prismatic::best_contrast()} https://emilhvitfeldt.github.io/prismatic/reference/best_contrast.html
Are you making slides with Quarto or R Markdown and need a timer e.g. for breaks or group work? There is the {countdown} package for you: #rstats
A pictures says more than 1000 words. How much more can an audio representation of your data tell you? #rstats
Getting started with Shiny to make interactive web-apps with R: #rstats
The {esquisse} package makes it easy to plot your data in different ways with a drag and drop interface: #rstats
Do you need better performance than what the standard #tidyverse functions have? {collapse} might be worth a look: https://sebkrantz.github.io/collapse/
The dbcooper package turns a database connection into a collection of functions. #rstats
{slider} helps with aggregation over (sliding) windows, both index and time period based: #rstats
Follow along when @… walks you through how she tackles a new dataset: https://www.youtube.com/c/JuliaSilge
Primer to get you started with Optimization and Mathematical Programming in R #rstats
Use multi level models with {parsnip}: http://multilevelmod.tidymodels.org/ #rstats #ML
A curated list of awesome tools to assist 📦 development in R programming language. #rstats #📦
The inner working of parquette/arrow data in R: #rstats
The functions in the {withr} package allow to change your environment temporarily. E.g. create a temp file for a {testthat} test and clean it up afterwards. #rstats
{dtrack} makes documentation of data wrangling part of the analysis and creates pretty flow charts: #rstats
The {purrr} package is a powerfull way to replace loops. The {furrr} package takes this approach one step further by parallel execution: #rstats
GitHub Actions for the R language: Makes automatic testing of your R package much easier and making sure your package works on different OS and R versions is a matter of just a few lines of yaml: #rstats
{annotater}: Annotate package load calls, so we can have an idea of the overall purpose of the libraries we’re loading: #rstats
Make sure your code follows a consitent style using the {lintr} package. #rstats
{spiralize} can be used to highlight cyclic data, e.g. multi year time series. #rstats
Lets be honest, we spend too much time cleaning data. {janitor} can help with that: #rstats
{testthat} is great for automatic testing. Here are some tricks for the heavy user: #rstats
{ivs} makes it easier to work with intervals: #rstats
Function-oriented Make-like declarative workflows for R #rstats
A template for data analysis projects structured as R packages (or not) #rstats #datascience
Cute comics of R functions by @…: https://allisonhorst.com/r-packages-functions
Do you (sometimes) use print() or message() for debugging your code? Next time you can use {icecream} instead: #rstats
I am more fluent in LaTeX than in plotmath expression. If you are the same, latex2exp will make your life easier. https://cran.r-project.org/web/packages/latex2exp/vignettes/using-latex2exp.html
Its good to have many tests in your R package, but it can be a pain to debug some failing tests when it happens. {lazytest} for the rescue: only rerun the failing tests, until they pass: #RStats
Friends Don't Let Friends Make Bad Graphs! Do you agree with the examples of bad graphs and the alternatives Chenxin Li (@chenxinli2.bsky.social) lists at https://github.com/cxli233/FriendsDontLetFriends
Easier debugging of piped analyses in R: https://github.com/MilesMcBain/breakerofchains by @…
If you set limits for a scale (e.g. x-axis) in ggplot, how would you like data outside of that range be handled? There is the oob parameter for that and a set of functions to use with it: https://scales.r-lib.org/reference/oob.html
{constructive} prints code that can be used to recreate R objects. Like dput, but better... #rstats