Tootfinder

Opt-in global Mastodon full text search. Join the index!

@datascience@genomic.social
2025-06-16 10:00:01

Tidy Modeling with R: #rstats #machinelearning

@EgorKotov@datasci.social
2025-06-16 08:10:44

🚀 𝘀𝗽𝗮𝗻𝗶𝘀𝗵𝗼𝗱𝗱𝗮𝘁𝗮 0.2.0 is here. As before, you are getting nicely formatted Open Mobility Big Data released by the Spanish Ministry of Transport and Sustainable Mobility (MITMS) in a reproducible way. #rstats #opendata

banner advertising key changes in spanishoddata 0.2.0
@HeidiSeibold@fosstodon.org
2025-07-15 13:58:24

We're making hand-made buttons/pins for the @… booth at @…
Any ideas for good motifs anyone?
#rstats

@datascience@genomic.social
2025-08-17 10:00:01

Are you interested in how dependency-heavy your (or another) package is and why? #rstats

@datascience@genomic.social
2025-07-16 10:00:01

Beside the {report} package (yesterdays note) there are more tools in the easystats collection. #rstats

@datascience@genomic.social
2025-05-17 10:00:01

Add some swag to your ggplots, with fontawesome symbols and colors: #rstats

@datascience@genomic.social
2025-08-16 10:00:01

Using fonts in R graphics can be tricky at times. {showtext} aims to make it easier: #rstats

@datascience@genomic.social
2025-06-15 10:00:01

Extract tables from pdfs with {tabulapdf} #rstats #datasciece

@datascience@genomic.social
2025-07-15 10:00:01

Automatically describe data and models as text using the {report} package. #rstats

@datascience@genomic.social
2025-06-17 10:00:01

Sometimes (often) one ends up needing to run older versions of R using older versions of packages. Evercran might be just the tool to help with that: #RStats

@EgorKotov@datasci.social
2025-08-10 23:26:51

Get 9-30x speed doing areal-weighted interpolation with my new {𝐝𝐮𝐜𝐤𝐬𝐟} #rstats package compared to {sf}/{areal}. Experimental, but tested against both {areal} and {sf}. github.com/e-kotov/ducksf . Despite…

Benchmark scatter plot comparing areal-weighted interpolation runtimes (seconds) vs. peak memory use (MB) for three R package backends. {ducksf} with DuckDB backend is fastest (8.3s, 240MB, bottom left, green). {geosareal} with GEOS backend is mid-range (35.2s, 623MB, center, blue). {areal} with sf backend is slowest (230.7s, 1546MB, top right, orange). Ducksf logo in top-left corner.
Benchmark scatter plot comparing areal-weighted interpolation runtimes (seconds) vs. peak memory use (MB) for three R package backends. {ducksf} with DuckDB backend is fastest (8.3s, 240MB, bottom left, green). {geosareal} with GEOS backend is mid-range (35.2s, 623MB, center, blue). {areal} with sf backend is slowest (230.7s, 1546MB, top right, orange). Ducksf logo in top-left corner.
Benchmark scatter plot comparing areal-weighted interpolation runtimes (seconds) vs. peak memory use (MB) for three R package backends. {ducksf} with DuckDB backend is fastest (30.6s, 240MB, bottom left, green). {geosareal} with GEOS backend is mid-range (35.9s, 623MB, center, blue). {areal} with sf backend is slowest (284.1s, 1581MB, top right, orange). Ducksf logo in top-left corner.
@datascience@genomic.social
2025-08-14 10:00:00

Need some data to test a plot idea or algorithm? On #rstats #synthetic

@datascience@genomic.social
2025-07-13 10:00:01

If you feel you should be reading more: #rstats #ebooks

@datascience@genomic.social
2025-07-17 10:00:01

There are frameworks like {golem} and {rhino} to make shiny development more robust, but I like the concept of {shinytest2} in providing a testing framework for pure shiny. rstudio.github.io/shinytest2/i

@datascience@genomic.social
2025-06-12 10:00:01

{nplyr} has helper functions to work on nested dataframes: #rstats #datascience

@datascience@genomic.social
2025-07-12 10:00:00

Polars is a lightning fast DataFrame library/in-memory query engine with parallel execution and cache efficiency. And now you can use is with the tidyverse syntax: #rstats

@EgorKotov@datasci.social
2025-07-27 12:50:35

Here's an ad that Google Opal created for #rstats #spanishoddata ropenspain.github.io/spanishod

@datascience@genomic.social
2025-07-11 10:00:01

{ggdist}: Visualizations of distributions and uncertainty #rstats #ggplot

@datascience@genomic.social
2025-06-11 10:00:01

Im using case_when() quite a lot, case_match() is new to me: #rstats

@EgorKotov@datasci.social
2025-07-27 12:50:35

Here's an ad that Google Opal created for #rstats #spanishoddata ropenspain.github.io/spanishod

@datascience@genomic.social
2025-06-10 10:00:01

Highlight a certain aspect of your data in ggplot: #rstats

@datascience@genomic.social
2025-07-10 10:00:01

{piggyback} makes it easier to attach large files (e.g. input data) to code in github repos: #rstats

@EgorKotov@datasci.social
2025-07-29 10:38:26

The cool things you sometimes find in the CRAN submission queue. #rstats

@datascience@genomic.social
2025-06-09 10:00:01

There are many situations were you need access to different R versions: rig is a way to manage them #rstats

@EgorKotov@datasci.social
2025-07-29 10:38:26

The cool things you sometimes find in the CRAN submission queue. #rstats

@datascience@genomic.social
2025-08-09 10:00:01

If you use Quarto to make presentations for a professional setting, it is important to choose the right theme, e.g. #rstats

@datascience@genomic.social
2025-08-13 10:00:01

Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case cran.r-project.org/web/package might be the rescue.

@datascience@genomic.social
2025-08-08 10:00:01

TidyX: screencasts explaining different aspects of the R language and the coding process. #rstats

@datascience@genomic.social
2025-06-08 10:00:01

Not sure any longer which libraries your script actually needs? #rstats

@datascience@genomic.social
2025-06-07 10:00:00

Beautiful palettes based on art for R and python: #rstats #ggplot

@EgorKotov@datasci.social
2025-06-18 16:12:16

📝🗃️ 𝗿𝗱𝗼𝗰𝗱𝘂𝗺𝗽: Dump ‘R’ Package Source, Documentation, and Vignettes into One File for use in LLMs #rstats #LLM is on CRAN ekotov.pro/rdocdum…

rdocdump
Get fresh package docs to pass to LLM
library(rdocdump)
rdd_to_txt(
pkg = "aws.s3"
output_file = "aws.s3.txt",
force_fetch = TRUE)
github.com/e-kotov/rdocdump
@datascience@genomic.social
2025-08-12 10:00:01

Find the best contrast between one color and a list of options, e.g. for labels in geom_tile: {prismatic::best_contrast()} emilhvitfeldt.github.io/prisma

@datascience@genomic.social
2025-06-06 10:00:01

Everything is a linear model: #rstats #stats

@datascience@genomic.social
2025-07-05 10:00:01

{ggbump} creates elegant bump charts in ggplot. #ggplot #rstats

@datascience@genomic.social
2025-08-05 10:00:01

Are you making slides with Quarto or R Markdown and need a timer e.g. for breaks or group work? There is the {countdown} package for you: #rstats

@datascience@genomic.social
2025-06-05 10:00:01

A pictures says more than 1000 words. How much more can an audio representation of your data tell you? #rstats

@datascience@genomic.social
2025-07-04 10:00:01

Getting started with Shiny to make interactive web-apps with R: #rstats

@datascience@genomic.social
2025-06-04 10:00:01

The {esquisse} package makes it easy to plot your data in different ways with a drag and drop interface: #rstats

@datascience@genomic.social
2025-08-11 10:00:00

This looks helpful: #rstats #optimization

@datascience@genomic.social
2025-08-03 10:00:01

Keynote from rstudio::conf 2022: The past and future of shiny. #rstats

@datascience@genomic.social
2025-08-10 10:00:01

Do you need better performance than what the standard #tidyverse functions have? {collapse} might be worth a look: sebkrantz.github.io/collapse/

@datascience@genomic.social
2025-06-02 10:00:00

Use cookies in shiny apps: #rstats #shiny

@datascience@genomic.social
2025-07-02 10:00:01

The dbcooper package turns a database connection into a collection of functions. #rstats

@datascience@genomic.social
2025-07-31 10:00:01

{slider} helps with aggregation over (sliding) windows, both index and time period based: #rstats

@datascience@genomic.social
2025-07-30 10:00:01

{lubridate} makes working with dates in R just that little bit easier: #rstats #dates

@datascience@genomic.social
2025-07-08 10:00:01

Follow along when @… walks you through how she tackles a new dataset: youtube.com/c/JuliaSilge

@datascience@genomic.social
2025-07-09 10:00:01

Primer to get you started with Optimization and Mathematical Programming in R #rstats

@datascience@genomic.social
2025-07-29 10:00:01

Keep track of the TODO notes in your code: #rstats #todo

@datascience@genomic.social
2025-05-28 10:00:01

Use multi level models with {parsnip}: multilevelmod.tidymodels.org/ #rstats #ML

@datascience@genomic.social
2025-06-27 10:00:01

paint() your data when you print them to make them easier to grasp: #rstats #console

@datascience@genomic.social
2025-07-28 10:00:01

A curated list of awesome tools to assist 📦 development in R programming language. #rstats #📦

@datascience@genomic.social
2025-05-27 10:00:02

The inner working of parquette/arrow data in R: #rstats

@datascience@genomic.social
2025-07-27 10:00:01

The functions in the {withr} package allow to change your environment temporarily. E.g. create a temp file for a {testthat} test and clean it up afterwards. #rstats

@datascience@genomic.social
2025-05-26 10:00:00

{dtrack} makes documentation of data wrangling part of the analysis and creates pretty flow charts: #rstats

@datascience@genomic.social
2025-07-26 10:00:01

The {purrr} package is a powerfull way to replace loops. The {furrr} package takes this approach one step further by parallel execution: #rstats

@datascience@genomic.social
2025-08-06 10:00:01

GitHub Actions for the R language: Makes automatic testing of your R package much easier and making sure your package works on different OS and R versions is a matter of just a few lines of yaml: #rstats

@datascience@genomic.social
2025-05-25 10:00:01

Make paint-by-number pictures in R: #rstats

@datascience@genomic.social
2025-06-24 10:00:00

{annotater}: Annotate package load calls, so we can have an idea of the overall purpose of the libraries we’re loading: #rstats

@datascience@genomic.social
2025-07-24 10:00:01

Make sure your code follows a consitent style using the {lintr} package. #rstats

@datascience@genomic.social
2025-06-22 10:00:01

{ggrain}: Raincloud plots for ggplot2: #rstats #dataviz

@datascience@genomic.social
2025-05-23 10:00:01

Handy RStudio addins for reshaping and navigating in code: #rstats #rstudio

@datascience@genomic.social
2025-07-06 10:00:16

{spiralize} can be used to highlight cyclic data, e.g. multi year time series. #rstats

@datascience@genomic.social
2025-07-22 10:00:02

r-charts.com provides example code for a variety of chart types, both in base R and ggplot: #rstats #ggplot

@datascience@genomic.social
2025-05-22 10:00:01

Lets be honest, we spend too much time cleaning data. {janitor} can help with that: #rstats

@datascience@genomic.social
2025-06-21 10:00:01

{testthat} is great for automatic testing. Here are some tricks for the heavy user: #rstats

@datascience@genomic.social
2025-06-19 10:00:01

{ivs} makes it easier to work with intervals: #rstats

@datascience@genomic.social
2025-07-19 10:00:01

Function-oriented Make-like declarative workflows for R #rstats

@datascience@genomic.social
2025-05-19 10:00:02

Visualize dependencies between functions: #rstats

@datascience@genomic.social
2025-05-18 10:00:01

Make fancy tables: #rstats #tables

@datascience@genomic.social
2025-07-18 10:00:01

A template for data analysis projects structured as R packages (or not) #rstats #datascience

@datascience@genomic.social
2025-06-01 10:00:01

Cute comics of R functions by @…: allisonhorst.com/r-packages-fu

@datascience@genomic.social
2025-08-01 10:00:01

Do you (sometimes) use print() or message() for debugging your code? Next time you can use {icecream} instead: #rstats

@datascience@genomic.social
2025-07-01 10:00:01

I am more fluent in LaTeX than in plotmath expression. If you are the same, latex2exp will make your life easier. cran.r-project.org/web/package

@datascience@genomic.social
2025-05-30 10:00:01

Its good to have many tests in your R package, but it can be a pain to debug some failing tests when it happens. {lazytest} for the rescue: only rerun the failing tests, until they pass: #RStats

@datascience@genomic.social
2025-05-24 10:00:01

Friends Don't Let Friends Make Bad Graphs! Do you agree with the examples of bad graphs and the alternatives Chenxin Li (@chenxinli2.bsky.social) lists at github.com/cxli233/FriendsDont

@datascience@genomic.social
2025-05-21 10:00:01

Easier debugging of piped analyses in R: github.com/MilesMcBain/breaker by @…

@datascience@genomic.social
2025-05-20 10:00:01

If you set limits for a scale (e.g. x-axis) in ggplot, how would you like data outside of that range be handled? There is the oob parameter for that and a set of functions to use with it: scales.r-lib.org/reference/oob

@datascience@genomic.social
2025-07-23 10:00:01

The {cli} package makes it easy to show pretty and informative messages to the user of your code. #rstats #terminal

@datascience@genomic.social
2025-07-20 10:00:01

{constructive} prints code that can be used to recreate R objects. Like dput, but better... #rstats