Tootfinder

Opt-in global Mastodon full text search. Join the index!

@frankel@mastodon.top
2025-11-03 17:23:00

Your data, their rules
blog.42futures.com/p/your-data

@Mediagazer@mstdn.social
2025-12-04 13:55:51

The Tow Center releases a tracker that monitors lawsuits, deals, grants, and other developments between news publishers and AI companies (Klaudia Jaźwińska/Columbia Journalism Review)
cjr.org/analysis/lawsuit-licen

@UP8@mastodon.social
2025-12-04 18:24:54

🚜 California farmland doused with 2.5 million pounds of PFAS pesticides each year, analysis finds
thenewlede.org/2025/11/califor

The government’s biggest coal sales in more than a decade are coming in a few days,
offering 600 million tons from publicly owned reserves next to strip mines in Montana and Wyoming.
The sales are a signature piece of Trump’s ambitions for companies to dig more coal from federal lands and burn it for electricity.
Yet most power plants served by those mines plan to quit burning coal altogether within 10 years, an Associated Press data analysis shows.
Three other mines po…

@Techmeme@techhub.social
2025-12-03 06:40:52

An analysis of Waymo's data covering ~100M driverless miles across four US cities: Waymo cars have far lower crash rates per million miles than human drivers (Jonathan Slotkin/New York Times)
nytimes.com/2025/12/02/o…

@elduvelle@neuromatch.social
2025-12-04 17:08:40

Hi #Linux team - any recommendations for a work desktop computer with the following requirements:

  • can do basic research stuff (reading, writing) and also a little bit of basic data analysis (with python or Matlab)
  • would run a distribution like #ZorinOS or Mint
  • with a Max…

@adulau@infosec.exchange
2025-12-02 21:19:30

End-of-Year Threat Intelligence Sightings Forecast
This report presents an analysis of Threat Intelligence (TI) Sightings aggregated from several key data sources, including social platforms, code repositories, and specialized TI feeds. The primary objective is to visually track historical trends per source and provide a short-term adaptive forecast for a defined period (in days).
#opensource

@cjust@infosec.exchange
2025-11-02 01:42:27

#ShamelesslyStolenFromBlueSky

Jess Calarco @jessica... 22h
We have progressed from data collection to data analysis.

@Techmeme@techhub.social
2025-12-01 14:50:36

An analysis of Crunchbase and PitchBook data: in 2025 so far, 80 tech startups reached $1B valuations, many of them focused on AI with exceptions like Kalshi (Dominic-Madori Davis/TechCrunch)
techcrunch.com/2025/12/01/at-l

@krispijn@social.sargasso.nl
2025-11-29 10:11:01

Revealed: Europe’s water reserves drying up due to climate breakdown.
“When we compare the total terrestrial water storage data with climate datasets, the trends broadly correlate,” said Mohammad Shamsudduha, professor of water crisis and risk reduction at UCL.

@Dragofix@veganism.social
2025-12-31 22:55:23

Regional temperature records broken across the world in 2025 #environment

@Techmeme@techhub.social
2025-10-28 12:40:47

An analysis of 30 US data center proposals in 14 states: in most cases, local officials signed NDAs and worked with apparent shell companies to hide details (Natalie Kainz/NBC News)
nbcnews.com/tech/tech-news/dat

@benb@osintua.eu
2025-12-11 16:44:27

Analysis: New Data Suggests Russia Is Sustaining Mi-8 Output Despite Wartime Losses: benborges.xyz/2025/12/11/analy

ICE shift in tactics leads to soaring number of unjustifiable arrests
Government data shows that more than 60 percent of the people detained in at-large arrests since June did NOT have criminal convictions or pending charges,
even as authorities insist that immigration officers are focusing on violent criminals whom they describe as “the worst of the worst.”

@UP8@mastodon.social
2025-10-27 17:13:05

🕶️ Community Analysis of Social Virtual Reality Based on Large-Scale Log Data of a Commercial Metaverse Platform
#vr

Two small figures showing an empty lobby in a VR game and a very full event space.

@nemobis@mamot.fr
2025-10-28 14:10:50

"This paper presents a comprehensive scientometric analysis of the long-term impact of [event] on the nation scientific development."
*oh, interesting!*
"Using Scopus-indexed data..."
*closes tab*

@underdarkGIS@fosstodon.org
2025-11-26 15:55:00

@… following up on our chat at #SDSL2025, I finally found some time to see how a #QGIS Processing Algorithm Provider plugin can be unit tested. Here's what I've …

@felwert@fedihum.org
2025-11-27 10:34:31

Teaching students simple #WebScraping was always quite rewarding. It opens up numerous relevant, real-world data sources that are the foundation for any further analysis. Things already got more complicated with dynamic content loading, but now bot-exclusion-mechanisms make it almost impossible in many cases. Is web scraping for the

@Mediagazer@mstdn.social
2025-10-31 23:21:11

An analysis of AI training datasets, compiled by The Atlantic, shows AI models were trained on hundreds of thousands of YouTube videos from news publishers (Andrew Deck/Nieman Lab)
niemanlab.org/2025/10/hundred…

@datascience@genomic.social
2025-11-10 11:00:00

A template for data analysis projects structured as R packages (or not) github.com/Pakillo/template by @…

@cheeaun@mastodon.social
2025-10-27 04:06:38

RE: mastodon.social/@cheeaun/11541
After looking at this, got curious to know the limits in most servers.
So I did a little data analysis. Servers list from @…

Chart titled "Image Matrix Limits" showing a table that lists matrix MP values (2, 17, 33, 38–195) with counts and percentages, and a bar graph: 33 MP dominates (2145, 93.50%), followed by 17 MP (139, 6.06%) and small entries for 2 MP and 38–195 MP.
Chart titled "Image Size Limits" showing counts and percentages of image sizes (MB) with a horizontal bar graph. The 16 MB row dominates (2,081 items, 90.71%) while other size buckets (4–5, 8, 10, 15, 19, 20, 24–32, 38–48, 50–99, 100–1354 MB) show much smaller counts and percentages.
Chart titled "Video Matrix Limits" showing matrix sizes 2MP (138, 6.02%), 8MP (2149, 93.68%) and 9–36MP (7, 0.31%) with horizontal bar graph.
Chart titled "Video Size Limits" showing size bins (10–20, 40, 50–80, 86–98, 99, 100, 128–160, 200, 250–800, 990–2048 MB) with counts and percentages; the 99 MB row dominates with count 2086 (90.93%).

@arXiv_csDB_bot@mastoxiv.page
2025-10-15 07:33:51

Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis
Meihui Zhang, Liming Wang, Chi Zhang, Zhaojing Luo
arxiv.org/abs/2510.12642

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 11:13:29

Synthetic Series-Symbol Data Generation for Time Series Foundation Models
Wenxuan Wang, Kai Wu, Yujian Betterest Li, Dan Wang, Xiaoyu Zhang
arxiv.org/abs/2510.08445

@ErikJonker@mastodon.social
2025-12-24 15:59:00

Inside the deportation machine (giftlink)
nytimes.com/interactive/2025/1

@arXiv_csCY_bot@mastoxiv.page
2025-10-14 09:58:18

Mapping the Urban Mobility Intelligence Frontier: A Scientometric Analysis of Data-Driven Pedestrian Trajectory Prediction and Simulation
Junhao Xu, Hui Zeng
arxiv.org/abs/2510.10327

@scott@carfree.city
2025-11-22 05:31:43

This is a good start but the subway should curve south down 19th Ave, meet up with Daly City BART and continue on the BART tracks down to Millbrae. That part is essential; a branch to Outer Richmond could be added later as a nice-to-have.
musubi3.github.io/sfmta-geary-

@arXiv_statML_bot@mastoxiv.page
2025-10-10 09:27:09

High-dimensional Analysis of Synthetic Data Selection
Parham Rezaei, Filip Kovacevic, Francesco Locatello, Marco Mondelli
arxiv.org/abs/2510.08123

@arXiv_csCR_bot@mastoxiv.page
2025-10-13 09:50:20

The Data Enclave Advantage: A New Paradigm for Least-Privileged Data Access in a Zero-Trust World
Nico Bistolfi, Andreea Georgescu, Dave Hodson
arxiv.org/abs/2510.09494

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 13:45:28

MS-Mix: Unveiling the Power of Mixup for Multimodal Sentiment Analysis
Hongyu Zhu, Lin Chen, Mounim A. El-Yacoubi, Mingsheng Shang
arxiv.org/abs/2510.11579

Geospatial Reasoning fuses weather, satellite and population
data with Gemini AI for risk analysis
It runs in Google's Trusted Tester program for early access
(That ain't you)
testingcatalog.com/google-laun

@arXiv_eessSY_bot@mastoxiv.page
2025-10-07 11:23:42

PowerPlots: An Open Source Power Grid Visualization and Data Analysis Framework for Academic Research
Noah Rhodes
arxiv.org/abs/2510.05063

@bthalpin@mastodon.social
2025-11-07 13:20:47

If there is any truth in these allegations, we really have to worry about what is going on in both AGS and GSOC.
The initial story was bad enough: the Irish police service being so incompetent that their statistics on homicides were wildly incorrect, and the whistleblower getting penalised; but this is batshit.

@Dragofix@veganism.social
2025-10-28 02:48:41

Most Cambodia & Laos tree cover loss in 2024 happened inside protected areas news.mongabay.com/short-articl

@arXiv_astrophCO_bot@mastoxiv.page
2025-10-14 10:28:38

Updated constraints on interacting dark energy: A comprehensive analysis using multiple CMB probes, DESI DR2, and supernovae observations
Tian-Nuo Li, Guo-Hong Du, Yun-He Li, Yichao Li, Jia-Le Ling, Jing-Fei Zhang, Xin Zhang
arxiv.org/abs/2510.11363

@arXiv_csCE_bot@mastoxiv.page
2025-10-14 09:28:48

Multi-Physics-Enhanced Bayesian Inverse Analysis: Information Gain from Additional Fields
Lea J. Haeusel, Jonas Nitzler, Lea J. Köglmeier, Wolfgang A. Wall
arxiv.org/abs/2510.11095

@metacurity@infosec.exchange
2025-10-21 13:54:28

Dataminr to acquire cybersecurity firm ThreatConnect for $290 million
cyberscoop.com/dataminr-threat

@arXiv_csAI_bot@mastoxiv.page
2025-10-06 09:55:29

CoDA: Agentic Systems for Collaborative Data Visualization
Zichen Chen, Jiefeng Chen, Sercan Ö. Arik, Misha Sra, Tomas Pfister, Jinsung Yoon
arxiv.org/abs/2510.03194

@arXiv_statME_bot@mastoxiv.page
2025-10-15 08:38:22

Monitoring 3D Lattice Structures in Additive Manufacturing Using Topological Data Analysis
Yulin An, Xueqi Zhao, Enrique del Castillo
arxiv.org/abs/2510.11740

@arXiv_hepth_bot@mastoxiv.page
2025-10-08 09:40:39

Exploring Quantum Spacetime with Topological Data Analysis
J. van der Duin, R. Loll, M. Schiffer, A. Silva
arxiv.org/abs/2510.05693 arxiv.o…

@arXiv_hepph_bot@mastoxiv.page
2025-10-15 09:33:21

First simultaneous global QCD analysis of kaon and pion parton distributions with lattice QCD constraints
P. C. Barry, Chueng-Ryong Ji, W. Melnitchouk, N. Sato, Fernanda Steffens
arxiv.org/abs/2510.11979

@arXiv_grqc_bot@mastoxiv.page
2025-10-09 08:33:40

The impact of missing data on the construction of LISA Time Delay Interferometry Michelson variables
Ollie Burke, Martina Muratore, Graham Woan
arxiv.org/abs/2510.06406

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 10:59:29

SenWave: A Fine-Grained Multi-Language Sentiment Analysis Dataset Sourced from COVID-19 Tweets
Qiang Yang, Xiuying Chen, Changsheng Ma, Rui Yin, Xin Gao, Xiangliang Zhang
arxiv.org/abs/2510.08214

@Techmeme@techhub.social
2025-12-16 11:05:52

Analysis: of the 8,808 global data centers in October 2025, ~7,000 are in areas outside the optimal 18C to 27C temperature range; 600 are in areas above 27C (Rest of World)
restofworld.org/2025/data-cent

@jlpiraux@wallonie-bruxelles.social
2025-12-06 08:03:04

98% of Gaza's fruit and olive trees have been destroyed. 90% of greenhouses are damaged and 75% destroyed, according to an analysis of satellite imagery.
zmescience.com/science/news-sc

@cwilcke@bildung.social
2025-12-23 15:16:26

#nytimes #report #usapol
"Inside the Deportation Machine"
-
"At least 32 people have died in ICE custody since Mr. Trump took office, more than the number in Mr. Biden’s entire…

@arXiv_csDC_bot@mastoxiv.page
2025-10-06 08:15:29

Energy Efficiency in Cloud-Based Big Data Processing for Earth Observation: Gap Analysis and Future Directions
Adhitya Bhawiyuga, Serkan Girgin, Rolf A. de By, Raul Zurita-Milla
arxiv.org/abs/2510.02882

@gla@mastodon.social
2025-10-17 07:04:36

Claude skills are a big deal™️
Thanks to skills, you can reduce your multi-agent setup to a single agent with skills, greatly reducing complexity and increasing speed of execution.
In fact, if in the past you could have a number of agents each specialized in, for example, data analysis, getting data from a particular set of websites, making that data available in a dashboard, etc., with skills you can substitute all these agents with skills. (1/2)

@peter_mcmahan@mas.to
2025-12-10 16:58:59

I rewrote a data analysis pipeline, moving it from #python to #julialang . I am now in love with the threading support in Julia.
The task is very parallelizable but each thread needs random read access to a tens-of-GB dataset. In Python (with multiprocessing, shared stores, etc) data bookkeeping was a nightmar…

A screenshot of a part of one row from `top` showing a julia process using 4388% CPU and 51% memory, with a running time of 3 weeks.

@arXiv_econEM_bot@mastoxiv.page
2025-10-13 07:52:40

Sensitivity Analysis for Causal ML: A Use Case at Booking.com
Philipp Bach, Victor Chernozhukov, Carlos Cinelli, Lin Jia, Sven Klaassen, Nils Skotara, Martin Spindler
arxiv.org/abs/2510.09109

@thek3nger@mastodon.social
2025-11-20 08:58:16

There is enough data to start publishing reports of my statistical analysis of the Italian Volleyball Serie A1 championship.
davideaversa.it/experiment/vol

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-14 15:02:51

Crosslisted article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Ensemble-Based Data Assimilation for Material Model Characterization in High-Velocity Impact
Rong Jin, Guangyao Wang, Xingsheng Sun

@grumpybozo@toad.social
2025-10-17 16:32:24

Kayak: in principle, an application that may be well-served by "old" rules-based AI. Its function is supposed to be deterministic, needing more data ingestion & analysis than humans can tolerate.
But if they're calling it "AI" today, I'm sure it's an LLM/neural net gadget which will hallucinate flights & fares. Because we're playing out the theory that the #XRisk

@arXiv_astrophIM_bot@mastoxiv.page
2025-10-10 09:09:39

Photometric Redshift Estimation for Rubin Observatory Data Preview 1 with Redshift Assessment Infrastructure Layers (RAIL)
T. Zhang, E. Charles, J. F. Crenshaw, S. J. Schmidt, P. Adari, J. Gschwend, S. Mau, B. Andrews, E. Aubourg, Y. Bains, K. Bechtol, A. Boucaud, D. Boutigny, P. Burchat, J. Chevalier, J. Chiang, H. -F. Chiang, D. Clowe, J. Cohen-Tanugi, C. Combet, A. Connolly, S. Dagoret-Campagne, P. N. Daly, F. Daruich, G. Daubard, J. De Vicente, H. Drass, K. Fanning, E. Gawiser, M. …

@arXiv_csHC_bot@mastoxiv.page
2025-10-10 08:56:29

Sentiment Matters: An Analysis of 200 Human-SAV Interactions
Lirui Guo, Michael G. Burke, Wynita M. Griggs
arxiv.org/abs/2510.08202 arxiv.o…

@arXiv_statOT_bot@mastoxiv.page
2025-10-08 08:12:09

Missing Data Imputation in the Context of Propensity Score Analysis: A Systematic Review
Saghar Garayemi, Reza Ali Akbari Khoei, Sarah Friedrich
arxiv.org/abs/2510.05857

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 10:44:31

Evaluation of Real-Time Preprocessing Methods in AI-Based ECG Signal Analysis
Jasmin Freudenberg, Kai Hahn, Christian Weber, Madjid Fathi
arxiv.org/abs/2510.12541

@cosmos4u@scicomm.xyz
2025-11-11 22:34:26

Joint neutrino oscillation analysis from the T2K and NOvA experiments: #neutrinos may hold the keys to why we exist: eurekalert.org/news-releases/1 - MSU scientists help merge data from two neutrino experiments to offer most precise look yet at elusive particles.

@arXiv_qfinST_bot@mastoxiv.page
2025-10-14 08:42:18

The Pitfalls of Continuous Heavy-Tailed Distributions in High-Frequency Data Analysis
Vladimír Holý
arxiv.org/abs/2510.09785 ar…

@arXiv_csSE_bot@mastoxiv.page
2025-10-07 11:29:12

RevMine: An LLM-Assisted Tool for Code Review Mining and Analysis Across Git Platforms
Samah Kansab, Francis Bordeleau, Ali Tizghadam
arxiv.org/abs/2510.04796

@Techmeme@techhub.social
2025-11-27 15:55:40

Analysis: since 2023, data center power demands have delayed 15 coal plants' retirements; the Trump administration has ordered two power plants to remain open (Ariel Wittenberg/Politico)
politico.com/news/2025/11/27/a

@arXiv_astrophHE_bot@mastoxiv.page
2025-10-06 09:21:09

Analysis of the Supernova Remnant IC 443 using H.E.S.S. Data
Alison M. W. Mitchell (for the H.E.S.S. Collaboration), Lukas Grosspietsch (for the H.E.S.S. Collaboration), Tina Wach (for the H.E.S.S. Collaboration)
arxiv.org/abs/2510.02843

@arXiv_eessIV_bot@mastoxiv.page
2025-10-08 09:24:29

Adapting HFMCA to Graph Data: Self-Supervised Learning for Generalizable fMRI Representations
Jakub Frac, Alexander Schmatz, Qiang Li, Guido Van Wingen, Shujian Yu
arxiv.org/abs/2510.05177

@Sustainable2050@mastodon.energy
2025-12-06 07:51:31

“This seems the best bang for your buck; it’s less per year than private school,” said the future mother.
UK IVF couples use legal loophole to rank embryos based on potential IQ, height and health
theguar…

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 07:53:01

Data or Language Supervision: What Makes CLIP Better than DINO?
Yiming Liu, Yuhui Zhang, Dhruba Ghosh, Ludwig Schmidt, Serena Yeung-Levy
arxiv.org/abs/2510.11835

@arXiv_csCR_bot@mastoxiv.page
2025-10-15 09:58:32

PromptLocate: Localizing Prompt Injection Attacks
Yuqi Jia, Yupei Liu, Zedian Shao, Jinyuan Jia, Neil Gong
arxiv.org/abs/2510.12252 arxiv.o…

@arXiv_statME_bot@mastoxiv.page
2025-10-14 10:50:48

Developing an information criterion for spatial data analysis through Bayesian generalized fused lasso
Yuko Kakikawa, Yoshiyuki Ninomiya
arxiv.org/abs/2510.11172

@UP8@mastodon.social
2025-11-24 15:50:23

👨🏿‍🌾 Traces of old farm chemicals contaminate water across the US
#chemicals

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 09:27:52

BeSTAD: Behavior-Aware Spatio-Temporal Anomaly Detection for Human Mobility Data
Junyi Xie, Jina Kim, Yao-Yi Chiang, Lingyi Zhao, Khurram Shafique
arxiv.org/abs/2510.12076

@arXiv_csDB_bot@mastoxiv.page
2025-10-09 07:35:50

Bridging Imperative Process Models and Process Data Queries-Translation and Relaxation
Abdur Rehman Anwar Qureshi, Adrian Rebmann, Timotheus Kampik, Matthias Weidlich, Mathias Weske
arxiv.org/abs/2510.06414

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:48:41

Cost Analysis of Human-corrected Transcription for Predominately Oral Languages
Yacouba Diarra, Nouhoum Souleymane Coulibaly, Michael Leventhal
arxiv.org/abs/2510.12781

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-15 12:44:59

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Maximum entropy temporal networks
Paolo Barucca

@Techmeme@techhub.social
2025-12-20 07:35:55

A look at Meta's 2GW Hyperion data center in Louisiana, with the first phase opening in 2028; an analysis shows sales tax breaks on GPUs could total $3.3B (Jon Keegan/Sherwood News)
sherwood.news/tech/hyperion/

@arXiv_hepph_bot@mastoxiv.page
2025-10-08 09:45:09

nCTEQ global analysis of nuclear PDFs
M. Klasen
arxiv.org/abs/2510.05880 arxiv.org/pdf/2510.05880

@arXiv_statML_bot@mastoxiv.page
2025-10-07 09:04:12

Transformed $\ell_1$ Regularizations for Robust Principal Component Analysis: Toward a Fine-Grained Understanding
Kun Zhao, Haoke Zhang, Jiayi Wang, Yifei Lou
arxiv.org/abs/2510.03624

@arXiv_csCY_bot@mastoxiv.page
2025-10-15 07:58:21

The Adoption Paradox: A Comparative Analysis of Veterinary AI Adoption in China and the North America
Shumin Li, Xiaoyun Lai
arxiv.org/abs/2510.11758

@elduvelle@neuromatch.social
2025-12-12 13:37:50

Between #Matlab and #Python, which one would you recommend to learn, for a student who wants to learn programming (from scratch) to do data analysis? And why?
I am conflicted because I think Matlab is maybe slightly more straightforward to learn, but Python should be more useful in the long …

@arXiv_eessSY_bot@mastoxiv.page
2025-10-07 10:09:42

Data-driven Practical Stabilization of Nonlinear Systems via Chain Policies: Sample Complexity and Incremental Learning
Roy Siegelmann, Enrique Mallada
arxiv.org/abs/2510.03982

@arXiv_csCR_bot@mastoxiv.page
2025-10-08 10:12:29

"Your Doctor is Spying on You": An Analysis of Data Practices in Mobile Healthcare Applications
Luke Stevenson, Sanchari Das
arxiv.org/abs/2510.06015

@arXiv_statME_bot@mastoxiv.page
2025-10-15 10:01:21

The $\alpha$--regression for compositional data: a unified framework for standard, spatially-lagged, and geographically-weighted regression models
Michail Tsagris
arxiv.org/abs/2510.12663

@Techmeme@techhub.social
2025-12-24 19:36:08

Analysis: Oracle has moved $66B of debt for building AI data centers off its balance sheet using SPVs; Meta has moved $30B, xAI moved $20B, and CoreWeave $2.6B (Tabby Kinder/Financial Times)
ft.com/content/0ae9d6cd-6b94-4

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 10:33:19

TelecomTS: A Multi-Modal Observability Dataset for Time Series and Language Analysis
Austin Feng, Andreas Varvarigos, Ioannis Panitsas, Daniela Fernandez, Jinbiao Wei, Yuwei Guo, Jialin Chen, Ali Maatouk, Leandros Tassiulas, Rex Ying
arxiv.org/abs/2510.06063

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-15 11:23:03

Crosslisted article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- A mathematical theory for understanding when abstract representations emerge in neural networks
Bin Wang, W. Jeffrey Johnston, Stefano Fusi

@arXiv_csDB_bot@mastoxiv.page
2025-10-13 07:34:00

Comparative Performance Analysis of Modern NoSQL Data Technologies: Redis, Aerospike, and Dragonfly
Deep Bodra, Sushil Khairnar
arxiv.org/abs/2510.08863

@Dragofix@veganism.social
2025-10-08 23:26:08

Protected areas hit hard as Mekong countries’ forest cover shrank in 2024 news.mongabay.com/2025/10/prot

@Mediagazer@mstdn.social
2025-10-11 12:10:49

Parrot Analytics: movies made up ~50% of streaming revenue across Netflix, Prime Video, Disney+, and WBD in 2024, up from ~27% in 2022, based on engagement data (Alejandro Rojas/The Hollywood Reporter)
hollywoodreporter.com/business

@arXiv_csLG_bot@mastoxiv.page
2025-12-22 10:32:30

You Only Train Once: Differentiable Subset Selection for Omics Data
Daphné Chopard, Jorge da Silva Gonçalves, Irene Cannistraci, Thomas M. Sutter, Julia E. Vogt
arxiv.org/abs/2512.17678 arxiv.org/pdf/2512.17678 arxiv.org/html/2512.17678
arXiv:2512.17678v1 Announce Type: new
Abstract: Selecting compact and informative gene subsets from single-cell transcriptomic data is essential for biomarker discovery, improving interpretability, and cost-effective profiling. However, most existing feature selection approaches either operate as multi-stage pipelines or rely on post hoc feature attribution, making selection and prediction weakly coupled. In this work, we present YOTO (you only train once), an end-to-end framework that jointly identifies discrete gene subsets and performs prediction within a single differentiable architecture. In our model, the prediction task directly guides which genes are selected, while the learned subsets, in turn, shape the predictive representation. This closed feedback loop enables the model to iteratively refine both what it selects and how it predicts during training. Unlike existing approaches, YOTO enforces sparsity so that only the selected genes contribute to inference, eliminating the need to train additional downstream classifiers. Through a multi-task learning design, the model learns shared representations across related objectives, allowing partially labeled datasets to inform one another, and discovering gene subsets that generalize across tasks without additional training steps. We evaluate YOTO on two representative single-cell RNA-seq datasets, showing that it consistently outperforms state-of-the-art baselines. These results demonstrate that sparse, end-to-end, multi-task gene subset selection improves predictive performance and yields compact and meaningful gene subsets, advancing biomarker discovery and single-cell analysis.

As Cyber Threats Escalate, the National Vulnerability Database Is Falling Behind
The National Institute of Standards and Technology (NIST) is struggling.
It faces a growing backlog to process data in its vulnerability repository, which publicly shares information assessing and detailing mitigation solutions against new cyber exploits.
With nearly 1,800 new reported vulnerabilities sitting in a queue for analysis this year, delays in processing leave the United States increa…

@arXiv_statME_bot@mastoxiv.page
2025-10-07 10:34:52

Two new approaches to multiple canonical correlation analysis for repeated measures data
Tomasz Górecki, Mirosław Krzyśko, Felix Gnettner, Piotr Kokoszka
arxiv.org/abs/2510.04457

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 10:17:39

Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis
Sedat Dogan, Nina Dethlefs, Debarati Chakraborty
arxiv.org/abs/2510.05761

@Techmeme@techhub.social
2025-11-12 12:36:00

An analysis of 47,000 publicly shared ChatGPT conversations: ~10% related to emotional or mental health, ChatGPT exhibits a "default to yes" behavior, and more (Washington Post)
washingtonpost.com/technology/

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-14 18:46:57

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Universal emergence of local Zipf-Mandelbrot law
Davide Cugini, André Timpanaro, Giacomo Livan, Giacomo Guarnieri

@arXiv_csDB_bot@mastoxiv.page
2025-10-14 07:40:01

Efficient Mining of Low-Utility Sequential Patterns
Jian Zhu, Zhidong Lin, Wensheng Gan, Ruichu Cai, Zhifeng Hao, Philip S. Yu
arxiv.org/abs/2510.10243

@Techmeme@techhub.social
2025-10-21 13:25:49

Threat intel company Dataminr plans to acquire cybersecurity threat intel provider ThreatConnect for $290M; Dataminr raised $85M in convertible funding in March (Greg Otto/CyberScoop)
cyberscoop.com/dataminr-threat

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-13 13:05:48

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimiz...
Xinling Yu, Ziyue Liu, Hai Li, Yixing Li, Xin Ai, Zhiyu Zeng, Ian Young, …

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-13 11:14:25

Crosslisted article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Deep Learning of the Biswas-Chatterjee-Sen Model
Neto, Alencar, Brito, Alves, Lima, Macedo-Filho, Ferreira, Alves

@Techmeme@techhub.social
2025-11-06 17:20:57

Google adds Gemini's Deep Search to Google Finance, which will also have prediction market data from Kalshi and Polymarket for event analysis, first in the US (Aamir Siddiqui/Android Authority)
androidauthority.com/google-fi

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-07 17:47:26

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Comparative Analysis of Richardson-Lucy Deconvolution and Data Unfolding with Mean Integrated Squ...
Nikolay D. Gagunashvili

@arXiv_physicsdataan_bot@mastoxiv.page
2025-10-08 11:27:54

Crosslisted article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Functional Connectivity Networks for Transportation Delay Analysis: from Theory to Software
Carlson Moses Büth, Massimiliano Zanin

@Techmeme@techhub.social
2025-10-08 06:40:47

An analysis of 1,100 TikTok users' watch histories across six months in 2024 shows how effective TikTok is at getting even its heaviest users to watch more (Washington Post)