2025-12-11 16:44:27
Analysis: New Data Suggests Russia Is Sustaining Mi-8 Output Despite Wartime Losses: https://benborges.xyz/2025/12/11/analysis-new-data-suggests-russia.html
Analysis: New Data Suggests Russia Is Sustaining Mi-8 Output Despite Wartime Losses: https://benborges.xyz/2025/12/11/analysis-new-data-suggests-russia.html
{dtrack} makes documentation of data wrangling part of the analysis and creates pretty flow charts: #rstats
An analysis of 47,000 publicly shared ChatGPT conversations: ~10% related to emotional or mental health, ChatGPT exhibits a "default to yes" behavior, and more (Washington Post)
https://www.washingtonpost.com/technology/2025/11/12/how-people-use-ch…
The Data Enclave Advantage: A New Paradigm for Least-Privileged Data Access in a Zero-Trust World
Nico Bistolfi, Andreea Georgescu, Dave Hodson
https://arxiv.org/abs/2510.09494 …
Sensitivity Analysis for Causal ML: A Use Case at Booking.com
Philipp Bach, Victor Chernozhukov, Carlos Cinelli, Lin Jia, Sven Klaassen, Nils Skotara, Martin Spindler
https://arxiv.org/abs/2510.09109
Comparative Performance Analysis of Modern NoSQL Data Technologies: Redis, Aerospike, and Dragonfly
Deep Bodra, Sushil Khairnar
https://arxiv.org/abs/2510.08863 https://
Joint neutrino oscillation analysis from the T2K and NOvA experiments: #neutrinos may hold the keys to why we exist: https://www.eurekalert.org/news-releases/1103650 - MSU scientists help merge data from two neutrino experiments to offer most precise look yet at elusive particles.
Crosslisted article(s) found for physics.data-an. https://arxiv.org/list/physics.data-an/new
[1/1]:
- Deep Learning of the Biswas-Chatterjee-Sen Model
Neto, Alencar, Brito, Alves, Lima, Macedo-Filho, Ferreira, Alves
🍔 Thermal, Mechanical, And Material Stresses Grow With Die Stacking
https://semiengineering.com/thermal-mechanical-and-material-stresses-grow-with-die-stacking/
Identification of Gamma Ray Pulsar Candidates in the \emph{Fermi}-LAT 4FGL-DR4 Unassociated Sources Using Supervised Machine Learning
A. Pathania, K. K. Singh, S. K. Singh, A. Tolamatti, B. B. Singh, K. K. Yadav
https://arxiv.org/abs/2510.08654
Hierarchical Progressive Survey (HiPS) format: moving from visualisation to scientific analysis
Fabrizio Giordano, Yago Ascasibar, Luca Cortese, Ivan Valtchanov, Bruno Mer\'in
https://arxiv.org/abs/2510.09533
Deep prior-based denoising for state-of-the-art scientific imaging and metrology
Yuichi Yokoyama, Kohei Yamagami, Yuta Sumiya, Hayaru Shouno, Masaichiro Mizumaki
https://arxiv.org/abs/2510.09410
Instance-Aware Robust Consistency Regularization for Semi-Supervised Nuclei Instance Segmentation
Zenan Lin, Wei Li, Jintao Chen, Zihao Wu, Wenxiong Kang, Changxin Gao, Liansheng Wang, Jin-Gang Yu
https://arxiv.org/abs/2510.09329
I rewrote a data analysis pipeline, moving it from #python to #julialang . I am now in love with the threading support in Julia.
The task is very parallelizable but each thread needs random read access to a tens-of-GB dataset. In Python (with multiprocessing, shared stores, etc) data bookkeeping was a nightmar…
"Did the Upper Great Highway closure make Sunset neighborhood streets less safe? Supervisor Alan Wong claimed it did at a January 8, 2026 press conference, citing a simple year-over-year map comparison of crash data. But my analysis, using the same DataSF crash data with rigorous statistical controls, finds no evidence to support that claim, and if anything, the data suggest the opposite."
Constraints on the interacting holographic dark energy models: implications from background and perturbations data
N. Nazari Pooya
https://arxiv.org/abs/2510.08875 https://
Multi-messenger Analysis of Supermassive Black Hole Binaries: The Joint-likelihood Approach
Maria Charisi, Stephen Taylor, Jessie Runnoe, Caitlin Witt, Polina Petrov
https://arxiv.org/abs/2510.08683
Web Crawler Restrictions, AI Training Datasets \& Political Biases
Paul Bouchaud (ISC-PIF, m\'edialab), Pedro Ramaciotti (ISC-PIF, m\'edialab)
https://arxiv.org/abs/2510.09031
VisPile: A Visual Analytics System for Analyzing Multiple Text Documents With Large Language Models and Knowledge Graphs
Adam Coscia, Alex Endert
https://arxiv.org/abs/2510.09605
Measurement of the $e^ e^-\to\eta\gamma$ cross section near the $\phi(1020)$ resonance with the SND detector
SND Collaboration, M. N. Achasov, A. E. Alizzi, A. Yu. Barnyakov, E. V. Bedarev, K. I. Beloborodov, A. V. Berdyugin, A. G. Bogdanchikov, A. A. Botov, D. E. Chistyakov, T. V. Dimova, V. P. Druzhinin, L. V. Kardapoltsev, A. S. Kasaev, A. A. Kattsin, A. G. Kharlamov, I. A. Koop, A. A. Korol, D. P. Kovrizhin, A. S. Kupich, A. P. Kryukov, N. A. Melnikova, N. Yu. Muchnoi, A. E. Obrazo…
A template for data analysis projects structured as R packages (or not) https://github.com/Pakillo/template by @…
Theoretical Analysis of Topotomography Using Small Intragranular Strain Approximations
Zheheng Liu, Nicola Vigano, Henry Proudhon, Wolfgang Ludwig
https://arxiv.org/abs/2510.08712
Modeling, Segmenting and Statistics of Transient Spindles via Two-Dimensional Ornstein-Uhlenbeck Dynamics
C. Sun, D. Fettahoglu, D. Holcman
https://arxiv.org/abs/2512.10844 https://arxiv.org/pdf/2512.10844 https://arxiv.org/html/2512.10844
arXiv:2512.10844v1 Announce Type: new
Abstract: We develop here a stochastic framework for modeling and segmenting transient spindle- like oscillatory bursts in electroencephalogram (EEG) signals. At the modeling level, individ- ual spindles are represented as path realizations of a two-dimensional Ornstein{Uhlenbeck (OU) process with a stable focus, providing a low-dimensional stochastic dynamical sys- tem whose trajectories reproduce key morphological features of spindles, including their characteristic rise{decay amplitude envelopes. On the signal processing side, we propose a segmentation procedure based on Empirical Mode Decomposition (EMD) combined with the detection of a central extremum, which isolates single spindle events and yields a collection of oscillatory atoms. This construction enables a systematic statistical analysis of spindle features: we derive empirical laws for the distributions of amplitudes, inter-spindle intervals, and rise/decay durations, and show that these exhibit exponential tails consistent with the underlying OU dynamics. We further extend the model to a pair of weakly coupled OU processes with distinct natural frequencies, generating a stochastic mixture of slow, fast, and mixed spindles in random temporal order. The resulting framework provides a data- driven framework for the analysis of transient oscillations in EEG and, more generally, in nonstationary time series.
toXiv_bot_toot
On the Strength of Linear Relaxations in Ordered Optimization
V\'ictor Blanco, Diego Laborda, Miguel Mart\'inez-Ant\'on
https://arxiv.org/abs/2510.09166 https://
Sequencing on Silicon: AI SoC Design for Mobile Genomics at the Edge
Sebastian Magierowski, Zhongpan Wu, Abel Beyene, Karim Hammad
https://arxiv.org/abs/2510.09339 https://
If there is any truth in these allegations, we really have to worry about that is going on in both AGS and GSOC.
The initial story was bad enough: the Irish police service being so incompetent that their statistics on homicides were wildly incorrect, and the whistleblower getting penalised; but this is batshit.
Deep Learning of the Biswas-Chatterjee-Sen Model
J. F. Silva Neto, D. S. M. Alencar, L. T. Brito, G. A. Alves, F. W. S. Lima, A. Macedo-Filho, R. S. Ferreira, T. F. A. Alves
https://arxiv.org/abs/2510.09446
The Tow Center releases a tracker that monitors lawsuits, deals, grants, and other developments between news publishers and AI companies (Klaudia Jaźwińska/Columbia Journalism Review)
https://www.cjr.org/analysis/lawsuit-license-openai-micros…
Optimal Binning for Small-Angle Neutron Scattering Data Using the Freedman-Diaconis Rule
Jessie E. An, Chi-Huan Tung, Changwoo Do, Wei-Ren Chen
https://arxiv.org/abs/2510.09581 …
The Impact of Sanctions on decentralised Privacy Tools: A Case Study of Tornado Cash
Raffaele Cristodaro, Benjamin Kramer, Claudio J. Tessone
https://arxiv.org/abs/2510.09443 ht…
@carlos@perceptiveconstructs.com
@carlos@social.perceptiveconstructs.com98 % des arbres fruitiers et oliviers de Gaza ont été détruits. 90 % des serres sont endommagées et 75 % détruites, selon une analyse des images satellitaires.
https://www.zmescience.com/science/news-sc
Euclid preparation. Cosmology Likelihood for Observables in Euclid (CLOE). 4: Validation and Performance
Collaboration, Martinelli, Pezzotta, Sciotti, Blot, Bonici, Camera, Ca\~nas-Herrera, Cardone, Carrilho, Casas, Davini, Di Domizio, Farrens, Goh, Beauchamps, Ili\'c, Joudaki, Keil, Le Brun, Moretti, Pettorino, S\'anchez, Sakr, Tanidis, Tutusaus, Ajani, Crocce, Giocoli, Legrand, Lembo, Lesci, Girones, Nouri-Zonoz, Pamuk, Tsedrik, Bel, Carbone, Duncan, Kilbinger, Lacasa, Lattan…
End-of-Year Threat Intelligence Sightings Forecast
This report presents an analysis of Threat Intelligence (TI) Sightings aggregated from several key data sources, including social platforms, code repositories, and specialized TI feeds. The primary objective is to visually track historical trends per source and provide a short-term adaptive forecast for a defined period (in days).
#opensource
“This seems the best bang for your buck; it’s less per year than private school.”, said the future mother.
UK IVF couples use legal loophole to rank embryos based on potential IQ, height and health
https://www.theguar…
Google adds Gemini's Deep Search to Google Finance, which will also have prediction market data from Kalshi and Polymarket for event analysis, first in the US (Aamir Siddiqui/Android Authority)
https://www.androidauthority.com/google-finance…
🚜 California farmland doused with 2.5 million pounds of PFAS pesticides each year, analysis finds
https://www.thenewlede.org/2025/11/california-farmland-doused-with-2-5-million-pounds-of-pfas-pesticides-each-y…
ICE shift in tactics leads to soaring number of unjustifiable arrests
Government data shows that more than 60 percent of the people detained in at-large arrests since June did NOT have criminal convictions or pending charges.
-- even as authorities insist that immigration officers are focusing on violent criminals whom they describe as “the worst of the worst.”
[2025-10-13 Mon (UTC), 1 new article found for physics.data-an Data Analysis, Statistics and Probability]
toXiv_bot_toot
Beyond Revenue and Welfare: Counterfactual Analysis of Spectrum Auctions with Application to Canada's 3800MHz Allocation
Sara Jalili Shani, Kris Joseph, Michael B. McNally, James R. Wright
https://arxiv.org/abs/2512.08106 https://arxiv.org/pdf/2512.08106 https://arxiv.org/html/2512.08106
arXiv:2512.08106v1 Announce Type: new
Abstract: Spectrum auctions are the primary mechanism through which governments allocate scarce radio frequencies, with outcomes that shape competition, coverage, and innovation in telecommunications markets. While traditional models of spectrum auctions often rely on strong equilibrium assumptions, we take a more parsimonious approach by modeling bidders as myopic and straightforward: in each round, firms simply demand the bundle that maximizes their utility given current prices. Despite its simplicity, this model proves effective in predicting the outcomes of Canada's 2023 auction of 3800 MHz spectrum licenses. Using detailed round-by-round bidding data, we estimate bidders' valuations through a linear programming framework and validate that our model reproduces key features of the observed allocation and price evolution. We then use these estimated valuations to simulate a counterfactual auction under an alternative mechanism that incentivizes deployment in rural and remote regions, aligning with one of the key objectives set out in the Canadian Telecommunications Act. The results show that the proposed mechanism substantially improves population coverage in underserved areas. These findings demonstrate that a behavioral model with minimal assumptions is sufficient to generate reliable counterfactual predictions, making it a practical tool for policymakers to evaluate how alternative auction designs may influence future outcomes. In particular, our study demonstrates a method for counterfactual mechanism design, providing a framework to evaluate how alternative auction rules could advance policy goals such as equitable deployment across Canada.
toXiv_bot_toot
@… harmonic analysis of metrics data I love it
An analysis of 30 US data center proposals in 14 states: in most cases, local officials signed NDAs and worked with apparent shell companies to hide details (Natalie Kainz/NBC News)
https://www.nbcnews.com/tech/tech-news/dat
Most Cambodia & Laos tree cover loss in 2024 happened inside protected areas https://news.mongabay.com/short-article/2025/10/most-cambodia-laos-tree-cover-loss-in-2024-happened-inside-protected-areas/
…
For decades, scientists have been intrigued by a strange twist in the Moon’s history.
Toward its last stages of formation, the lunar mantle likely flipped:
Minerals that had formed at the top sank to its bottom, in a process called lunar mantle overturn.
The idea emerged from simulations based on the analysis of lunar rocks brought back by the Apollo missions,
but a new study published in Nature Geoscience offers the first evidence supporting this theory.
Fou…
"This paper presents a comprehensive scientometric analysis of the long-term impact of [event] on the nation scientific development."
*oh, interesting!*
"Using Scopus-indexed data..."
*closes tab*
Mapping the Urban Mobility Intelligence Frontier: A Scientometric Analysis of Data-Driven Pedestrian Trajectory Prediction and Simulation
Junhao Xu, Hui Zeng
https://arxiv.org/abs/2510.10327
Evaluation of Real-Time Preprocessing Methods in AI-Based ECG Signal Analysis
Jasmin Freudenberg, Kai Hahn, Christian Weber, Madjid Fathi
https://arxiv.org/abs/2510.12541 https:…
Teaching students simple #WebScraping was always quite rewarding. It opens up numerous relevant, real-world data sources that are the foundation for any further analysis. Things already got more complicated with dynamic content loading, but now bot-exclusion-mechanisms make it almost impossible in many cases. Is web scraping for the
RE: https://mastodon.social/@cheeaun/115415146417702654
After looking at this, got curious to know the limits in most servers.
So I did a little data analysis. Servers list from @…
Inside the deportation machine (giftlink)
https://www.nytimes.com/interactive/2025/12/22/us/trump-immigration-deportation-network-ice-arrests.html?unlocked_art…
Multi-Physics-Enhanced Bayesian Inverse Analysis: Information Gain from Additional Fields
Lea J. Haeusel, Jonas Nitzler, Lea J. K\"oglmeier, Wolfgang A. Wall
https://arxiv.org/abs/2510.11095
🕶️ Community Analysis of Social Virtual Reality Based on Large-Scale Log Data of a Commercial Metaverse Platform
#vr
DeeDeeExperiment: Building an infrastructure for integrating and managing omics data analysis results in R/Bioconductor
Najla Abassi, Lea Schwarz, Edoardo Filippi, Federico Marini
https://arxiv.org/abs/2512.05731
Monitoring 3D Lattice Structures in Additive Manufacturing Using Topological Data Analysis
Yulin An, Xueqi Zhao, Enrique del Castillo
https://arxiv.org/abs/2510.11740 https://…
First simultaneous global QCD analysis of kaon and pion parton distributions with lattice QCD constraints
P. C. Barry, Chueng-Ryong Ji, W. Melnitchouk, N. Sato, Fernanda Steffens
https://arxiv.org/abs/2510.11979
Dataminr to acquire cybersecurity firm ThreatConnect for $290 million
https://cyberscoop.com/dataminr-threatconnect-acquisition/
An analysis of Crunchbase and PitchBook data: in 2025 so far, 80 tech startups reached $1B valuations, many of them focused on AI with exceptions like Kalshi (Dominic-Madori Davis/TechCrunch)
https://techcrunch.com/2025/12/01/at-least-36-new-tech-…
Geospatial Reasoning fuses weather, satellite and population
data with Gemini AI for risk analysis
It runs in Google's Trusted Tester program for early access
(That ain't you)
https://www.testingcatalog.com/google-laun…
An analysis of AI training datasets, compiled by The Atlantic, shows AI models were trained on hundreds of thousands of YouTube videos from news publishers (Andrew Deck/Nieman Lab)
https://www.niemanlab.org/2025/10/hundred…
Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis
Meihui Zhang, Liming Wang, Chi Zhang, Zhaojing Luo
https://arxiv.org/abs/2510.12642 https://…
Claude skills are a big deal™️
Thanks to skills, you can reduce your multi-agent setup to a single agent with skills, greatly reducing complexity and increasing speed of execution.
In fact, if in the past you could have a number of agents each specialized in, for example, data analysis, getting data from a particular set of websites, making that data available in a dashboard, etc., with skills you can substitute all these agents with skills. (1/2)
BeSTAD: Behavior-Aware Spatio-Temporal Anomaly Detection for Human Mobility Data
Junyi Xie, Jina Kim, Yao-Yi Chiang, Lingyi Zhao, Khurram Shafique
https://arxiv.org/abs/2510.12076
Cost Analysis of Human-corrected Transcription for Predominately Oral Languages
Yacouba Diarra, Nouhoum Souleymane Coulibaly, Michael Leventhal
https://arxiv.org/abs/2510.12781 …
Regional temperature records broken across the world in 2025 #environment
Analysis: of the 8,808 global data centers in October 2025, ~7,000 are in areas outside the optimal 18C to 27C temperature range; 600 are in areas above 27C (Rest of World)
https://restofworld.org/2025/data-center-heat-map/
Kayak: in principle, an application that may be well-served by "old" rules-based AI. Its function is supposed to be deterministic, needing more data ingestion & analysis than humans can tolerate.
But if they're calling it "AI" today, I'm sure it's a LLM/neural net gadget which will hallucinate flights & fares. Because we're playing out the theory that the #XRisk
There is enough data to start publishing reports of my statistical analysis of the Italian Volleyball Serie A1 championship.
https://davideaversa.it/experiment/volley/seriea1w2025.html
MS-Mix: Unveiling the Power of Mixup for Multimodal Sentiment Analysis
Hongyu Zhu, Lin Chen, Mounim A. El-Yacoubi, Mingsheng Shang
https://arxiv.org/abs/2510.11579 https://
The Pitfalls of Continuous Heavy-Tailed Distributions in High-Frequency Data Analysis
Vladim\'ir Hol\'y
https://arxiv.org/abs/2510.09785 https://ar…
This is a good start but the subway should curve south down 19th Ave, meet up with Daly City BART and continue on the BART tracks down to Millbrae. That part is essential; a branch to Outer Richmond could be added later as a nice-to-have.
https://musubi3.github.io/sfmta-geary-subway…
Developing an information criterion for spatial data analysis through Bayesian generalized fused lasso
Yuko Kakikawa, Yoshiyuki Ninomiya
https://arxiv.org/abs/2510.11172 https:/…
An analysis of Waymo's data covering ~100M driverless miles across four US cities: Waymo cars have far lower crash rates per million miles than human drivers (Jonathan Slotkin/New York Times)
https://www.nytimes.com/2025/12/02/o…
Claude skills are a big deal™️
Thanks to skills, you can reduce your multi-agent setup to a single agent with skills, greatly reducing complexity and increasing speed of execution.
In fact, if in the past you could have a number of agents each specialized in, for example, data analysis, getting data from a particular set of websites, making that data available in a dashboard, etc., with skills you can substitute all these agents with skills. (1/2)
Updated constraints on interacting dark energy: A comprehensive analysis using multiple CMB probes, DESI DR2, and supernovae observations
Tian-Nuo Li, Guo-Hong Du, Yun-He Li, Yichao Li, Jia-Le Ling, Jing-Fei Zhang, Xin Zhang
https://arxiv.org/abs/2510.11363
Data or Language Supervision: What Makes CLIP Better than DINO?
Yiming Liu, Yuhui Zhang, Dhruba Ghosh, Ludwig Schmidt, Serena Yeung-Levy
https://arxiv.org/abs/2510.11835 https:/…
PromptLocate: Localizing Prompt Injection Attacks
Yuqi Jia, Yupei Liu, Zedian Shao, Jinyuan Jia, Neil Gong
https://arxiv.org/abs/2510.12252 https://arxiv.o…
The Adoption Paradox: A Comparative Analysis of Veterinary AI Adoption in China and the North America
Shumin Li, Xiaoyun Lai
https://arxiv.org/abs/2510.11758 https://
Analysis: since 2023, data center power demands have delayed 15 coal plants' retirements; the Trump administration has ordered two power plants to remain open (Ariel Wittenberg/Politico)
https://www.politico.com/news/2025/11/27/a
The $\alpha$--regression for compositional data: a unified framework for standard, spatially-lagged, and geographically-weighted regression models
Michail Tsagris
https://arxiv.org/abs/2510.12663
Unveiling Gamer Archetypes through Multi modal feature Correlations and Unsupervised Learning
Moona Kanwal, Muhammad Sami Siddiqui, Syed Anael Ali
https://arxiv.org/abs/2510.10263
👨🏿🌾 Traces of old farm chemicals contaminate water across the US
#chemicals
A look at Meta's 2GW Hyperion data center in Louisiana, with the first phase opening in 2028; an analysis shows sales tax breaks on GPUs could total $3.3B (Jon Keegan/Sherwood News)
https://sherwood.news/tech/hyperion/
Efficient Mining of Low-Utility Sequential Patterns
Jian Zhu, Zhidong Lin, Wensheng Gan, Ruichu Cai, Zhifeng Hao, Philip S. Yu
https://arxiv.org/abs/2510.10243 https://
Crosslisted article(s) found for physics.data-an. https://arxiv.org/list/physics.data-an/new
[1/1]:
- Ensemble-Based Data Assimilation for Material Model Characterization in High-Velocity Impact
Rong Jin, Guangyao Wang, Xingsheng Sun
As Cyber Threats Escalate, the National Vulnerability Database Is Falling Behind
The National Institute of Standards and Technology (NIST) is struggling.
It faces a growing backlog to process data in its vulnerability repository, which publicly shares information assessing and detailing mitigation solutions against new cyber exploits.
With nearly 1,800 new reported vulnerabilities sitting in a queue for analysis this year, delays in processing leave the United States increa…
Analysis: Oracle has moved $66B of debt for building AI data centers off its balance sheet using SPVs; Meta has moved $30B, xAI moved $20B, and CoreWeave $2.6B (Tabby Kinder/Financial Times)
https://www.ft.com/content/0ae9d6cd-6b94-4e22-a559-f047734bef83
Replaced article(s) found for physics.data-an. https://arxiv.org/list/physics.data-an/new
[1/1]:
- Maximum entropy temporal networks
Paolo Barucca
Replaced article(s) found for physics.data-an. https://arxiv.org/list/physics.data-an/new
[1/1]:
- Universal emergence of local Zipf-Mandelbrot law
Davide Cugini, Andr\'e Timpanaro, Giacomo Livan, Giacomo Guarnieri
Threat intel company Dataminr plans to acquire cybersecurity threat intel provider ThreatConnect for $290M; Dataminr raised $85M in convertible funding in March (Greg Otto/CyberScoop)
https://cyberscoop.com/dataminr-threatconnect-acquisition/
Crosslisted article(s) found for physics.data-an. https://arxiv.org/list/physics.data-an/new
[1/1]:
- A mathematical theory for understanding when abstract representations emerge in neural networks
Bin Wang, W. Jeffrey Johnston, Stefano Fusi
Replaced article(s) found for physics.data-an. https://arxiv.org/list/physics.data-an/new
[1/1]:
- DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimiz...
Xinling Yu, Ziyue Liu, Hai Li, Yixing Li, Xin Ai, Zhiyu Zeng, Ian Young, …