STRive: An association rule-based system for the exploration of spatiotemporal categorical data
Mauro Diaz, Luis Sante, Joel Perca, Jo\~ao Victor da Silva, Nivan Ferreira, Jorge Poco
https://arxiv.org/abs/2509.02732
A nonparametric Bayesian analysis of independent and identically distributed observations of covariate-driven Poisson processes
Patric Dolmeta, Matteo Giordano
https://arxiv.org/abs/2509.02299
A systematic comparison of Large Language Models for automated assignment assessment in programming education: Exploring the importance of architecture and vendor
Marcin Jukiewicz
https://arxiv.org/abs/2509.26483
SPATA: Systematic Pattern Analysis for Detailed and Transparent Data Cards
Jo\~ao Vitorino, Eva Maia, Isabel Pra\c{c}a, Carlos Soares
https://arxiv.org/abs/2509.26640 https://…
An Anthropic report details how Claude usage varies by country and US states, finding 36% use it for coding, 77% of enterprise use is for automation, and more (Anthropic)
https://www.anthropic.com/research/anthropic-economic-index-september-2025-report
Sleep Disorder Diagnosis Using EEG Signals and LSTM Deep Learning Method
Mohammad Reza Yousefi, Reza Rahimi
https://arxiv.org/abs/2509.00208 https://arxiv.…
Learning Short-Term and Long-Term Patterns of High-Order Dynamics in Real-World Networks
Yunyong Ko, Da Eun Lee, Song Kyung Yu, Sang-Wook Kim
https://arxiv.org/abs/2508.17236 ht…
Finally, what Xia & Lindell call a "separation problem" is, in our view, a feature of our approach and not a bug.
If, e.g., all languages in a family are polysynthetic (or none are), that’s not a statistical artefact – it’s the signal. The outcome is well associated with genealogy, showing that family membership captures someth genuinely informative about the process. When the model finds that family explains a large share of the variance, that's not a failure–it's evidence that phylogenetic structure dominates the pattern.
So while Xia & Lindell insist that "autocorrelation due to relationships and distance cannot be captured in family or regional-level analyses", we see that as an empirical question – and we treated it as one.
The real test is whether a mixed model that explicitly represents phylogeny and geography performs worse than their alternative, where the entire shared history of languages and environments is effectively collapsed into a single dimension (an eigenvector).
In other words: we model relationships – Xia & Lindell summarise them into one number per language.
Visual Analytics for Causal Reasoning from Real-World Health Data
Arran Zeyu Wang, David Borland, David Gotz
https://arxiv.org/abs/2508.17474 https://arxiv…
Patterns in the Transition From Founder-Leadership to Community Governance of Open Source
Mobina Noori, Mahasweta Chakraborti, Amy X Zhang, Seth Frey
https://arxiv.org/abs/2509.16295