Large Language Models as Nondeterministic Causal Models
Sander Beckers
https://arxiv.org/abs/2509.22297 https://arxiv.org/pdf/2509.22297
The Lie of the Average: How Class Incremental Learning Evaluation Deceives You?
Guannan Lai, Da-Wei Zhou, Xin Yang, Han-Jia Ye
https://arxiv.org/abs/2509.22580
CHRONOBERG: Capturing Language Evolution and Temporal Awareness in Foundation Models
Niharika Hegde, Subarnaduti Paul, Lars Joel-Frey, Manuel Brack, Kristian Kersting, Martin Mundt, Patrick Schramowski
https://arxiv.org/abs/2509.22360
Color Names in Vision-Language Models
Alexandra Gomez-Villa, Pablo Hernández-Cámara, Muhammad Atif Butt, Valero Laparra, Jesus Malo, Javier Vazquez-Corral
https://arxiv.org/abs/2509.22524
Evaluating the Limits of Large Language Models in Multilingual Legal Reasoning
Antreas Ioannou, Andreas Shiamishis, Nora Hollenstein, Nezihe Merve Gürel
https://arxiv.org/abs/2509.22472
Guiding Evolution of Artificial Life Using Vision-Language Models
Nikhil Baid, Hannah Erlebach, Paul Hellegouarch, Frederico Wieser
https://arxiv.org/abs/2509.22447
Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity
Arkadiy Saakyan, Najoung Kim, Smaranda Muresan, Tuhin Chakrabarty
https://arxiv.org/abs/2509.22641
Evaluating LLMs for Combinatorial Optimization: One-Phase and Two-Phase Heuristics for 2D Bin-Packing
Syed Mahbubul Huq, Daniel Brito, Daniel Sikar, Rajesh Mojumder
https://arxiv.org/abs/2509.22255
ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu
https://arxiv.org/abs/2509.21823
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models
Chenyu Zhou, Tianyi Xu, Jianghao Lin, Dongdong Ge
https://arxiv.org/abs/2509.22558