Axiomatic Choice and the Decision-Evaluation Paradox
Ben Abramowitz, Nicholas Mattei
https://arxiv.org/abs/2509.21836 https://arxiv.org/pdf/2509.21836
LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?
Lu Sun, Shihan Fu, Bingsheng Yao, Yuxuan Lu, Wenbo Li, Hansu Gu, Jiri Gesi, Jing Huang, Chen Luo, Dakuo Wang
https://arxiv.org/abs/2509.21501
Steelers' Calvin Austin being evaluated in Dublin hospital after suffering shoulder injury in win over Vikings
https://www.cbssports.com/nfl/news/s…
GW241011 and GW241110 - Exploring Binary Formation and Fundamental Physics with Asymmetric, High-spin #BlackHole Coalescences: https://iopscience.iop.org/article/10.3847/2041-8213/ae0d54 -> Colliding black holes might have formed from earlier cosmic smashups: https://www.caltech.edu/about/news/colliding-black-holes-might-have-formed-from-earlier-cosmic-smashups and https://www.ligo.caltech.edu/news/ligo20251028 and https://www.ozgrav.org/news/twin-black-hole-mergers-reveal-secrets-of-cosmic-evolution/ - two recent detections have given the international LIGO-Virgo-KAGRA collaboration new insights into black hole formation and evolution -> Gravitational Wave Detectors Spot Merging Black Holes That Have Merged Before: https://aasnova.org/2025/10/28/not-their-first-rodeo-gravitational-wave-detectors-spot-merging-black-holes-that-have-merged-before/
ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents
Tianjian Liu, Fanqi Wan, Jiajian Guo, Xiaojun Quan
https://arxiv.org/abs/2508.20973 https://
I need to read it properly, but this looks 🔥 https://arxiv.org/abs/2511.16652
Evaluating Differentially Private Generation of Domain-Specific Text
Yidan Sun, Viktor Schlegel, Srinivasan Nandakumar, Iqra Zahid, Yuping Wu, Warren Del-Pinto, Goran Nenadic, Siew-Kei Lam, Jie Zhang, Anil A Bharath
https://arxiv.org/abs/2508.20452
Evaluating LLMs for Combinatorial Optimization: One-Phase and Two-Phase Heuristics for 2D Bin-Packing
Syed Mahbubul Huq, Daniel Brito, Daniel Sikar, Rajesh Mojumder
https://arxiv.org/abs/2509.22255
The Lie of the Average: How Class Incremental Learning Evaluation Deceives You?
Guannan Lai, Da-Wei Zhou, Xin Yang, Han-Jia Ye
https://arxiv.org/abs/2509.22580 https://
Guiding Evolution of Artificial Life Using Vision-Language Models
Nikhil Baid, Hannah Erlebach, Paul Hellegouarch, Frederico Wieser
https://arxiv.org/abs/2509.22447 https://