The Heterogeneous Multi-Agent Challenge
Charles Dansereau, Junior-Samuel Lopez-Yepez, Karthik Soma, Antoine Fagette
https://arxiv.org/abs/2509.19512 https://
Brazil's Amazon lost area the size of Spain in 40 years: Study #Brazil
From <Answer> to <Think>: Multidimensional Supervision of Reasoning Process for LLM Optimization
Beining Wang, Weihang Su, Hongtao Tian, Tao Yang, Yujia Zhou, Ting Yao, Qingyao Ai, Yiqun Liu
https://arxiv.org/abs/2510.11457
Transforming Noise Distributions with Histogram Matching: Towards a Single Denoiser for All
Sheng Fu, Junchao Zhang, Kailun Yang
https://arxiv.org/abs/2510.06757 https://…
ExGRPO: Learning to Reason from Experience
Runzhe Zhan, Yafu Li, Zhi Wang, Xiaoye Qu, Dongrui Liu, Jing Shao, Derek F. Wong, Yu Cheng
https://arxiv.org/abs/2510.02245 https://…
DemoGrasp: Universal Dexterous Grasping from a Single Demonstration
Haoqi Yuan, Ziye Huang, Ye Wang, Chuan Mao, Chaoyi Xu, Zongqing Lu
https://arxiv.org/abs/2509.22149 https://
Rethinking Entropy Regularization in Large Reasoning Models
Yuxian Jiang, Yafu Li, Guanxu Chen, Dongrui Liu, Yu Cheng, Jing Shao
https://arxiv.org/abs/2509.25133 https://…
The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives
Matthieu Bou, Nyal Patel, Arjun Jagota, Satyapriya Krishna, Sonali Parbhoo
https://arxiv.org/abs/2510.06096