2025-09-29 09:28:27
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios
Haotian Luo, Huaisong Zhang, Xuelin Zhang, Haoyu Wang, Zeyu Qin, Wenjie Lu, Guozheng Ma, Haiying He, Yingsha Xie, Qiyang Zhou, Zixuan Hu, Hongze Mi, Yibo Wang, Naiqiang Tan, Hong Chen, Yi R. Fung, Chun Yuan, Li Shen
https://arxiv.org/abs/2509.21766









