Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability
Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe
https://arxiv.org/abs/2506.15629
Inclusion Arena: An Open Platform for Evaluating Large Foundation Models with Real-World Apps
Kangyu Wang, Hongliang He, Lin Liu, Ruiqi Liang, Zhenzhong Lan, Jianguo Li
https://arxiv.org/abs/2508.11452
MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Atharva Naik, Lawanya Baghel, Dhakshin Govindarajan, Darsh Agrawal, Daniel Fried, Carolyn Rose
https://arxiv.org/abs/2507.11687
Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning
Ankan Deria, Adinath Madhavrao Dukre, Feilong Tang, Sara Atito, Sudipta Roy, Muhammad Awais, Muhammad Haris Khan, Imran Razzak
https://arxiv.org/abs/2506.15649
Apparent Resonance Splitting in Self-Coupled Excitonic Systems
Avishek Sarbajna, Qitong Li, Dorte Rub{\ae}k Danielsen, Skyler Peitso Selvin, Duc Hieu Nguyen, Manh-Ha Doan, Peter B{\o}ggild, Mark L. Brongersma, S{\o}ren Raza
https://arxiv.org/abs/2508.11370
Fine-structure Line Atlas for Multi-wavelength Extragalactic Study (FLAMES) III: [C II] as Tracer, Crisis of SFR, [O III]/[C II] at High-z, New Answers and New Questions
Bo Peng, Gordon Stacey, Amit Vishwas, Catie Ball, Cody Lamarche, Christopher Rooney, Thomas Nikola, Carl Ferkinhoff
https://arxiv.org/abs/2507.12896
A Comparative Approach to Assessing Linguistic Creativity of Large Language Models and Humans
Anca Dinu, Andra-Maria Florescu, Alina Resceanu
https://arxiv.org/abs/2507.12039
Thyme: Think Beyond Images
Yi-Fan Zhang, Xingyu Lu, Shukang Yin, Chaoyou Fu, Wei Chen, Xiao Hu, Bin Wen, Kaiyu Jiang, Changyi Liu, Tianke Zhang, Haonan Fan, Kaibing Chen, Jiankang Chen, Haojie Ding, Kaiyu Tang, Zhang Zhang, Liang Wang, Fan Yang, Tingting Gao, Guorui Zhou
https://arxiv.org/abs/2508.11630
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds
Petr Anokhin, Roman Khalikov, Stefan Rebrikov, Viktor Volkov, Artyom Sorokin, Vincent Bissonnette
https://arxiv.org/abs/2508.12782
ImpReSS: Implicit Recommender System for Support Conversations
Omri Haller, Yair Meidan, Dudu Mimran, Yuval Elovici, Asaf Shabtai
https://arxiv.org/abs/2506.14231