Inclusion Arena: An Open Platform for Evaluating Large Foundation Models with Real-World Apps
Kangyu Wang, Hongliang He, Lin Liu, Ruiqi Liang, Zhenzhong Lan, Jianguo Li
https://arxiv.org/abs/2508.11452
EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering
Yanjun Li, Yuqian Fu, Tianwen Qian, Qi'ao Xu, Silong Dai, Danda Pani Paudel, Luc Van Gool, Xiaoling Wang
https://arxiv.org/abs/2508.10729
ReFineG: Synergizing Small Supervised Models and LLMs for Low-Resource Grounded Multimodal NER
Jielong Tang, Shuang Wang, Zhenxing Wang, Jianxing Yu, Jian Yin
https://arxiv.org/abs/2509.10975
Adapting and Evaluating Multimodal Large Language Models for Adolescent Idiopathic Scoliosis Self-Management: A Divide and Conquer Framework
Zhaolong Wu, Pu Luo, Jason Pui Yin Cheung, Teng Zhang
https://arxiv.org/abs/2509.11645
When Language Overrules: Revealing Text Dominance in Multimodal Large Language Models
Huyu Wu, Meng Tang, Xinhan Zheng, Haiyun Jiang
https://arxiv.org/abs/2508.10552
WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning
Gagan Mundada, Yash Vishe, Amit Namburi, Xin Xu, Zachary Novack, Julian McAuley, Junda Wu
https://arxiv.org/abs/2509.04744
ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs
Samira Khorshidi, Azadeh Nikfarjam, Suprita Shankar, Yisi Sang, Yash Govind, Hyun Jang, Ali Kasgari, Alexis McClimans, Mohamed Soliman, Vishnu Konda, Ahmed Fakhry, Xiaoguang Qi
https://arxiv.org/abs/2509.04696
Domain size asymptotics for Markov logic networks
Vera Koponen
https://arxiv.org/abs/2509.04192
FineBadminton: A Multi-Level Dataset for Fine-Grained Badminton Video Understanding
Xusheng He, Wei Liu, Shanshan Ma, Qian Liu, Chenghao Ma, Jianlong Wu
https://arxiv.org/abs/2508.07554
SCDF: A Speaker Characteristics DeepFake Speech Dataset for Bias Analysis
Vojtěch Staněk, Karel Srna, Anton Firc, Kamil Malinka
https://arxiv.org/abs/2508.07944