This https://arxiv.org/abs/2506.00618 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
RiOSWorld: Benchmarking the Risk of Multimodal Compter-Use Agents
Jingyi Yang, Shuai Shao, Dongrui Liu, Jing Shao
https://arxiv.org/abs/2506.00618 https://…
Hidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Models
Qianqi Yan, Hongquan Li, Shan Jiang, Yang Zhao, Xinze Guan, Ching-Chen Kuo, Xin Eric Wang
https://arxiv.org/abs/2506.00258