Automating Steering for Safe Multimodal Large Language Models
Lyucheng Wu, Mengru Wang, Ziwen Xu, Tri Cao, Nay Oo, Bryan Hooi, Shumin Deng
https://arxiv.org/abs/2507.13255
When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models
Wei Cai, Shujuan Liu, Jian Zhao, Ziyan Shi, Yusheng Zhao, Yuchen Yuan, Tianle Zhang, Chi Zhang, Xuelong Li
https://arxiv.org/abs/2509.12060
Europas erste eigene Prozessor-Entwicklung ist auf dem Weg!
🇪🇺💻 SiPearl hat einen wichtigen Meilenstein erreicht: Das Unternehmen hat sein CPU-Design für den Rhea1-Prozessor an TSMC in Taiwan geschickt, wo nun die ersten Chips produziert werden.
Zum Artikel: https://
Effective Training Data Synthesis for Improving MLLM Chart Understanding
Yuwei Yang, Zeyu Zhang, Yunzhong Hou, Zhuowan Li, Gaowen Liu, Ali Payani, Yuan-Sen Ting, Liang Zheng
https://arxiv.org/abs/2508.06492
AdjustAR: AI-Driven In-Situ Adjustment of Site-Specific Augmented Reality Content
Nels Numan, Jessica Van Brummelen, Ziwen Lu, Anthony Steed
https://arxiv.org/abs/2508.06826 htt…
Dynamic Uncertainty-aware Multimodal Fusion for Outdoor Health Monitoring
Zihan Fang, Zheng Lin, Senkang Hu, Yihang Tao, Yiqin Deng, Xianhao Chen, Yuguang Fang
https://arxiv.org/abs/2508.09085
Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference
Haoran Wu, Can Xiao, Jiayi Nie, Xuan Guo, Binglei Lou, Jeffrey T. H. Wong, Zhiwen Mo, Cheng Zhang, Przemyslaw Forys, Wayne Luk, Hongxiang Fan, Jianyi Cheng, Timothy M. Jones, Rika Antonova, Robert Mullins, Aaron Zhao
https://arxiv.org/abs/2509.09505
JWST-TST DREAMS: Secondary Atmosphere Constraints for the Habitable Zone Planet TRAPPIST-1 e
Ana Glidden, Sukrit Ranjan, Sara Seager, N\'estor Espinoza, Ryan J. MacDonald, Natalie H. Allen, Caleb I. Ca\~nas, David Grant, Am\'elie Gressier, Kevin B. Stevenson, Natasha E. Batalha, Nikole K. Lewis, Douglas Long, Hannah R. Wakeford, Lili Alderson, Ryan C. Challener, Knicole Col\'on, Jingcheng Huang, Zifan Lin, Dana R. Louie, Elijah Mullens, Kristin S. Sotzen, Jeff A. Valenti, D…
Spatial-ORMLLM: Improve Spatial Relation Understanding in the Operating Room with Multimodal Large Language Model
Peiqi He, Zhenhao Zhang, Yixiang Zhang, Xiongjun Zhao, Shaoliang Peng
https://arxiv.org/abs/2508.08199
MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision
Zhonghao Yan, Muxi Diao, Yuxuan Yang, Jiayuan Xu, Kaizhou Zhang, Ruoyan Jing, Lele Yang, Yanxi Liu, Kongming Liang, Zhanyu Ma
https://arxiv.org/abs/2508.08177