Representation Potentials of Foundation Models for Multimodal Alignment: A Survey
Jianglin Lu, Hailing Wang, Yi Xu, Yizhou Wang, Kuo Yang, Yun Fu
https://arxiv.org/abs/2510.05184
ODKE : Ontology-Guided Open-Domain Knowledge Extraction with LLMs
Samira Khorshidi, Azadeh Nikfarjam, Suprita Shankar, Yisi Sang, Yash Govind, Hyun Jang, Ali Kasgari, Alexis McClimans, Mohamed Soliman, Vishnu Konda, Ahmed Fakhry, Xiaoguang Qi
https://arxiv.org/abs/2509.04696
Multimodal Foundation Model-Driven User Interest Modeling and Behavior Analysis on Short Video Platforms
Yushang Zhao, Yike Peng, Li Zhang, Qianyi Sun, Zhihui Zhang, Yingying Zhuang
https://arxiv.org/abs/2509.04751
Ah yes, MIPS Technologies.
https://mips.com/products/hardware/
The famous designer of MIPS (microprocessor without interlocked pipeline stages) architecture processors.
Such a well named company /s
Agile Tradespace Exploration for Space Rendezvous Mission Design via Transformers
Yuji Takubo, Daniele Gammelli, Marco Pavone, Simone D'Amico
https://arxiv.org/abs/2510.03544
UItron: Foundational GUI Agent with Advanced Perception and Planning
Zhixiong Zeng, Jing Huang, Liming Zheng, Wenkang Han, Yufeng Zhong, Lei Chen, Longrong Yang, Yingjie Chu, Yuzhi He, Lin Ma
https://arxiv.org/abs/2508.21767
Foundation Model-Driven Grasping of Unknown Objects via Center of Gravity Estimation
Kang Xiangli, Yage He, Xianwu Gong, Zehan Liu, Yuru Bai
https://arxiv.org/abs/2507.19242 htt…
A First Look at Privacy Risks of Android Task-executable Voice Assistant Applications
Shidong Pan, Yikai Ge, Xiaoyu Sun
https://arxiv.org/abs/2509.23680 https://
MedDINOv3: How to adapt vision foundation models for medical image segmentation?
Yuheng Li, Yizhou Wu, Yuxiang Lai, Mingzhe Hu, Xiaofeng Yang
https://arxiv.org/abs/2509.02379 ht…
From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning
Honglin He, Yukai Ma, Wayne Wu, Bolei Zhou
https://arxiv.org/abs/2507.22028 https:/…