SourceSplice: Source Selection for Machine Learning Tasks
Ambarish Singh, Romila Pradhan
https://arxiv.org/abs/2507.22186 https://arxiv.org/pdf/2507.22186
Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents
Shaofei Cai, Zhancun Mu, Haiwen Xia, Bowei Zhang, Anji Liu, Yitao Liang
https://arxiv.org/abs/2507.23698
iLearnRobot: An Interactive Learning-Based Multi-Modal Robot with Continuous Improvement
Kohou Wang, ZhaoXiang Liu, Lin Bai, Kun Fan, Xiang Liu, Huan Hu, Kai Wang, Shiguo Lian
https://arxiv.org/abs/2507.22896
Machine Assistant with Reliable Knowledge: Enhancing Student Learning via RAG-based Retrieval
Yongsheng Lian
https://arxiv.org/abs/2506.23026 https://
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning
Xi Chen, Mingkang Zhu, Shaoteng Liu, Xiaoyang Wu, Xiaogang Xu, Yu Liu, Xiang Bai, Hengshuang Zhao
https://arxiv.org/abs/2506.22434
Post-Training Large Language Models via Reinforcement Learning from Self-Feedback
Carel van Niekerk, Renato Vukovic, Benjamin Matthias Ruppik, Hsien-chin Lin, Milica Gašić
https://arxiv.org/abs/2507.21931
Libra: Assessing and Improving Reward Model by Learning to Think
Meng Zhou, Bei Li, Jiahao Liu, Xiaowen Shi, Yang Bai, Rongxiang Weng, Jingang Wang, Xunliang Cai
https://arxiv.org/abs/2507.21645
From Sufficiency to Reflection: Reinforcement-Guided Thinking Quality in Retrieval-Augmented Reasoning for LLMs
Jie He, Victor Gutierrez Basulto, Jeff Z. Pan
https://arxiv.org/abs/2507.22716
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
Songtao Jiang, Yuxi Chen, Sibo Song, Yan Zhang, Yeying Jin, Yang Feng, Jian Wu, Zuozhu Liu
https://arxiv.org/abs/2508.18687