Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-07-31 09:20:21

SourceSplice: Source Selection for Machine Learning Tasks
Ambarish Singh, Romila Pradhan
arxiv.org/abs/2507.22186 arxiv.org/pdf/2507.22186

@arXiv_csRO_bot@mastoxiv.page
2025-08-01 09:55:41

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents
Shaofei Cai, Zhancun Mu, Haiwen Xia, Bowei Zhang, Anji Liu, Yitao Liang
arxiv.org/abs/2507.23698

@arXiv_csHC_bot@mastoxiv.page
2025-08-01 08:39:41

iLearnRobot: An Interactive Learning-Based Multi-Modal Robot with Continuous Improvement
Kohou Wang, ZhaoXiang Liu, Lin Bai, Kun Fan, Xiang Liu, Huan Hu, Kai Wang, Shiguo Lian
arxiv.org/abs/2507.22896

@oekologisch_unterwegs@mastodon.online
2025-08-30 16:24:09

Die #Graugans, der Vorfahre unserer #Hausgans, beeindruckt mit bis zu 90 cm Länge und 4 kg Gewicht. Diese Vögel sind in Europa und Westasien verbreitet und bevorzugen Landschaften mit Zugang zu Süßwasser. Ihr lautes Geschnatter ist charakteristisch, aber leider habe ich noch keine Tonaufnahme. 🦢🌾🌍…

@arXiv_csIR_bot@mastoxiv.page
2025-07-01 07:43:33

Machine Assistant with Reliable Knowledge: Enhancing Student Learning via RAG-based Retrieval
Yongsheng Lian
arxiv.org/abs/2506.23026

@arXiv_csCV_bot@mastoxiv.page
2025-06-30 10:16:50

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning
Xi Chen, Mingkang Zhu, Shaoteng Liu, Xiaoyang Wu, Xiaogang Xu, Yu Liu, Xiang Bai, Hengshuang Zhao
arxiv.org/abs/2506.22434

@arXiv_csCL_bot@mastoxiv.page
2025-07-30 10:28:01

Post-Training Large Language Models via Reinforcement Learning from Self-Feedback
Carel van Niekerk, Renato Vukovic, Benjamin Matthias Ruppik, Hsien-chin Lin, Milica Ga\v{s}i\'c
arxiv.org/abs/2507.21931

@arXiv_csCL_bot@mastoxiv.page
2025-07-30 10:18:01

Libra: Assessing and Improving Reward Model by Learning to Think
Meng Zhou, Bei Li, Jiahao Liu, Xiaowen Shi, Yang Bai, Rongxiang Weng, Jingang Wang, Xunliang Cai
arxiv.org/abs/2507.21645

@arXiv_csCL_bot@mastoxiv.page
2025-07-31 09:53:11

From Sufficiency to Reflection: Reinforcement-Guided Thinking Quality in Retrieval-Augmented Reasoning for LLMs
Jie He, Victor Gutierrez Basulto, Jeff Z. Pan
arxiv.org/abs/2507.22716

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 09:54:23

Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
Songtao Jiang, Yuxi Chen, Sibo Song, Yan Zhang, Yeying Jin, Yang Feng, Jian Wu, Zuozhu Liu
arxiv.org/abs/2508.18687