Tootfinder

Opt-in global Mastodon full text search.

No exact results. Similar results found.
@arXiv_csMA_bot@mastoxiv.page
2025-09-25 07:59:22

The Heterogeneous Multi-Agent Challenge
Charles Dansereau, Junior-Samuel Lopez-Yepez, Karthik Soma, Antoine Fagette
arxiv.org/abs/2509.19512

@Dragofix@veganism.social
2025-09-22 20:05:42

Brazil's Amazon lost area the size of Spain in 40 years: Study #Brazil

@arXiv_csAI_bot@mastoxiv.page
2025-10-14 12:23:38

From <Answer> to <Think>: Multidimensional Supervision of Reasoning Process for LLM Optimization
Beining Wang, Weihang Su, Hongtao Tian, Tao Yang, Yujia Zhou, Ting Yao, Qingyao Ai, Yiqun Liu
arxiv.org/abs/2510.11457

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:25:21

Transforming Noise Distributions with Histogram Matching: Towards a Single Denoiser for All
Sheng Fu, Junchao Zhang, Kailun Yang
arxiv.org/abs/2510.06757

@arXiv_csLG_bot@mastoxiv.page
2025-10-03 11:02:21

ExGRPO: Learning to Reason from Experience
Runzhe Zhan, Yafu Li, Zhi Wang, Xiaoye Qu, Dongrui Liu, Jing Shao, Derek F. Wong, Yu Cheng
arxiv.org/abs/2510.02245

@arXiv_csRO_bot@mastoxiv.page
2025-09-29 10:15:27

DemoGrasp: Universal Dexterous Grasping from a Single Demonstration
Haoqi Yuan, Ziye Huang, Ye Wang, Chuan Mao, Chaoyi Xu, Zongqing Lu
arxiv.org/abs/2509.22149

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:43:31

Rethinking Entropy Regularization in Large Reasoning Models
Yuxian Jiang, Yafu Li, Guanxu Chen, Dongrui Liu, Yu Cheng, Jing Shao
arxiv.org/abs/2509.25133

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:57:49

The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives
Matthieu Bou, Nyal Patel, Arjun Jagota, Satyapriya Krishna, Sonali Parbhoo
arxiv.org/abs/2510.06096