Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csAI_bot@mastoxiv.page
2025-08-15 08:18:52

Promoting Efficient Reasoning with Verifiable Stepwise Reward
Chuhuai Yue, Chengqi Dong, Yinan Gao, Hang He, Jiajun Chai, Guojun Yin, Wei Lin
arxiv.org/abs/2508.10293

@fanf@mendeddrum.org
2025-09-14 14:42:03

from my link log —
cargo-crev: A web-of-trust code review system for Rust.
github.com/crev-dev/cargo-crev
saved 2025-09-09

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:39:28

A Scalable, Privacy-Preserving Decentralized Identity and Verifiable Data Sharing Framework based on Zero-Knowledge Proofs
Hui Yuan
arxiv.org/abs/2510.09715

@arXiv_csLG_bot@mastoxiv.page
2025-08-15 10:21:22

Efficiently Verifiable Proofs of Data Attribution
Ari Karchmer, Seth Neel, Martin Pawelczyk
arxiv.org/abs/2508.10866 arxiv.org/pdf/2508.108…

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:44:11

Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
arxiv.org/abs/2510.12643 arxiv.org…

@arXiv_csAI_bot@mastoxiv.page
2025-08-15 09:21:52

MM-Food-100K: A 100,000-Sample Multimodal Food Intelligence Dataset with Verifiable Provenance
Yi Dong, Yusuke Muraoka, Scott Shi, Yi Zhang
arxiv.org/abs/2508.10429

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 08:21:22

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
Ruida Wang, Jiarui Yao, Rui Pan, Shizhe Diao, Tong Zhang
arxiv.org/abs/2510.11769

@arXiv_csLG_bot@mastoxiv.page
2025-08-15 10:19:22

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models
Zhipeng Chen, Xiaobo Qin, Youbin Wu, Yue Ling, Qinghao Ye, Wayne Xin Zhao, Guang Shi
arxiv.org/abs/2508.10751

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 07:38:52

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement
Weitao Jia, Jinghui Lu, Haiyang Yu, Siqi Wang, Guozhi Tang, An-Lan Wang, Weijie Yin, Dingkang Yang, Yuxiang Nie, Bin Shan, Hao Feng, Irene Li, Kun Yang, Han Wang, Jingqun Tang, Teng Fu, Changhong Jin, Chao Feng, Xiaohui Lv, Can Huang
arxiv.org/abs/2508.09670…

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 08:58:12

RAGulating Compliance: A Multi-Agent Knowledge Graph for Regulatory QA
Bhavik Agarwal, Hemant Sunil Jomraj, Simone Kaplunov, Jack Krolick, Viktoria Rojkova
arxiv.org/abs/2508.09893