Promoting Efficient Reasoning with Verifiable Stepwise Reward
Chuhuai Yue, Chengqi Dong, Yinan Gao, Hang He, Jiajun Chai, Guojun Yin, Wei Lin
https://arxiv.org/abs/2508.10293 ht…
from my link log —
cargo-crev: A web-of-trust code review system for Rust.
https://github.com/crev-dev/cargo-crev
saved 2025-09-09 https://
A Scalable, Privacy-Preserving Decentralized Identity and Verifiable Data Sharing Framework based on Zero-Knowledge Proofs
Hui Yuan
https://arxiv.org/abs/2510.09715 https://
Efficiently Verifiable Proofs of Data Attribution
Ari Karchmer, Seth Neel, Martin Pawelczyk
https://arxiv.org/abs/2508.10866 https://arxiv.org/pdf/2508.108…
Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
https://arxiv.org/abs/2510.12643 https://arxiv.org…
MM-Food-100K: A 100,000-Sample Multimodal Food Intelligence Dataset with Verifiable Provenance
Yi Dong, Yusuke Muraoka, Scott Shi, Yi Zhang
https://arxiv.org/abs/2508.10429 http…
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
Ruida Wang, Jiarui Yao, Rui Pan, Shizhe Diao, Tong Zhang
https://arxiv.org/abs/2510.11769 https://
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models
Zhipeng Chen, Xiaobo Qin, Youbin Wu, Yue Ling, Qinghao Ye, Wayne Xin Zhao, Guang Shi
https://arxiv.org/abs/2508.10751
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement
Weitao Jia, Jinghui Lu, Haiyang Yu, Siqi Wang, Guozhi Tang, An-Lan Wang, Weijie Yin, Dingkang Yang, Yuxiang Nie, Bin Shan, Hao Feng, Irene Li, Kun Yang, Han Wang, Jingqun Tang, Teng Fu, Changhong Jin, Chao Feng, Xiaohui Lv, Can Huang
https://arxiv.org/abs/2508.09670…
RAGulating Compliance: A Multi-Agent Knowledge Graph for Regulatory QA
Bhavik Agarwal, Hemant Sunil Jomraj, Simone Kaplunov, Jack Krolick, Viktoria Rojkova
https://arxiv.org/abs/2508.09893