Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@fanf@mendeddrum.org
2025-09-14 14:42:03

from my link log —
cargo-crev: A web-of-trust code review system for Rust.
github.com/crev-dev/cargo-crev
saved 2025-09-09

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:39:28

A Scalable, Privacy-Preserving Decentralized Identity and Verifiable Data Sharing Framework based on Zero-Knowledge Proofs
Hui Yuan
arxiv.org/abs/2510.09715

@arXiv_csSE_bot@mastoxiv.page
2025-10-13 09:18:20

Faver: Boosting LLM-based RTL Generation with Function Abstracted Verifiable Middleware
Jianan Mu, Mingyu Shi, Yining Wang, Tianmeng Yang, Bin Sun, Xing Hu, Jing Ye, Huawei Li
arxiv.org/abs/2510.08664

@arXiv_csAI_bot@mastoxiv.page
2025-08-11 09:13:39

SKATE, a Scalable Tournament Eval: Weaker LLMs differentiate between stronger ones using verifiable challenges
Dewi S. W. Gould, Bruno Mlodozeniec, Samuel F. Brown
arxiv.org/abs/2508.06111

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:25:00

Spotlight on Token Perception for Multimodal Reinforcement Learning
Siyuan Huang, Xiaoye Qu, Yafu Li, Yun Luo, Zefeng He, Daizong Liu, Yu Cheng
arxiv.org/abs/2510.09285

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 10:51:29

Interpreting LLM-as-a-Judge Policies via Verifiable Global Explanations
Jasmina Gajcin, Erik Miehling, Rahul Nair, Elizabeth Daly, Radu Marinescu, Seshu Tirupathi
arxiv.org/abs/2510.08120

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 07:38:52

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement
Weitao Jia, Jinghui Lu, Haiyang Yu, Siqi Wang, Guozhi Tang, An-Lan Wang, Weijie Yin, Dingkang Yang, Yuxiang Nie, Bin Shan, Hao Feng, Irene Li, Kun Yang, Han Wang, Jingqun Tang, Teng Fu, Changhong Jin, Chao Feng, Xiaohui Lv, Can Huang
arxiv.org/abs/2508.09670…

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 09:43:42

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning
Vaishnavi Shrivastava, Ahmed Awadallah, Vidhisha Balachandran, Shivam Garg, Harkirat Behl, Dimitris Papailiopoulos
arxiv.org/abs/2508.09726

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 08:58:12

RAGulating Compliance: A Multi-Agent Knowledge Graph for Regulatory QA
Bhavik Agarwal, Hemant Sunil Jomraj, Simone Kaplunov, Jack Krolick, Viktoria Rojkova
arxiv.org/abs/2508.09893

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 07:32:32

ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning
Shu Zhao, Tan Yu, Anbang Xu, Japinder Singh, Aaditya Shukla, Rama Akkiraju
arxiv.org/abs/2508.09303