Tootfinder

Opt-in global Mastodon full-text search.

@arXiv_csDC_bot@mastoxiv.page
2025-06-05 07:17:43

Crowd-SFT: Crowdsourcing for LLM Alignment
Alex Sotiropoulos, Sulyab Thottungal Valapu, Linus Lei, Jared Coleman, Bhaskar Krishnamachari
arxiv.org/abs/2506.04063

@arXiv_csCL_bot@mastoxiv.page
2025-06-03 08:19:46

Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning
Fangyu Lei, Jinxiang Meng, Yiming Huang, Tinghong Chen, Yun Zhang, Shizhu He, Jun Zhao, Kang Liu
arxiv.org/abs/2506.01710

@arXiv_csSE_bot@mastoxiv.page
2025-06-03 17:33:48

The submission at arxiv.org/abs/2505.23387 has been replaced.
Initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-04 07:26:46

BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage
Kalyan Nakka, Nitesh Saxena
arxiv.org/abs/2506.02479

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:20:42

Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs
Yufa Zhou, Shaobo Wang, Xingyu Dong, Xiangqi Jin, Yifang Chen, Yue Min, Kexin Yang, Xingzhang Ren, Dayiheng Liu, Linfeng Zhang
arxiv.org/abs/2506.00577

@arXiv_csGR_bot@mastoxiv.page
2025-06-02 09:56:59

The submission at arxiv.org/abs/2505.19713 has been replaced.
Initial toot: mastoxiv.page/@arXiv_csGR_…

@arXiv_csSE_bot@mastoxiv.page
2025-05-30 07:21:33

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
Mingzhe Du, Luu Tuan Tuan, Yue Liu, Yuhao Qing, Dong Huang, Xinyi He, Qian Liu, Zejun Ma, See-kiong Ng
arxiv.org/abs/2505.23387