Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@rasterweb@mastodon.social
2025-06-12 18:58:55

You could 3D print a little sign holder handle...
➡️ printables.com/model/1248678-e

@arXiv_csLG_bot@mastoxiv.page
2025-09-10 10:36:11

$\Delta L$ Normalization: Rethink Loss Aggregation in RLVR
Zhiyuan He, Xufang Luo, Yike Zhang, Yuqing Yang, Lili Qiu
arxiv.org/abs/2509.07558

@arXiv_csCL_bot@mastoxiv.page
2025-06-12 09:05:02

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following
Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li
arxiv.org/abs/2506.09942

@radioeinsmusicbot@mastodonapp.uk
2025-08-09 11:18:08

🇺🇦 Auf radioeins läuft...
R. L. Burnside:
🎵 It's Bad You Know
#NowPlaying #RLBurnside
flamingomix.bandcamp.com/track
open.spotify.com/track/1AcvqJh

@BBC3MusicBot@mastodonapp.uk
2025-09-12 17:13:36

🇺🇦 #NowPlaying on BBCRadio3's #InTune
William Walton, Royal Liverpool Philharmonic Orchestra & Sir Charles Groves:
🎵 Funeral March Overture (Hamlet)
#WilliamWalton #RoyalLiverpoolPhilharmonicOrchestra #SirCharlesGroves

@arXiv_csCL_bot@mastoxiv.page
2025-07-08 13:50:11

R1-RE: Cross-Domain Relationship Extraction with RLVR
Runpeng Dai, Tong Zheng, Run Yang, Hongtu Zhu
arxiv.org/abs/2507.04642

@arXiv_csLG_bot@mastoxiv.page
2025-09-10 10:33:51

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Long Li, Jiaran Hao, Jason Klein Liu, Zhijian Zhou, Xiaoyu Tan, Wei Chu, Zhe Wang, Shirui Pan, Chao Qu, Yuan Qi
arxiv.org/abs/2509.07430

@arXiv_csCL_bot@mastoxiv.page
2025-08-07 10:27:24

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards
Xu Guo, Tianyi Liang, Tong Jian, Xiaogui Yang, Ling-I Wu, Chenhui Li, Zhihui Lu, Qipeng Guo, Kai Chen
arxiv.org/abs/2508.04632

@arXiv_csCL_bot@mastoxiv.page
2025-09-03 14:45:23

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
Jiaming Li, Longze Chen, Ze Gong, Yukun Chen, Lu Wang, Wanwei He, Run Luo, Min Yang
arxiv.org/abs/2509.02522

@arXiv_csCL_bot@mastoxiv.page
2025-07-10 10:05:21

Rethinking Verification for LLM Code Generation: From Generation to Testing
Zihan Ma, Taolin Zhang, Maosong Cao, Wenwei Zhang, Minnan Luo, Songyang Zhang, Kai Chen
arxiv.org/abs/2507.06920