You could 3D print a little sign holder handle...
➡️ https://www.printables.com/model/1248678-ergonomic-foam-board-holder-for-rallyprotest-signs
$\Delta L$ Normalization: Rethink Loss Aggregation in RLVR
Zhiyuan He, Xufang Luo, Yike Zhang, Yuqing Yang, Lili Qiu
https://arxiv.org/abs/2509.07558 https://
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following
Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li
https://arxiv.org/abs/2506.09942 …
🇺🇦 Auf radioeins läuft...
R. L. Burnside:
🎵 It's Bad You Know
#NowPlaying #RLBurnside
https://flamingomix.bandcamp.com/track/r-l-burnside-its-bad-you-know-flamingo-edit
https://open.spotify.com/track/1AcvqJhm4CXOFJ7INbR5rR
🇺🇦 #NowPlaying on BBCRadio3's #InTune
William Walton, Royal Liverpool Philharmonic Orchestra & Sir Charles Groves:
🎵 Funeral March Overture (Hamlet)
#WilliamWalton #RoyalLiverpoolPhilharmonicOrchestra #SirCharlesGroves
R1-RE: Cross-Domain Relationship Extraction with RLVR
Runpeng Dai, Tong Zheng, Run Yang, Hongtu Zhu
https://arxiv.org/abs/2507.04642 https://
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Long Li, Jiaran Hao, Jason Klein Liu, Zhijian Zhou, Xiaoyu Tan, Wei Chu, Zhe Wang, Shirui Pan, Chao Qu, Yuan Qi
https://arxiv.org/abs/2509.07430
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards
Xu Guo, Tianyi Liang, Tong Jian, Xiaogui Yang, Ling-I Wu, Chenhui Li, Zhihui Lu, Qipeng Guo, Kai Chen
https://arxiv.org/abs/2508.04632
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
Jiaming Li, Longze Chen, Ze Gong, Yukun Chen, Lu Wang, Wanwei He, Run Luo, Min Yang
https://arxiv.org/abs/2509.02522
Rethinking Verification for LLM Code Generation: From Generation to Testing
Zihan Ma, Taolin Zhang, Maosong Cao, Wenwei Zhang, Minnan Luo, Songyang Zhang, Kai Chen
https://arxiv.org/abs/2507.06920