Tootfinder

Opt-in global Mastodon full text search.

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-06-03 08:21:02

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Shenzhi Wang, Le Yu, Chang Gao, Chujie Zheng, Shixuan Liu, Rui Lu, Kai Dang, Xionghui Chen, Jianxin Yang, Zhenru Zhang, Yuqiong Liu, An Yang, Andrew Zhao, Yang Yue, Shiji Song, Bowen Yu, Gao Huang, Junyang Lin
arx…

@arXiv_csSE_bot@mastoxiv.page
2025-06-03 07:29:49

CRScore++: Reinforcement Learning with Verifiable Tool and AI Feedback for Code Review
Manav Nitin Kapadnis, Atharva Naik, Carolyn Rose
arxiv.org/abs/2506.00296

@BBC3MusicBot@mastodonapp.uk
2025-07-04 17:05:17

🇺🇦 #NowPlaying on BBCRadio3's #InTune
Charles Gounod, Royal Liverpool Philharmonic Orchestra & Libor Pešek:
🎵 Mors et Vita (Judex)
#CharlesGounod #RoyalLiverpoolPhilharmonicOrchestra #LiborPešek

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:20:42

Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs
Yufa Zhou, Shaobo Wang, Xingyu Dong, Xiangqi Jin, Yifang Chen, Yue Min, Kexin Yang, Xingzhang Ren, Dayiheng Liu, Linfeng Zhang
arxiv.org/abs/2506.00577

@BBC6MusicBot@mastodonapp.uk
2025-07-03 15:02:11

🇺🇦 #NowPlaying on #BBC6Music's #HuwStephens
Beastie Boys:
🎵 (You Gotta) Fight for Your Right (To Party!)
#BeastieBoys
rolivaroliva.bandcamp.com/trac
open.spotify.com/track/5NLuC70

@gwire@mastodon.social
2025-06-25 08:01:50

> She explained, “I just need the paper. I need to write things down. [The iPad script] also has so many passwords and then I would have ADD and then do something else and then it would lock me out … it was so complicated. So I printed it.”
When actors bypass corporate controls to get their work done, it's a showbiz story.

@mxp@mastodon.acm.org
2025-05-30 22:30:09

@… @… You don’t seem to be the only one who has this association… I won’t comment, RLPers-VD oblige.

@BBC3MusicBot@mastodonapp.uk
2025-08-04 15:43:05

🇺🇦 #NowPlaying on BBCRadio3's #ComposerOfTheWeek #COTW
Dmitry Shostakovich, Royal Liverpool Philharmonic Orchestra & Vasily Petrenko:
🎵 Symphony No 7 "Leningrad" Op 60 (2nd mvt)
#DmitryShostakovich #RoyalLiverpoolPhilharmonicOrchestra #VasilyPetrenko

@arXiv_csCL_bot@mastoxiv.page
2025-07-22 12:23:50

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR
Jiakang Wang, Runze Liu, Fuzheng Zhang, Xiu Li, Guorui Zhou
arxiv.org/abs/2507.15778

@BBC3MusicBot@mastodonapp.uk
2025-07-05 16:09:18

🇺🇦 #NowPlaying on BBCRadio3's #ThisClassicalLife
Dani Howard, Peter Moore, Royal Liverpool Philharmonic Orchestra & Michael Seal:
🎵 Trombone Concerto (1st mvt. Realisation)
#DaniHoward #PeterMoore #RoyalLiverpoolPhilharmonicOrchestra #MichaelSeal