Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csHC_bot@mastoxiv.page
2025-09-15 09:37:11

The Language of Approval: Identifying the Drivers of Positive Feedback Online
Agam Goyal, Charlotte Lambert, Eshwar Chandrasekharan
arxiv.org/abs/2509.10370

@Mediagazer@mstdn.social
2025-11-13 22:31:05

Sources: the first-round bid deadline for WBD is November 20; Paramount wants the full company, and Comcast and Netflix are eying movie/TV studios and HBO Max (Wall Street Journal)
wsj.com/business/media/paramou

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:33:30

Token-Level Policy Optimization: Linking Group-Level Rewards to Token-Level Aggregation via Markov Likelihood
Xingyu Lin, Yilin Wen, En Wang, Du Su, Wenbin Liu, Chenfu Bao, Zhonghou Lv
arxiv.org/abs/2510.09369

@arXiv_csIR_bot@mastoxiv.page
2025-10-15 08:48:12

Reinforced Preference Optimization for Recommendation
Junfei Tan, Yuxin Chen, An Zhang, Junguang Jiang, Bin Liu, Ziru Xu, Han Zhu, Jian Xu, Bo Zheng, Xiang Wang
arxiv.org/abs/2510.12211

@arXiv_csSI_bot@mastoxiv.page
2025-09-15 08:31:21

TikTok Rewards Divisive Political Messaging During the 2025 German Federal Election
Kirill Solovev, Chiara Drolsbach, Emma Demirel, Nicolas Pr\"ollochs
arxiv.org/abs/2509.10336

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:46:40

BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
Sangyun Lee, Brandon Amos, Giulia Fanti
arxiv.org/abs/2510.09596

@Techmeme@techhub.social
2025-11-10 23:20:45

The US Treasury and IRS issue guidance allowing crypto products to offer staking rewards under a new safe harbor (Sander Lutz/Decrypt)
decrypt.co/348044/ethereum-sol

@arXiv_csAI_bot@mastoxiv.page
2025-10-14 12:23:38

From <Answer> to <Think>: Multidimensional Supervision of Reasoning Process for LLM Optimization
Beining Wang, Weihang Su, Hongtao Tian, Tao Yang, Yujia Zhou, Ting Yao, Qingyao Ai, Yiqun Liu
arxiv.org/abs/2510.11457

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 10:13:20

Guiding Energy-Efficient Locomotion through Impact Mitigation Rewards
Chenghao Wang, Arjun Viswanathan, Eric Sihite, Alireza Ramezani
arxiv.org/abs/2510.09543

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:44:11

Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
arxiv.org/abs/2510.12643 arxiv.org…