Q&A with researcher Petter Törnberg on his pre-print study showing how social media's structural architecture creates problematic outcomes like echo chambers (Jennifer Ouellette/Ars Technica)
https://arstechnica.com/science/2025/08/study-social-medi…
Stabilizing Long-term Multi-turn Reinforcement Learning with Gated Rewards
Zetian Sun, Dongfang Li, Zhuoen Chen, Yuhuai Qin, Baotian Hu
https://arxiv.org/abs/2508.10548 https://…
SSRL: Self-Search Reinforcement Learning
Yuchen Fan, Kaiyan Zhang, Heng Zhou, Yuxin Zuo, Yanxu Chen, Yu Fu, Xinwei Long, Xuekai Zhu, Che Jiang, Yuchen Zhang, Li Kang, Gang Chen, Cheng Huang, Zhizhou He, Bingning Wang, Lei Bai, Ning Ding, Bowen Zhou
https://arxiv.org/abs/2508.10874
While the Brazilian government is selling off the Amazon Rainforest to fossil fuel companies, banks like Banco Santander are bankrolling them to further destroy the Amazon. Without money, fossil fuel giants can’t drill. Sign the petition and tell Santander, stop destroying the Amazon! https://act.350.org/sign/Santander-ama
A Curriculum Learning Approach to Reinforcement Learning: Leveraging RAG for Multimodal Question Answering
Chenliang Zhang, Lin Wang, Yuanyuan Lu, Yusheng Qi, Kexin Wang, Peixu Hou, Wenshi Chen
https://arxiv.org/abs/2508.10337
UI-Venus Technical Report: Building High-performance UI Agents with RFT
Zhangxuan Gu, Zhengwen Zeng, Zhenyu Xu, Xingran Zhou, Shuheng Shen, Yunfei Liu, Beitong Zhou, Changhua Meng, Tianyu Xia, Weizhi Chen, Yue Wen, Jingya Dou, Fei Tang, Jinzhen Lin, Yulin Liu, Zhenlin Guo, Yichen Gong, Heng Jia, Changlong Gao, Yuan Guo, Yong Deng, Zhenyu Guo, Liang Chen, Weiqiang Wang
Inpainting-Guided Policy Optimization for Diffusion Large Language Models
Siyan Zhao, Mengchen Liu, Jing Huang, Miao Liu, Chenyu Wang, Bo Liu, Yuandong Tian, Guan Pang, Sean Bell, Aditya Grover, Feiyu Chen
https://arxiv.org/abs/2509.10396
Topic-Guided Reinforcement Learning with LLMs for Enhancing Multi-Document Summarization
Chuyuan Li, Austin Xu, Shafiq Joty, Giuseppe Carenini
https://arxiv.org/abs/2509.09852 h…
Arintra, whose AI medical coding system translates clinical documentation into insurance codes for healthcare providers, raised a $21M Series A led by Peak XV (Erin Brodwin/Axios)
https://www.axios.com/pro/health-tech-deals/2025/08/12/arintra-21m…
BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning
Ahmed Masry, Abhay Puri, Masoud Hashemi, Juan A. Rodriguez, Megh Thakkar, Khyati Mahajan, Vikas Yadav, Sathwik Tejaswi Madhusudhan, Alexandre Pich\'e, Dzmitry Bahdanau, Christopher Pal, David Vazquez, Enamul Hoque, Perouz Taslakian, Sai Rajeswar, Spandana Gella
https://