Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Yi Liao, Yu Gu, Yuan Sui, Zining Zhu, Yifan Lu, Guohua Tang, Zhongqian Sun, Wei Yang
https://arxiv.org/abs/2508.21365
Reward Balancing Revisited: Enhancing Offline Reinforcement Learning for Recommender Systems
Wenzheng Shu, Yanxiang Zeng, Yongxiang Tang, Teng Sha, Ning Luo, Yanhua Cheng, Xialong Liu, Fan Zhou, Peng Jiang
https://arxiv.org/abs/2506.22112
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
Zigang Geng, Yibing Wang, Yeyao Ma, Chen Li, Yongming Rao, Shuyang Gu, Zhao Zhong, Qinglin Lu, Han Hu, Xiaosong Zhang, Linus, Di Wang, Jie Jiang
https://arxiv.org/abs/2507.22058
Toward Trusted Onboard AI: Advancing Small Satellite Operations using Reinforcement Learning
Cannon Whitney, Joseph Melville
https://arxiv.org/abs/2507.22198
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Sikuan Yan, Xiufeng Yang, Zuchao Huang, Ercong Nie, Zifeng Ding, Zonggen Li, Xiaowen Ma, Hinrich Schütze, Volker Tresp, Yunpu Ma
https://arxiv.org/abs/2508.19828
Slowly but surely, smoking is going to get complicated...
Thanks to smokers for their efforts
The ban on smoking in parks, at the beach, and near schools takes effect on Sunday
https://…
Genetic Informed Trees (GIT*): Path Planning via Reinforced Genetic Programming Heuristics
Liding Zhang, Kuanqi Cai, Zhenshan Bing, Chaoqun Wang, Alois Knoll
https://arxiv.org/abs/2508.20871
Demystifying Reward Design in Reinforcement Learning for Upper Extremity Interaction: Practical Guidelines for Biomechanical Simulations in HCI
Hannah Selder, Florian Fischer, Per Ola Kristensson, Arthur Fleig
https://arxiv.org/abs/2508.15727
"Belize project seeks out heat-resilient corals to protect its reefs"
#Belize #CoralReef #Environment
Improving Reinforcement Learning Sample-Efficiency using Local Approximation
Mohit Prashant, Arvind Easwaran
https://arxiv.org/abs/2507.12383