Tootfinder

Opt-in global Mastodon full-text search.

No exact results. Similar results found.
@arXiv_csAI_bot@mastoxiv.page
2025-09-01 08:36:33

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Yi Liao, Yu Gu, Yuan Sui, Zining Zhu, Yifan Lu, Guohua Tang, Zhongqian Sun, Wei Yang
arxiv.org/abs/2508.21365

@arXiv_csIR_bot@mastoxiv.page
2025-06-30 09:51:20

Reward Balancing Revisited: Enhancing Offline Reinforcement Learning for Recommender Systems
Wenzheng Shu, Yanxiang Zeng, Yongxiang Tang, Teng Sha, Ning Luo, Yanhua Cheng, Xialong Liu, Fan Zhou, Peng Jiang
arxiv.org/abs/2506.22112

@arXiv_csCV_bot@mastoxiv.page
2025-07-30 10:42:21

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
Zigang Geng, Yibing Wang, Yeyao Ma, Chen Li, Yongming Rao, Shuyang Gu, Zhao Zhong, Qinglin Lu, Han Hu, Xiaosong Zhang, Linus, Di Wang, Jie Jiang
arxiv.org/abs/2507.22058

@arXiv_eessSY_bot@mastoxiv.page
2025-07-31 08:46:21

Toward Trusted Onboard AI: Advancing Small Satellite Operations using Reinforcement Learning
Cannon Whitney, Joseph Melville
arxiv.org/abs/2507.22198

@arXiv_csCL_bot@mastoxiv.page
2025-08-28 10:01:31

Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Sikuan Yan, Xiufeng Yang, Zuchao Huang, Ercong Nie, Zifeng Ding, Zonggen Li, Xiaowen Ma, Hinrich Schütze, Volker Tresp, Yunpu Ma
arxiv.org/abs/2508.19828

@usul@piaille.fr
2025-06-28 10:58:07

Slowly but surely, smoking is going to get complicated....
Thanks to smokers for their efforts
The ban on smoking in parks, at the beach, and near schools takes effect this Sunday

@arXiv_csRO_bot@mastoxiv.page
2025-08-29 09:47:21

Genetic Informed Trees (GIT*): Path Planning via Reinforced Genetic Programming Heuristics
Liding Zhang, Kuanqi Cai, Zhenshan Bing, Chaoqun Wang, Alois Knoll
arxiv.org/abs/2508.20871

@arXiv_csHC_bot@mastoxiv.page
2025-08-22 09:32:31

Demystifying Reward Design in Reinforcement Learning for Upper Extremity Interaction: Practical Guidelines for Biomechanical Simulations in HCI
Hannah Selder, Florian Fischer, Per Ola Kristensson, Arthur Fleig
arxiv.org/abs/2508.15727

@primonatura@mstdn.social
2025-08-22 17:00:42

"Belize project seeks out heat-resilient corals to protect its reefs"
#Belize #CoralReef #Environment

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:25:50

Improving Reinforcement Learning Sample-Efficiency using Local Approximation
Mohit Prashant, Arvind Easwaran
arxiv.org/abs/2507.12383