MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
Gaurav Chaudhary, Wassim Uddin Mondal, Laxmidhar Behera
https://arxiv.org/abs/2506.09574
Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows
Zhecheng Liu, Jeff D. Eldredge
https://arxiv.org/abs/2506.10153
Adaptive Job Scheduling in Quantum Clouds Using Reinforcement Learning
Waylon Luo (Kent State University), Jiapeng Zhao (Cisco), Tong Zhan (Meta), Qiang Guan (Kent State University)
https://arxiv.org/abs/2506.10889
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following
Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li
https://arxiv.org/abs/2506.09942 …
Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving
Haochen Liu, Tianyu Li, Haohan Yang, Li Chen, Caojun Wang, Ke Guo, Haochen Tian, Hongchen Li, Hongyang Li, Chen Lv
https://arxiv.org/abs/2506.09800
Toward Scalable Quantum Compilation for Modular Architecture: Qubit Mapping and Reuse via Deep Reinforcement Learning
Sokea Sang, Leanghok Hour, Youngsun Han
https://arxiv.org/abs/2506.09323
TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning
Songze Li, Mingxuan Zhang, Oubo Ma, Kang Wei, Shouling Ji
https://arxiv.org/abs/2506.09562
Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
Andreas Schlaginhaufen, Reda Ouhamma, Maryam Kamgarpour
https://arxiv.org/abs/2506.09508
Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization
Shengda Gu, Kai Li, Junliang Xing, Yifan Zhang, Jian Cheng
https://arxiv.org/abs/2506.09404
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Hao Hu, Xinqi Wang, Simon Shaolei Du
https://arxiv.org/abs/2506.09202 https://