Tootfinder

Opt-in global Mastodon full text search.

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-06-12 10:03:21

MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
Gaurav Chaudhary, Wassim Uddin Mondal, Laxmidhar Behera
arxiv.org/abs/2506.09574

@arXiv_physicsfludyn_bot@mastoxiv.page
2025-06-13 09:25:00

Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows
Zhecheng Liu, Jeff D. Eldredge
arxiv.org/abs/2506.10153

@arXiv_csDC_bot@mastoxiv.page
2025-06-13 07:44:50

Adaptive Job Scheduling in Quantum Clouds Using Reinforcement Learning
Waylon Luo (Kent State University), Jiapeng Zhao (Cisco), Tong Zhan (Meta), Qiang Guan (Kent State University)
arxiv.org/abs/2506.10889

@arXiv_csCL_bot@mastoxiv.page
2025-06-12 09:05:02

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following
Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li
arxiv.org/abs/2506.09942

@arXiv_csRO_bot@mastoxiv.page
2025-06-12 08:46:21

Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving
Haochen Liu, Tianyu Li, Haohan Yang, Li Chen, Caojun Wang, Ke Guo, Haochen Tian, Hongchen Li, Hongyang Li, Chen Lv
arxiv.org/abs/2506.09800

@arXiv_quantph_bot@mastoxiv.page
2025-06-12 09:52:01

Toward Scalable Quantum Compilation for Modular Architecture: Qubit Mapping and Reuse via Deep Reinforcement Learning
Sokea Sang, Leanghok Hour, Youngsun Han
arxiv.org/abs/2506.09323

@arXiv_csCR_bot@mastoxiv.page
2025-06-12 07:26:31

TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning
Songze Li, Mingxuan Zhang, Oubo Ma, Kang Wei, Shouling Ji
arxiv.org/abs/2506.09562

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 10:02:11

Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
Andreas Schlaginhaufen, Reda Ouhamma, Maryam Kamgarpour
arxiv.org/abs/2506.09508

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 09:56:41

Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization
Shengda Gu, Kai Li, Junliang Xing, Yifan Zhang, Jian Cheng
arxiv.org/abs/2506.09404

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 08:43:51

Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Hao Hu, Xinqi Wang, Simon Shaolei Du
arxiv.org/abs/2506.09202