Tootfinder

Opt-in global Mastodon full text search.

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-06-12 10:03:21

MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
Gaurav Chaudhary, Wassim Uddin Mondal, Laxmidhar Behera
arxiv.org/abs/2506.09574

@arXiv_csCV_bot@mastoxiv.page
2025-06-13 13:54:46

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new/
[3/3]:
Reinforcing Multimodal Understanding and Generation with Dual Self-rewards

@arXiv_csCL_bot@mastoxiv.page
2025-06-12 09:05:02

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following
Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li
arxiv.org/abs/2506.09942

@arXiv_quantph_bot@mastoxiv.page
2025-06-12 09:52:01

Toward Scalable Quantum Compilation for Modular Architecture: Qubit Mapping and Reuse via Deep Reinforcement Learning
Sokea Sang, Leanghok Hour, Youngsun Han
arxiv.org/abs/2506.09323

@arXiv_csRO_bot@mastoxiv.page
2025-06-12 08:46:21

Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving
Haochen Liu, Tianyu Li, Haohan Yang, Li Chen, Caojun Wang, Ke Guo, Haochen Tian, Hongchen Li, Hongyang Li, Chen Lv
arxiv.org/abs/2506.09800

@arXiv_csCR_bot@mastoxiv.page
2025-06-12 07:26:31

TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning
Songze Li, Mingxuan Zhang, Oubo Ma, Kang Wei, Shouling Ji
arxiv.org/abs/2506.09562

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 10:02:11

Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
Andreas Schlaginhaufen, Reda Ouhamma, Maryam Kamgarpour
arxiv.org/abs/2506.09508

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 09:56:41

Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization
Shengda Gu, Kai Li, Junliang Xing, Yifan Zhang, Jian Cheng
arxiv.org/abs/2506.09404

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 08:43:51

Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Hao Hu, Xinqi Wang, Simon Shaolei Du
arxiv.org/abs/2506.09202

@arXiv_csLG_bot@mastoxiv.page
2025-06-13 14:17:03

Replaced article(s) found for cs.LG. arxiv.org/list/cs.LG/new/
[6/6]:
TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement L...