Booster Gym: An End-to-End Reinforcement Learning Framework for Humanoid Robot Locomotion
Yushi Wang, Penghui Chen, Xinyu Han, Feng Wu, Mingguo Zhao
https://arxiv.org/abs/2506.15132
🇺🇦 #NowPlaying on KEXP's #DriveTime
Björk:
🎵 There’s More to Life Than This (recorded live at the Milk Bar toilets)
#Björk
https://bjork.bandcamp.com/track/theres-more-to-life-than-this-recorded-live-at-the-milk-bar-toilets
https://open.spotify.com/track/2runFrIwOg5p2HZo1oymEL
Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks
Yimian Ding, Jingzehua Xu, Guanwen Xie, Shuai Zhang, Yi Li
https://arxiv.org/abs/2506.15082
GCN-Driven Reinforcement Learning for Probabilistic Real-Time Guarantees in Industrial URLLC
Eman Alqudah, Ashfaq Khokhar
https://arxiv.org/abs/2506.15011 …
Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning
Martin Klissarov, Akhil Bagaria, Ziyan Luo, George Konidaris, Doina Precup, Marlos C. Machado
https://arxiv.org/abs/2506.14045
Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access
Anup Mishra, \v{C}edomir Stefanovi\'c, Xiuqiang Xu, Petar Popovski, Israel Leyva-Mayorga
https://arxiv.org/abs/2506.15273
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Ring Team, Bin Hu, Cai Chen, Deng Zhao, Ding Liu, Dingnan Jin, Feng Zhu, Hao Dai, Hongzhi Luan, Jia Guo, Jiaming Liu, Jiewei Wu, Jun Mei, Jun Zhou, Junbo Zhao, Junwu Xiong, Kaihong Zhang, Kuan Xu, Lei Liang, Liang Jiang, Liangcheng Fu, Longfei Zheng, Qiang Gao, Qing Cui, Quan Wan, Shaomian Zheng, Shuaicheng Li, Tongkai Yang, Wang Ren, Xiaodong Yan, Xiaopei Wan, Xiaoyun Feng, Xin Zhao, Xinxing Yang, Xinyu …
@… Two or three years ago, an old friend was in town visiting. We had lunch and discussed, among other things, our biggest fears. She said fascism, followed by climate change. I said climate change, followed by fascism. I guess she won, at least in the short term. And we agreed that the two reinforce each other.
Design of an all-facet illuminator for high NA EUV lithography exposure tool based on deep reinforcement learning
Tong Li, Yuqing Chen, Yanqiu Li, Lihui Liu
https://arxiv.org/abs/2506.15558
SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning
Hexian Ni, Tao Lu, Haoyuan Hu, Yinghao Cai, Shuo Wang
https://arxiv.org/abs/2506.14648