Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csRO_bot@mastoxiv.page
2025-06-19 08:34:23

Booster Gym: An End-to-End Reinforcement Learning Framework for Humanoid Robot Locomotion
Yushi Wang, Penghui Chen, Xinyu Han, Feng Wu, Mingguo Zhao
arxiv.org/abs/2506.15132

@kexpmusicbot@mastodonapp.uk
2025-06-18 23:51:09

🇺🇦 #NowPlaying on KEXP's #DriveTime
Björk:
🎵 There’s More to Life Than This (recorded live at the Milk Bar toilets)
#Björk
bjork.bandcamp.com/track/there
open.spotify.com/track/2runFrI

@arXiv_eessSY_bot@mastoxiv.page
2025-06-19 08:44:37

Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks
Yimian Ding, Jingzehua Xu, Guanwen Xie, Shuai Zhang, Yi Li
arxiv.org/abs/2506.15082

@arXiv_csNI_bot@mastoxiv.page
2025-06-19 08:28:34

GCN-Driven Reinforcement Learning for Probabilistic Real-Time Guarantees in Industrial URLLC
Eman Alqudah, Ashfaq Khokhar
arxiv.org/abs/2506.15011

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:03:26

Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning
Martin Klissarov, Akhil Bagaria, Ziyan Luo, George Konidaris, Doina Precup, Marlos C. Machado
arxiv.org/abs/2506.14045

@arXiv_eessSP_bot@mastoxiv.page
2025-06-19 08:44:12

Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access
Anup Mishra, \v{C}edomir Stefanovi\'c, Xiuqiang Xu, Petar Popovski, Israel Leyva-Mayorga
arxiv.org/abs/2506.15273

@arXiv_csCL_bot@mastoxiv.page
2025-06-18 09:12:51

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Ring Team, Bin Hu, Cai Chen, Deng Zhao, Ding Liu, Dingnan Jin, Feng Zhu, Hao Dai, Hongzhi Luan, Jia Guo, Jiaming Liu, Jiewei Wu, Jun Mei, Jun Zhou, Junbo Zhao, Junwu Xiong, Kaihong Zhang, Kuan Xu, Lei Liang, Liang Jiang, Liangcheng Fu, Longfei Zheng, Qiang Gao, Qing Cui, Quan Wan, Shaomian Zheng, Shuaicheng Li, Tongkai Yang, Wang Ren, Xiaodong Yan, Xiaopei Wan, Xiaoyun Feng, Xin Zhao, Xinxing Yang, Xinyu …

@izzychambers@vivaldi.net
2025-06-19 12:11:15

@… Two or three years ago, an old friend was in town visiting. We had lunch and discussed, among other things, our biggest fears. She said fascism, followed by climate change. I said climate change, followed by fascism. I guess she won, at least in the short term. And we agreed that the two reinforce each other.

@arXiv_physicsoptics_bot@mastoxiv.page
2025-06-19 10:02:07

Design of an all-facet illuminator for high NA EUV lithography exposure tool based on deep reinforcement learning
Tong Li, Yuqing Chen, Yanqiu Li, Lihui Liu
arxiv.org/abs/2506.15558

@arXiv_csRO_bot@mastoxiv.page
2025-06-18 09:23:35

SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning
Hexian Ni, Tao Lu, Haoyuan Hu, Yinghao Cai, Shuo Wang
arxiv.org/abs/2506.14648