Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csSE_bot@mastoxiv.page
2025-07-18 09:11:32

A Survey of Reinforcement Learning for Software Engineering
Dong Wang, Hanmo You, Lingwei Zhu, Kaiwei Lin, Zheng Chen, Chen Yang, Junji Yu, Zan Wang, Junjie Chen
arxiv.org/abs/2507.12483

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:03:26

Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning
Martin Klissarov, Akhil Bagaria, Ziyan Luo, George Konidaris, Doina Precup, Marlos C. Machado
arxiv.org/abs/2506.14045

@arXiv_csCL_bot@mastoxiv.page
2025-06-18 09:12:51

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Ring Team, Bin Hu, Cai Chen, Deng Zhao, Ding Liu, Dingnan Jin, Feng Zhu, Hao Dai, Hongzhi Luan, Jia Guo, Jiaming Liu, Jiewei Wu, Jun Mei, Jun Zhou, Junbo Zhao, Junwu Xiong, Kaihong Zhang, Kuan Xu, Lei Liang, Liang Jiang, Liangcheng Fu, Longfei Zheng, Qiang Gao, Qing Cui, Quan Wan, Shaomian Zheng, Shuaicheng Li, Tongkai Yang, Wang Ren, Xiaodong Yan, Xiaopei Wan, Xiaoyun Feng, Xin Zhao, Xinxing Yang, Xinyu …

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:51:32

Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour
Emma M. A. Harrison
arxiv.org/abs/2507.13277

@arXiv_csCE_bot@mastoxiv.page
2025-07-18 07:39:22

Quantum-Enhanced Reinforcement Learning with LSTM Forecasting Signals for Optimizing Fintech Trading Decisions
Yen-Ku Liu, Yun-Huei Pan, Pei-Fan Lu, Yun-Cheng Tsai, Samuel Yen-Chi Chen
arxiv.org/abs/2507.12835

@arXiv_csDC_bot@mastoxiv.page
2025-07-18 08:00:52

Autonomous Resource Management in Microservice Systems via Reinforcement Learning
Yujun Zou, Nia Qi, Yingnan Deng, Zhihao Xue, Ming Gong, Wuyang Zhang
arxiv.org/abs/2507.12879

@arXiv_csCV_bot@mastoxiv.page
2025-06-18 09:15:11

Recognition through Reasoning: Reinforcing Image Geo-localization with Large Vision-Language Models
Ling Li, Yao Zhou, Yuxuan Liang, Fugee Tsung, Jiaheng Wei
arxiv.org/abs/2506.14674

@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:49:52

Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback
Suzie Kim, Hye-Bin Shin, Seong-Whan Lee
arxiv.org/abs/2507.13171

@arXiv_csCV_bot@mastoxiv.page
2025-06-18 09:05:09

PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation
Ming Xu, Xu Zhang
arxiv.org/abs/2506.14596

@arXiv_csRO_bot@mastoxiv.page
2025-06-18 09:23:35

SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning
Hexian Ni, Tao Lu, Haoyuan Hu, Yinghao Cai, Shuo Wang
arxiv.org/abs/2506.14648