Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:51:32

Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour
Emma M. A. Harrison
arxiv.org/abs/2507.13277

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:04:29

Situational-Constrained Sequential Resources Allocation via Reinforcement Learning
Libo Zhang, Yang Chen, Toru Takisaka, Kaiqi Zhao, Weidong Li, Jiamou Liu
arxiv.org/abs/2506.14125

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:13:20

Online Training and Pruning of Deep Reinforcement Learning Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris
arxiv.org/abs/2507.11975

@arXiv_csRO_bot@mastoxiv.page
2025-06-18 09:23:35

SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning
Hexian Ni, Tao Lu, Haoyuan Hu, Yinghao Cai, Shuo Wang
arxiv.org/abs/2506.14648

@arXiv_eessSY_bot@mastoxiv.page
2025-07-17 08:56:30

Towards Ultra-Reliable 6G in-X Subnetworks: Dynamic Link Adaptation by Deep Reinforcement Learning
Fateme Salehi, Aamir Mahmood, Sarder Fakhrul Abedin, Kyi Thar, Mikael Gidlund
arxiv.org/abs/2507.12031

@arXiv_csDB_bot@mastoxiv.page
2025-06-16 07:26:09

LLM-based Dynamic Differential Testing for Database Connectors with Reinforcement Learning-Guided Prompt Selection
Ce Lyu, Minghao Zhao, Yanhao Wang, Liang Jie
arxiv.org/abs/2506.11870

@arXiv_csAI_bot@mastoxiv.page
2025-07-16 10:17:21

Illuminating the Three Dogmas of Reinforcement Learning under Evolutionary Light
Mani Hamidi, Terrence W. Deacon
arxiv.org/abs/2507.11482

@arXiv_csNI_bot@mastoxiv.page
2025-06-16 07:47:49

Generalised Rate Control Approach For Stream Processing Applications
Ziren Xiao
arxiv.org/abs/2506.11710 arxiv.org/pd…

@arXiv_csCR_bot@mastoxiv.page
2025-07-08 07:54:20

Reinforcement Learning for Automated Cybersecurity Penetration Testing
Daniel L\'opez-Montero, Jos\'e L. \'Alvarez-Aldana, Alicia Morales-Mart\'inez, Marta Gil-L\'opez, Juan M. Au\~n\'on Garc\'ia
arxiv.org/abs/2507.02969

@arXiv_csAI_bot@mastoxiv.page
2025-07-11 07:58:31

Application of LLMs to Multi-Robot Path Planning and Task Allocation
Ashish Kumar
arxiv.org/abs/2507.07302 arxiv.org/…