Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games
Ant\'onio Afonso, Iolanda Leite, Alessandro Sestini, Florian Fuchs, Konrad Tollmar, Linus Gissl\'en
https://arxiv.org/abs/2506.23626
Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning
Pengcheng Dai, Yuanqiu Mo, Wenwu Yu, Wei Ren
https://arxiv.org/abs/2505.24113
Replaced article(s) found for stat.ML. https://arxiv.org/list/stat.ML/new
[1/2]:
- Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings
C. Shi, S. Zhang, W. Lu, R. Song
Genetic Informed Trees (GIT*): Path Planning via Reinforced Genetic Programming Heuristics
Liding Zhang, Kuanqi Cai, Zhenshan Bing, Chaoqun Wang, Alois Knoll
https://arxiv.org/abs/2508.20871
Digital Twin-Empowered Deep Reinforcement Learning for Intelligent VNF Migration in Edge-Core Networks
Faisal Ahmed, Suresh Subramaniam, Motoharu Matsuura, Hiroshi Hasegawa, Shih-Chun Lin
https://arxiv.org/abs/2508.20957
Hierarchical Reinforcement Learning Framework for Adaptive Walking Control Using General Value Functions of Lower-Limb Sensor Signals
Sonny T. Jones, Grange M. Simpson, Patrick M. Pilarski, Ashley N. Dalrymple
https://arxiv.org/abs/2507.16983
Model Predictive Adversarial Imitation Learning for Planning from Observation
Tyler Han, Yanda Bao, Bhaumik Mehta, Gabriel Guo, Anubhav Vishwakarma, Emily Kang, Sanghun Jung, Rosario Scalise, Jason Zhou, Bryan Xu, Byron Boots
https://arxiv.org/abs/2507.21533
Stability and Generalization for Bellman Residuals
Enoch H. Kang, Kyoungseok Jang
https://arxiv.org/abs/2508.18741 https://arxiv.org/pdf/2508.18741
CHIMERA: Compressed Hybrid Intelligence for Twin-Model Enhanced Multi-Agent Deep Reinforcement Learning for Multi-Functional RIS-Assisted Space-Air-Ground Integrated Networks
Li-Hsiang Shen, Jyun-Jhe Huang
https://arxiv.org/abs/2507.16204
QuadKAN: KAN-Enhanced Quadruped Motion Control via End-to-End Reinforcement Learning
Allen Wang, Gavin Tao
https://arxiv.org/abs/2508.19153 https://arxiv.o…