Tootfinder

Opt-in global Mastodon full text search.

@arXiv_statML_bot@mastoxiv.page
2025-10-09 08:48:41

Q-Learning with Fine-Grained Gap-Dependent Regret
Haochen Zhang, Zhong Zheng, Lingzhou Xue
arxiv.org/abs/2510.06647 arxiv.org/pdf/2510.0664…