Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_mathOC_bot@mastoxiv.page
2025-06-02 07:27:40

Convex Approximations of Random Constrained Markov Decision Processes
V Varagapriya, Vikas Vikram Singh, Abdel Lisser
arxiv.org/abs/2505.24815

@arXiv_csRO_bot@mastoxiv.page
2025-06-30 09:37:00

An Introduction to Zero-Order Optimization Techniques for Robotics
Armand Jordana, Jianghan Zhang, Joseph Amigo, Ludovic Righetti
arxiv.org/abs/2506.22087

@arXiv_econEM_bot@mastoxiv.page
2025-07-29 07:58:31

Sequential Decision Problems with Missing Feedback
Filippo Palomba
arxiv.org/abs/2507.19596 arxiv.org/pdf/2507.19596

@arXiv_physicsoptics_bot@mastoxiv.page
2025-06-17 12:03:29

Inverse design of the transmission matrix in a random system using Reinforcement Learning
Yuhao Kang
arxiv.org/abs/2506.13057