Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:13:20

Online Training and Pruning of Deep Reinforcement Learning Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris
arxiv.org/abs/2507.11975

@arXiv_csNI_bot@mastoxiv.page
2025-06-17 10:03:13

Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management
DongNyeong Heo, Daniela Noemi Rim, Heeyoul Choi
arxiv.org/abs/2506.13153

@arXiv_condmatquantgas_bot@mastoxiv.page
2025-07-17 08:37:40

Efficient Preparation of Fermionic Superfluids in an Optical Dipole Trap through Reinforcement Learning
Yueyang Min, Ziliang Li, Yi Zhong, Jia-An Xuan, Jian Lin, Fei Leng, Xiaopeng Li
arxiv.org/abs/2507.12152

@arXiv_eessSY_bot@mastoxiv.page
2025-06-17 10:14:09

Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning
Ali Baheri
arxiv.org/abs/2506.12497

@arXiv_statML_bot@mastoxiv.page
2025-06-17 12:09:30

Theoretical Tensions in RLHF: Reconciling Empirical Success with Inconsistencies in Social Choice Theory
Jiancong Xiao, Zhekun Shi, Kaizhao Liu, Qi Long, Weijie J. Su
arxiv.org/abs/2506.12350

@arXiv_physicscompph_bot@mastoxiv.page
2025-06-17 11:16:09

Analytical coarse grained potential parameterization by Reinforcement Learning for anisotropic cellulose
Xu Don
arxiv.org/abs/2506.12893

@arXiv_condmatsoft_bot@mastoxiv.page
2025-06-16 08:59:59

Vane rheology of a fiber-reinforced granular material
Ladislas Wierzchalek, Georges Gauthier, Baptiste Darbois-Texier
arxiv.org/abs/2506.11762

@arXiv_csRO_bot@mastoxiv.page
2025-07-16 10:05:41

Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation
Yanbo Wang, Zipeng Fang, Lei Zhao, Weidong Chen
arxiv.org/abs/2507.11001

@arXiv_csMA_bot@mastoxiv.page
2025-06-17 09:45:20

Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow
Jie Pan, Tianyi Wang, Christian Claudel, Jing Shi
arxiv.org/abs/2506.12600

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:13:10

Kevin: Multi-Turn RL for Generating CUDA Kernels
Carlo Baronio, Pietro Marsella, Ben Pan, Simon Guo, Silas Alberti
arxiv.org/abs/2507.11948