Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
Yinjie Wang, Ling Yang, Bowen Li, Ye Tian, Ke Shen, Mengdi Wang
https://arxiv.org/abs/2509.06949
Reinforcement Learning for Clinical Reasoning: Aligning LLMs with ACR Imaging Appropriateness Criteria
Anni Tziakouri, Filippo Menolascina
https://arxiv.org/abs/2510.05194 https…
SmartFlow: A CFD-solver-agnostic deep reinforcement learning framework for computational fluid dynamics on HPC platforms
Maochao Xiao, Yuning Wang, Felix Rodach, Bernat Font, Marius Kurz, Pol Suárez, Di Zhou, Francisco Alcántara-Ávila, Ting Zhu, Junle Liu, Ricard Montalà, Jiawei Chen, Jean Rabault, Oriol Lehmkuhl, Andrea Beck, Johan Larsson, Ricardo Vinuesa, Sergio Pirozzoli
I have only just presented a "city under the radar" in the current issue of Augustin, and I am already travelling to the next one: the birthplace, located outside Austria, of a revolutionary executed by the Habsburg military courts, who is commemorated by a statue erected in 1958 on the square named after him. A hint: writers such as Friedrich Wolf, Franz Fleischhacker and Eva Priester have devoted works to the deeds of the man honoured here and of his comrades.
Hierarchical Reinforcement Learning Framework for Adaptive Walking Control Using General Value Functions of Lower-Limb Sensor Signals
Sonny T. Jones, Grange M. Simpson, Patrick M. Pilarski, Ashley N. Dalrymple
https://arxiv.org/abs/2507.16983
I have extended thoughts on a few nuances of burnout, resilience, and employment
Before taking time off for burnout, my skip manager reminded me to read the strongly positive 360 feedback from my reports. That's both a shallow and a deep reinforcement of resilience, first and foremost by rebuilding and grounding self-confidence. Reading positive feedback provides evidence that I'm capable and effective at my job.
Beyond self-confidence, I have other needs in a workplace, …
Knowledge Defined Networking for 6G: A Reinforcement Learning Example for Resource Management
Erol Koçoğlu, Mehmet Ozdem, Tuğçe Bilen
https://arxiv.org/abs/2509.26075
Deep Reinforcement Learning for Active Flow Control around a Three-Dimensional Flow-Separated Wing at Re = 1,000
R. Montalà, B. Font, P. Suárez, J. Rabault, O. Lehmkuhl, R. Vinuesa, I. Rodriguez
https://arxiv.org/abs/2509.10195
Learning in an Echo Chamber: Online Learning with Replay Adversary
Daniil Dmitriev, Harald Eskelund Franck, Carolin Heinzler, Amartya Sanyal
https://arxiv.org/abs/2509.25135 htt…
Discovering Flow Separation Control Strategies in 3D Wings via Deep Reinforcement Learning
R. Montalà, B. Font, P. Suárez, J. Rabault, O. Lehmkuhl, R. Vinuesa, I. Rodriguez
https://arxiv.org/abs/2509.10185