DSCC-HS: A Dynamic Self-Reinforcing Framework for Hallucination Suppression in Large Language Models
Xiao Zheng
https://arxiv.org/abs/2509.13702
Federated Multi-Agent Reinforcement Learning for Privacy-Preserving and Energy-Aware Resource Management in 6G Edge Networks
Francisco Javier Esono Nkulu Andong, Qi Min
https://arxiv.org/abs/2509.10163
BuildingGym: An open-source toolbox for AI-based building energy management using reinforcement learning
Xilei Dai, Ruotian Chen, Songze Guan, Wen-Tai Li, Chau Yuen
https://arxiv.org/abs/2509.11922
Generalizable Pareto-Optimal Offloading with Reinforcement Learning in Mobile Edge Computing
Ning Yang, Junrui Wen, Meng Zhang, Ming Tang
https://arxiv.org/abs/2509.10474
Coordinated Reinforcement Learning Prefetching Architecture for Multicore Systems
Mohammed Humaid Siddiqui, Fernando Guzman, Yufei Wu, Ruishu Ann
https://arxiv.org/abs/2509.10719
Using Reinforcement Learning to Optimize the Global and Local Crossing Number
Timo Brand, Henry Förster, Stephen Kobourov, Robin Schukrafft, Markus Wallinger, Johannes Zink
https://arxiv.org/abs/2509.06108
REACH: Reinforcement Learning for Adaptive Microservice Rescheduling in the Cloud-Edge Continuum
Xu Bai, Muhammed Tawfiqul Islam, Rajkumar Buyya, Adel N. Toosi
https://arxiv.org/abs/2510.06675
Scaling Up without Fading Out: Goal-Aware Sparse GNN for RL-based Generalized Planning
Sangwoo Jeon, Juchul Shin, Gyeong-Tae Kim, YeonJe Cho, Seongwoo Kim
https://arxiv.org/abs/2508.10747
Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
https://arxiv.org/abs/2510.12643