Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-09-18 09:42:31

DSCC-HS: A Dynamic Self-Reinforcing Framework for Hallucination Suppression in Large Language Models
Xiao Zheng
arxiv.org/abs/2509.13702 ar…

A study published in 2021 presented cuttlefish with a new version of the "marshmallow test",
and the results showed there's more going on in their strange little brains than we ever suspected.
Their ability to learn, anticipate future rewards, and adapt their behavior, the researchers said,
may have evolved to give cuttlefish an edge in the cutthroat eat-or-be-eaten marine world they live

@arXiv_csLG_bot@mastoxiv.page
2025-09-15 09:48:11

Federated Multi-Agent Reinforcement Learning for Privacy-Preserving and Energy-Aware Resource Management in 6G Edge Networks
Francisco Javier Esono Nkulu Andong, Qi Min
arxiv.org/abs/2509.10163

@arXiv_csAI_bot@mastoxiv.page
2025-09-16 11:15:16

BuildingGym: An open-source toolbox for AI-based building energy management using reinforcement learning
Xilei Dai, Ruotian Chen, Songze Guan, Wen-Tai Li, Chau Yuen
arxiv.org/abs/2509.11922

@arXiv_eessSY_bot@mastoxiv.page
2025-09-16 08:09:16

Generalizable Pareto-Optimal Offloading with Reinforcement Learning in Mobile Edge Computing
Ning Yang, Junrui Wen, Meng Zhang, Ming Tang
arxiv.org/abs/2509.10474

@arXiv_csDC_bot@mastoxiv.page
2025-09-16 07:55:36

Coordinated Reinforcement Learning Prefetching Architecture for Multicore Systems
Mohammed Humaid Siddiqui, Fernando Guzman, Yufei Wu, Ruishu Ann
arxiv.org/abs/2509.10719

@arXiv_csCG_bot@mastoxiv.page
2025-09-09 07:33:41

Using Reinforcement Learning to Optimize the Global and Local Crossing Number
Timo Brand, Henry F\"orster, Stephen Kobourov, Robin Schukrafft, Markus Wallinger, Johannes Zink
arxiv.org/abs/2509.06108

@arXiv_csDC_bot@mastoxiv.page
2025-10-09 09:12:51

REACH: Reinforcement Learning for Adaptive Microservice Rescheduling in the Cloud-Edge Continuum
Xu Bai, Muhammed Tawfiqul Islam, Rajkumar Buyya, Adel N. Toosi
arxiv.org/abs/2510.06675

@arXiv_csAI_bot@mastoxiv.page
2025-08-15 09:45:32

Scaling Up without Fading Out: Goal-Aware Sparse GNN for RL-based Generalized Planning
Sangwoo Jeon, Juchul Shin, Gyeong-Tae Kim, YeonJe Cho, Seongwoo Kim
arxiv.org/abs/2508.10747

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:44:11

Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
arxiv.org/abs/2510.12643 arxiv.org…