 @arXiv_csLG_bot@mastoxiv.page
Learning Distinguishable Representations in Deep Q-Networks for Linear Transfer
Sooraj Sathish, Keshav Goyal, Raghuram Bharadwaj Diddigi
https://arxiv.org/abs/2509.24947 https:/…
 @arXiv_csGT_bot@mastoxiv.page
Grouped Satisficing Paths in Pure Strategy Games: a Topological Perspective
Yanqing Fu, Chao Huang, Chenrun Wang, Zhuping Wang
https://arxiv.org/abs/2509.23157 https://
 @arXiv_csAI_bot@mastoxiv.page
ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding
Sining Zhoubian, Dan Zhang, Yuxiao Dong, Jie Tang
https://arxiv.org/abs/2508.19576 h…
 @arXiv_csCR_bot@mastoxiv.page
Towards Production-Worthy Simulation for Autonomous Cyber Operations
Konur Tholl, Mariam El Mezouar, Ranwa Al Mallah
https://arxiv.org/abs/2508.19278 https://
 @arXiv_eessSY_bot@mastoxiv.page
Reinforcement Learning-based Control via Y-wise Affine Neural Networks (YANNs)
Austin Braniff, Yuhe Tian
https://arxiv.org/abs/2508.16474 https://arxiv.org…
 @Dragofix@veganism.social
A closer look at Peru’s Amazon reveals new mining trends, deforestation
https://news.mongabay.com/2025/10/a-closer-look-at-perus-amazon-reveals-new-mining-trends-deforestation/
 @arXiv_csCL_bot@mastoxiv.page
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie, Jun Zhou, Guanxiang Wang, Shisong Wud, Zichen Wang
https://arxiv.org/abs/2509.20162 
 @arXiv_csLG_bot@mastoxiv.page
Tree Search for LLM Agent Reinforcement Learning
Yuxiang Ji, Ziyu Ma, Yong Wang, Guanhua Chen, Xiangxiang Chu, Liaoni Wu
https://arxiv.org/abs/2509.21240 https://
 @arXiv_csCR_bot@mastoxiv.page
Attackers Strike Back? Not Anymore - An Ensemble of RL Defenders Awakens for APT Detection
Sidahmed Benabderrahmane, Talal Rahwan
https://arxiv.org/abs/2508.19072 https://
 @arXiv_csLG_bot@mastoxiv.page
RL Is Neither a Panacea Nor a Mirage: Understanding Supervised vs. Reinforcement Learning Fine-Tuning for LLMs
Hangzhan Jin, Sicheng Lv, Sifan Wu, Mohammad Hamdaqa
https://arxiv.org/abs/2508.16546