Tootfinder

Opt-in global Mastodon full-text search. Join the index!

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:37:08

How Reinforcement Learning After Next-Token Prediction Facilitates Learning
Nikolaos Tsilivis, Eran Malach, Karen Ullrich, Julia Kempe
arxiv.org/abs/2510.11495

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:44:21

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
Xiaoji Zheng, Ziyuan Yang, Yanhao Chen, Yuhang Peng, Yuanrong Tang, Gengyuan Liu, Bokui Chen, Jiangtao Gong
arxiv.org/abs/2510.12560

@arXiv_csRO_bot@mastoxiv.page
2025-10-15 10:12:01

Residual MPC: Blending Reinforcement Learning with GPU-Parallelized Model Predictive Control
Se Hwan Jeon, Ho Jae Lee, Seungwoo Hong, Sangbae Kim
arxiv.org/abs/2510.12717

@paulbusch@mstdn.ca
2025-09-14 17:35:04

On Sunday, October 5, 2025, my wife and I are participating in the Canadian Cancer Society's Run for the Cure. This national fundraiser brings together communities across the country who will run or walk in support of all Canadians impacted by breast cancer. We are walking in honour of my mother, my two sisters, and both our daughters - all of whom have been touched by this terrible disease.
Our family isn't special: 1 in 8 women will face a breast cancer diagnosis in their lifetime. It is the most commonly diagnosed cancer among Canadian women, and all of us likely know someone who has battled it in the past or is fighting it today. I encourage all Canadians to donate in honour of someone they know or simply to help their communities fight breast cancer. This cause is particularly important to us, especially this year, as we support our daughters, and I hope all Canadians get involved.
#CanadianCancerSociety #RunForTheCure
support.cancer.ca/site/TR/Runf

@arXiv_eessSY_bot@mastoxiv.page
2025-09-16 11:49:17

Compositional shield synthesis for safe reinforcement learning in partial observability
Steven Carr, Georgios Bakirtzis, Ufuk Topcu
arxiv.org/abs/2509.12085

@arXiv_csGT_bot@mastoxiv.page
2025-09-16 07:41:26

Strategic Cyber Defense via Reinforcement Learning-Guided Combinatorial Auctions
Mai Pham, Vikrant Vaze, Peter Chin
arxiv.org/abs/2509.10983

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 10:10:21

Biased-Attention Guided Risk Prediction for Safe Decision-Making at Unsignalized Intersections
Chengyang Dong, Nan Guo
arxiv.org/abs/2510.12428

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:24:30

Thought Purity: Defense Paradigm For Chain-of-Thought Attack
Zihao Xue, Zhen Bi, Long Ma, Zhenlin Hu, Yan Wang, Zhenfang Liu, Qing Sheng, Jie Xiao, Jungang Lou
arxiv.org/abs/2507.12314

@arXiv_eessSY_bot@mastoxiv.page
2025-09-16 11:02:36

Real-Time Defense Against Coordinated Cyber-Physical Attacks: A Robust Constrained Reinforcement Learning Approach
Saman Mazaheri Khamaneh, Tong Wu, Wei Sun, Cong Chen
arxiv.org/abs/2509.10999

@arXiv_csRO_bot@mastoxiv.page
2025-08-15 08:50:22

Few-shot Vision-based Human Activity Recognition with MLLM-based Visual Reinforcement Learning
Wenqi Zheng, Yutaka Arakawa
arxiv.org/abs/2508.10371