Tootfinder

Opt-in global Mastodon full text search.

No exact results. Similar results found.
@arXiv_csNI_bot@mastoxiv.page
2025-06-19 08:28:34

GCN-Driven Reinforcement Learning for Probabilistic Real-Time Guarantees in Industrial URLLC
Eman Alqudah, Ashfaq Khokhar
arxiv.org/abs/2506.15011

@arXiv_eessSP_bot@mastoxiv.page
2025-06-19 08:44:12

Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access
Anup Mishra, Čedomir Stefanović, Xiuqiang Xu, Petar Popovski, Israel Leyva-Mayorga
arxiv.org/abs/2506.15273

@arXiv_csCL_bot@mastoxiv.page
2025-06-19 08:16:34

CC-LEARN: Cohort-based Consistency Learning
Xiao Ye, Shaswat Shrivastava, Zhaonan Li, Jacob Dineen, Shijie Lu, Avneet Ahuja, Ming Shen, Zhikun Xu, Ben Zhou
arxiv.org/abs/2506.15662

@arXiv_csRO_bot@mastoxiv.page
2025-06-19 08:35:03

Efficient Navigation Among Movable Obstacles using a Mobile Manipulator via Hierarchical Policy Learning
Taegeun Yang, Jiwoo Hwang, Jeil Jeong, Minsung Yoon, Sung-Eui Yoon
arxiv.org/abs/2506.15380

@arXiv_eessSY_bot@mastoxiv.page
2025-06-18 08:47:53

Implicit Constraint-Aware Off-Policy Correction for Offline Reinforcement Learning
Ali Baheri
arxiv.org/abs/2506.14058

@arXiv_csCV_bot@mastoxiv.page
2025-06-18 09:15:11

Recognition through Reasoning: Reinforcing Image Geo-localization with Large Vision-Language Models
Ling Li, Yao Zhou, Yuxuan Liang, Fugee Tsung, Jiaheng Wei
arxiv.org/abs/2506.14674

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:04:29

Situational-Constrained Sequential Resources Allocation via Reinforcement Learning
Libo Zhang, Yang Chen, Toru Takisaka, Kaiqi Zhao, Weidong Li, Jiamou Liu
arxiv.org/abs/2506.14125

@arXiv_csHC_bot@mastoxiv.page
2025-06-17 10:52:09

Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes
Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens
arxiv.org/abs/2506.13583

@arXiv_csCV_bot@mastoxiv.page
2025-06-18 09:05:09

PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation
Ming Xu, Xu Zhang
arxiv.org/abs/2506.14596

@arXiv_csNI_bot@mastoxiv.page
2025-06-17 10:03:13

Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management
DongNyeong Heo, Daniela Noemi Rim, Heeyoul Choi
arxiv.org/abs/2506.13153