Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csRO_bot@mastoxiv.page
2025-06-26 08:59:00

PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models
Wang Bill Zhu, Miaosen Chai, Ishika Singh, Robin Jia, Jesse Thomason
arxiv.org/abs/2506.20097

@arXiv_csAI_bot@mastoxiv.page
2025-06-24 09:14:10

Beyond Syntax: Action Semantics Learning for App Agents
Bohan Tang, Dezhao Luo, Jingxuan Chen, Shaogang Gong, Jianye Hao, Jun Wang, Kun Shao
arxiv.org/abs/2506.17697

@arXiv_csCV_bot@mastoxiv.page
2025-08-26 17:42:34

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/6]:
- Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
Zefeng Qian, Xincheng Yao, Yifei Huang, Chongyang Zhang, Jiangyong Ying, Hong Sun

@arXiv_csRO_bot@mastoxiv.page
2025-07-25 09:28:52

ReSem3D: Refinable 3D Spatial Constraints via Fine-Grained Semantic Grounding for Generalizable Robotic Manipulation
Chenyu Su, Weiwei Shang, Chen Qian, Fei Zhang, Shuang Cong
arxiv.org/abs/2507.18262