Tootfinder

@arXiv_csCV_bot@mastoxiv.page
2025-09-22 10:30:01

Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval
Liwei Liao, Xufeng Li, Xiaoyun Zheng, Boning Liu, Feng Gao, Ronggang Wang
https://arxiv.org/abs/2509.15871 https://…

Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval
3D Visual Grounding (3DVG) aims to locate objects in 3D scenes based on text prompts, which is essential for applications such as robotics. However, existing 3DVG methods encounter two main challenges: first, they struggle to handle the implicit representation of spatial textures in 3D Gaussian Splatting (3DGS), making per-scene training indispensable; second, they typically require larges amounts of labeled data for effective training. To this end, we propose \underline{G}rounding via \underli…

Tootfinder

Opt-in global Mastodon full text search. Join the index!