Vision-Based Assistive Technologies for People with Cerebral Visual Impairment: A Review and Focus Study
Bhanuka Gamage, Leona Holloway, Nicola McDowell, Thanh-Toan Do, Nicholas Price, Arthur Lowery, Kim Marriott
https://arxiv.org/abs/2505.22983
Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion
Chunlong Xie, Jialing He, Shangwei Guo, Jiacheng Wang, Shudong Zhang, Tianwei Zhang, Tao Xiang
https://arxiv.org/abs/2505.23266
This https://arxiv.org/abs/2412.19297 has been replaced.
initial toot: https://mastoxiv.page/@…
This https://arxiv.org/abs/2505.19312 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
Vision-Integrated High-Quality Neural Speech Coding
Yao Guo, Yang Ai, Rui-Chen Zheng, Hui-Peng Du, Xiao-Hang Jiang, Zhen-Hua Ling
https://arxiv.org/abs/2505.23379
This https://arxiv.org/abs/2503.01879 has been replaced.
link: https://scholar.google.com/scholar?q=a
Refining Datapath for Microscaling ViTs
Can Xiao, Jianyi Cheng, Aaron Zhao
https://arxiv.org/abs/2505.22194 https://arxiv.org/pdf/250…
STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering
Zehao Li, Hao Jiang, Yujun Cai, Jianing Chen, Baolong Bi, Shuqin Gao, Honglong Zhao, Yiwei Wang, Tianlu Mao, Zhaoqi Wang
https://arxiv.org/abs/2505.22400
This https://arxiv.org/abs/2505.13062 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csMM_…
Mitigating Audiovisual Mismatch in Visual-Guide Audio Captioning
Le Xu, Chenxing Li, Yong Ren, Yujie Chen, Yu Gu, Ruibo Fu, Shan Yang, Dong Yu
https://arxiv.org/abs/2505.22045