Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla
Md Sazzadul Islam Ridoy, Sumi Akter, Md. Aminur Rahman
https://arxiv.org/abs/2507.01931
From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding
Xiangfeng Wang, Xiao Li, Yadong Wei, Xueyu Song, Yang Song, Xiaoqiang Xia, Fangrui Zeng, Zaiyi Chen, Liu Liu, Gu Xu, Tong Xu
https://arxiv.org/abs/2507.02790
Fine-Tuning ASR for Stuttered Speech: Personalized vs. Generalized Approaches
Dena Mujtaba, Nihar Mahapatra
https://arxiv.org/abs/2506.00853 https://
Revisiting Noise-adaptive Transpilation in Quantum Computing: How Much Impact Does it Have?
Yuqian Huo, Jinbiao Wei, Christopher Kverne, Mayur Akewar, Janki Bhimani, Tirthak Patel
https://arxiv.org/abs/2507.01195
Lepton flavor violating decay of true muonium: $\boldsymbol{(\mu^ \mu^-) \to \mu^\pm e^\mp}$
Ryotaro Minato, Akira Sato, Ryosuke Suda, Masato Yamanaka
https://arxiv.org/abs/2507.01193
NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding
Vladimir Bataev, Andrei Andrusenko, Lilit Grigoryan, Aleksandr Laptev, Vitaly Lavrukhin, Boris Ginsburg
https://arxiv.org/abs/2505.22857
Leveraging In-Context Learning for Political Bias Testing of LLMs
Patrick Haller, Jannis Vamvas, Rico Sennrich, Lena A. J\"ager
https://arxiv.org/abs/2506.22232
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation
Zhennan Lin, Kaixun Huang, Wei Ren, Linju Yang, Lei Xie
https://arxiv.org/abs/2505.23077
NoticeLight: Embracing Socio-Technical Asymmetry through Tangible Peripheral Robotic Embodiment in Hybrid Collaboration
Marie Altmann, Kimberly Hegemann, Ali Askari, Vineetha Rallabandi, Max Pascher, Jens Gerken
https://arxiv.org/abs/2506.22125
The Instrumental Background of EP/FXT
Juan Zhang, Yong Chen, Shumei Jia, Haisheng Zhao, WeiWei Cui, Tianxiang Chen, Juan Wang, Hao Wang, Jin Wang, Chengkui Li, Xiaofan Zhao, Ju Guan, Dawei Han, Jingjing Xu, Liming Song, Hua Feng, Shuangnan Zhang, Weimin Yuan
https://arxiv.org/abs/2507.00510