Prominence-aware automatic speech recognition for conversational speech
Julian Linke, Barbara Schuppler
https://arxiv.org/abs/2509.10116 https://arxiv.org/…
Data-independent Beamforming for End-to-end Multichannel Multi-speaker ASR
Can Cui, Paul Magron, Mostafa Sadeghi, Emmanuel Vincent
https://arxiv.org/abs/2509.10234 https://
Space-time Coded Differential Modulation for Reconfigurable Intelligent Surfaces
Jiawei Qiu, Harry Leib
https://arxiv.org/abs/2508.10244 https://arxiv.org/…
WhisTLE: Deeply Supervised, Text-Only Domain Adaptation for Pretrained Speech Recognition Transformers
Akshat Pandey, Karun Kumar, Raphael Tang
https://arxiv.org/abs/2509.10452 …
Exploring Cross-Utterance Speech Contexts for Conformer-Transducer Speech Recognition Systems
Mingyu Cui, Mengzhe Geng, Jiajun Deng, Chengxi Deng, Jiawen Kang, Shujie Hu, Guinan Li, Tianzi Wang, Zhaoqing Li, Xie Chen, Xunying Liu
https://arxiv.org/abs/2508.10456
Analysis of Domain Shift across ASR Architectures via TTS-Enabled Separation of Target Domain and Acoustic Conditions
Tina Raissi, Nick Rossenbach, Ralf Schl\"uter
https://arxiv.org/abs/2508.09868
A Comparative Analysis on ASR System Combination for Attention, CTC, Factored Hybrid, and Transducer Models
Noureldin Bayoumi, Robin Schmitt, Tina Raissi, Albert Zeyer, Ralf Schl\"uter, Hermann Ney
https://arxiv.org/abs/2508.09880
Assessing the Feasibility of Lightweight Whisper Models for Low-Resource Urdu Transcription
Abdul Rehman Antall, Naveed Akhtar
https://arxiv.org/abs/2508.09865 https://
Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR
Ling Sun, Charlotte Zhu, Shuju Shi
https://arxiv.org/abs/2510.10738 https://arxiv.org/…
Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking
Mohammad Hossein Sameti, Sepehr Harfi Moridani, Ali Zarean, Hossein Sameti
https://arxiv.org/abs/2510.09528