A Comparative Analysis on ASR System Combination for Attention, CTC, Factored Hybrid, and Transducer Models
Noureldin Bayoumi, Robin Schmitt, Tina Raissi, Albert Zeyer, Ralf Schl\"uter, Hermann Ney
https://arxiv.org/abs/2508.09880
RAG-Boost: Retrieval-Augmented Generation Enhanced LLM-based Speech Recognition
Pengcheng Wang, Sheng Li, Takahiro Shinozaki
https://arxiv.org/abs/2508.14048 https://
Die #Klimakrise trifft #SaudiArabien:
Heftige #Regenfälle haben im Südwesten Saudi-Arabien zu #Sturzfluten
Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition
Chin Yuen Kwok, Jia Qi yip
https://arxiv.org/abs/2509.09196 https://arxiv.or…
Testing for LLM response differences: the case of a composite null consisting of semantically irrelevant query perturbations
Aranyak Acharyya, Carey E. Priebe, Hayden S. Helm
https://arxiv.org/abs/2509.10963
Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech
Dimme de Groot, Tanvina Patel, Devendra Kayande, Odette Scharenborg, Zhengjun Yue
https://arxiv.org/abs/2508.17980