Pronunciation Deviation Analysis Through Voice Cloning and Acoustic Comparison
Andrew Valdivia, Yueming Zhang, Hailu Xu, Amir Ghasemkhani, Xin Qin
https://arxiv.org/abs/2507.10985
V(is)owel: An Interactive Vowel Chart to Understand What Makes Visual Pronunciation Effective in Second Language Learning
Charlotte Kiesel, Dipayan Mukherjee, Mark Hasegawa-Johnson, Karrie Karahalios
https://arxiv.org/abs/2507.06202
Evaluating Logit-Based GOP Scores for Mispronunciation Detection
Aditya Kamlesh Parikh, Cristian Tejedor-Garcia, Catia Cucchiarini, Helmer Strik
https://arxiv.org/abs/2506.12067
Pronunciation-Lexicon Free Training for Phoneme-based Crosslingual ASR via Joint Stochastic Approximation
Saierdaer Yusuyin, Te Ma, Hao Huang, Zhijian Ou
https://arxiv.org/abs/2507.06249
@… prepared for complaints about the proper pronunciation of "shout"
Intelligibility of Text-to-Speech Systems for Mathematical Expressions
Sujoy Roychowdhury, H. G. Ranjani, Sumit Soman, Nishtha Paul, Subhadip Bandyopadhyay, Siddhanth Iyengar
https://arxiv.org/abs/2506.11086
Enhancing GOP in CTC-Based Mispronunciation Detection with Phonological Knowledge
Aditya Kamlesh Parikh, Cristian Tejedor-Garcia, Catia Cucchiarini, Helmer Strik
https://arxiv.org/abs/2506.02080
Dhvani: A Weakly-supervised Phonemic Error Detection and Personalized Feedback System for Hindi
Arnav Rustagi, Satvik Bajpai, Nimrat Kaur, Siddharth Siddharth
https://arxiv.org/abs/2506.02166
Pronunciation Editing for Finnish Speech using Phonetic Posteriorgrams
Zirui Li, Lauri Juvela, Mikko Kurimo
https://arxiv.org/abs/2507.02115 https://