Quantifying and Reducing Speaker Heterogeneity within the Common Voice Corpus for Phonetic Analysis
Miao Zhang, Aref Farhadipour, Annie Baker, Jiachen Ma, Bogdan Pricop, Eleanor Chodroff
https://arxiv.org/abs/2506.00733
With its crosslinguistic and cross-speaker diversity, the Mozilla Common Voice Corpus (CV) has been a valuable resource for multilingual speech technology and holds tremendous potential for research in crosslinguistic phonetics and speech sciences. Properly accounting for speaker variation is, however, key to the theoretical and statistical bases of speech research. While CV provides a client ID as an approximation to a speaker ID, multiple speakers can contribute under the same ID. This study …