Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csSD_bot@mastoxiv.page
2025-06-06 07:21:16

LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Te Ma, Min Bi, Saierdaer Yusuyin, Hao Huang, Zhijian Ou
arxiv.org/abs/2506.04711

The American president wrote, “Vladimir, STOP!” on his Truth Social account in April,
-- but the Russian president did not halt his offensive in eastern Ukraine.
The Ukrainian president called for an unconditional cease-fire in May,
-- but the Russians did not agree to stop attacking Ukrainian civilians from the air.
Donald Trump repeatedly promised, during his campaign, that he would end the war “in one day,”
-- but the war is not over.
He spoke to Vla…

@arXiv_csSD_bot@mastoxiv.page
2025-06-06 07:21:12

Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning
Hien Ohnaka, Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto
arxiv.org/abs/2506.04527

@arXiv_csCL_bot@mastoxiv.page
2025-07-03 10:16:50

Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla
Md Sazzadul Islam Ridoy, Sumi Akter, Md. Aminur Rahman
arxiv.org/abs/2507.01931

@arXiv_eessAS_bot@mastoxiv.page
2025-06-03 07:56:26

Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech
Karl El Hajal, Enno Hermann, Sevada Hovsepyan, Mathew Magimai. -Doss
arxiv.org/abs/2506.01618

@arXiv_csAI_bot@mastoxiv.page
2025-07-01 09:59:13

AURA: Agent for Understanding, Reasoning, and Automated Tool Use in Voice-Driven Tasks
Leander Melroy Maben, Gayathri Ganesh Lakshmy, Srijith Radhakrishnan, Siddhant Arora, Shinji Watanabe
arxiv.org/abs/2506.23049

@arXiv_csCL_bot@mastoxiv.page
2025-06-03 08:20:21

NAVER LABS Europe Submission to the Instruction-following Track
Beomseok Lee, Marcely Zanon Boito, Laurent Besacier, Ioan Calapodescu
arxiv.org/abs/2506.01808

@arXiv_csSD_bot@mastoxiv.page
2025-06-03 07:37:27

Fine-Tuning ASR for Stuttered Speech: Personalized vs. Generalized Approaches
Dena Mujtaba, Nihar Mahapatra
arxiv.org/abs/2506.00853

@arXiv_csSD_bot@mastoxiv.page
2025-06-04 13:37:15

This arxiv.org/abs/2505.24200 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSD_…

@arXiv_csCL_bot@mastoxiv.page
2025-06-30 07:55:49

Efficient Multilingual ASR Finetuning via LoRA Language Experts
Jiahong Li, Yiwen Shao, Jianheng Zhuo, Chenda Li, Liliang Tang, Dong Yu, Yanmin Qian
arxiv.org/abs/2506.21555