Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-09-18 10:16:51

Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST
Monica Sekoyan, Nithin Rao Koluguri, Nune Tadevosyan, Piotr Zelasko, Travis Bartley, Nick Karpov, Jagadeesh Balam, Boris Ginsburg
arxiv.org/abs/2509.14128

The Trump administration has vowed to crack down on what it calls hate speech. It has labeled antifa, a loosely organized anti-fascist group, a terrorist organization.
And it has sought to punish figures such as TV host Jimmy Kimmel for statements perceived critical of conservative activists.
What the First Amendment makes clear is that it does not just protect the rights of speakers who say things with which Americans agree.
Or, as the Supreme Court said in a separate deci…

@Techmeme@techhub.social
2025-11-10 23:45:44

Meta introduces Omnilingual Automatic Speech Recognition, a suite of AI models providing automatic speech recognition capabilities for more than 1,600 languages (Carl Franzen/VentureBeat)
venturebeat.com/ai/meta-return

@arXiv_csSD_bot@mastoxiv.page
2025-10-14 11:06:48

Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR
Ling Sun, Charlotte Zhu, Shuju Shi
arxiv.org/abs/2510.10738 arxiv.org/…

@arXiv_eessAS_bot@mastoxiv.page
2025-10-13 07:41:50

Articulation-Informed ASR: Integrating Articulatory Features into ASR via Auxiliary Speech Inversion and Cross-Attention Fusion
Ahmed Adel Attia, Jing Liu, Carol Espy Wilson
arxiv.org/abs/2510.08585

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:38:30

Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking
Mohammad Hossein Sameti, Sepehr Harfi Moridani, Ali Zarean, Hossein Sameti
arxiv.org/abs/2510.09528

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:32:01

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation
Vaibhav Srivastav, Steven Zheng, Eric Bezzam, Eustache Le Bihan, Nithin Koluguri, Piotr \.Zelasko, Somshubra Majumdar, Adel Moumen, Sanchit Gandhi
arxiv.org/abs/2510.06961

@arXiv_csCL_bot@mastoxiv.page
2025-10-06 10:20:29

Listening or Reading? Evaluating Speech Awareness in Chain-of-Thought Speech-to-Text Translation
Jacobo Romero-D\'iaz, Gerard I. G\'allego, Oriol Pareras, Federico Costa, Javier Hernando, Cristina Espa\~na-Bonet
arxiv.org/abs/2510.03115

@arXiv_csCL_bot@mastoxiv.page
2025-10-06 10:20:09

Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting?
Oriol Pareras, Gerard I. G\'allego, Federico Costa, Cristina Espa\~na-Bonet, Javier Hernando
arxiv.org/abs/2510.03093