Benchmarking Training Paradigms, Dataset Composition, and Model Scaling for Child ASR in ESPnet
Anyu Ying, Natarajan Balaji Shankar, Chyi-Jiunn Lin, Mohan Shi, Pu Wang, Hye-jin Shim, Siddhant Arora, Hugo Van hamme, Abeer Alwan, Shinji Watanabe
https://arxiv.org/abs/2508.16576
i-LAVA: Insights on Low Latency Voice-2-Voice Architecture for Agents
Anupam Purwar, Aditya Choudhary
https://arxiv.org/abs/2509.20971 https://arxiv.org/pd…
Danish Minister of Justice and chief architect of the current Chat Control proposal, Peter Hummelgaard:
"We must break with the totally erroneous perception that it is everyone's civil liberty to communicate on encrypted messaging services."
Ok, I agree but on one condition: "everyone" means EVERYONE. Police, military, government, politicians, lobbyists, 100% 7x24 transparency. No closed door meetings, no off the record, no sly meets in a pub. Everyone.
Anything else is sinister.
Implicit Augmentation from Distributional Symmetry in Turbulence Super-Resolution
Julia Balla, Jeremiah Bailey, Ali Backour, Elyssa Hofgard, Tommi Jaakkola, Tess Smidt, Ryley McConkey
https://arxiv.org/abs/2509.20683
System-driven Interactive Design Support for Cloud Architecture: A Qualitative User Experience Study with Novice Engineers
Ryosuke Kohita, Akira Kasuga
https://arxiv.org/abs/2508.12385
A Systematic Review of FAIR-compliant Big Data Software Reference Architectures
Jo\~ao Pedro de Carvalho Castro, Maria J\'ulia Soares De Grandi, Cristina Dutra de Aguiar
https://arxiv.org/abs/2509.14370
HarmoniFuse: A Component-Selective and Prompt-Adaptive Framework for Multi-Task Speech Language Modeling
Yuke Si, Runyan Yang, Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang
https://arxiv.org/abs/2509.18570
Thinking in cocktail party: Chain-of-Thought and reinforcement learning for target speaker automatic speech recognition
Yiru Zhang, Hang Su, Lichun Fan, Zhenbo Luo, Jian Luan
https://arxiv.org/abs/2509.15612
From SALAMANDRA to SALAMANDRATA: BSC Submission for WMT25 General Machine Translation Shared Task
Javier Garcia Gilabert, Xixian Liao, Severino Da Dalt, Ella Bohman, Audrey Mash, Francesca De Luca Fornaciari, Irene Baucells, Joan Llop, Miguel Claramunt Argote, Carlos Escolano, Maite Melero
https://arxiv.org/abs/2508.12774
Analysis of Domain Shift across ASR Architectures via TTS-Enabled Separation of Target Domain and Acoustic Conditions
Tina Raissi, Nick Rossenbach, Ralf Schl\"uter
https://arxiv.org/abs/2508.09868