Tootfinder

Opt-in global Mastodon full-text search.

@arXiv_eessAS_bot@mastoxiv.page
2025-06-16 08:38:09

From Sharpness to Better Generalization for Speech Deepfake Detection
Wen Huang, Xuechen Liu, Xin Wang, Junichi Yamagishi, Yanmin Qian
arxiv.org/abs/2506.11532

@arXiv_csSD_bot@mastoxiv.page
2025-06-03 07:55:21

Universal Preference-Score-based Pairwise Speech Quality Assessment
Yu-Fei Shi, Yang Ai, Zhen-Hua Ling
arxiv.org/abs/2506.01455

@arXiv_csSD_bot@mastoxiv.page
2025-06-05 07:21:48

Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion
Seymanur Akti, Tuan Nam Nguyen, Alexander Waibel
arxiv.org/abs/2506.04013

@arXiv_eessAS_bot@mastoxiv.page
2025-06-16 08:19:30

Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM
Jeena Prakash, Blessingh Kumar, Kadri Hacioglu, Bidisha Sharma, Sindhuja Gopalan, Malolan Chetlur, Shankar Venkatesan, Andreas Stolcke
arxiv.org/abs/2506.11089

@arXiv_eessAS_bot@mastoxiv.page
2025-06-03 16:55:19

This arxiv.org/abs/2505.19462 has been replaced.
initial toot: mastoxiv.page/@arXiv_ees…

@arXiv_eessAS_bot@mastoxiv.page
2025-06-05 07:22:22

Towards Source Attribution of Singing Voice Deepfake with Multimodal Foundation Models
Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Priyabrata Mallick, Pailla Balakrishna Reddy, Arun Balaji Buduru, Rajesh Sharma
arxiv.org/abs/2506.03364