Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csSD_bot@mastoxiv.page
2025-06-12 07:57:21

Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction
Wenxuan Wu, Shuai Wang, Xixin Wu, Helen Meng, Haizhou Li
arxiv.org/abs/2506.09792

@trochee@dair-community.social
2025-06-12 00:21:24

OH in corpus-linguistic UX convo:
> He who go too far down long tail end up wagging dog

@arXiv_csHC_bot@mastoxiv.page
2025-06-13 07:39:20

Intergenerational AI Literacy in Korean Immigrant Families: Interpretive Gatekeeping Meets Convenient Critical Deferment
Jeongone Seo, Ryan Womack, Tawfiq Ammari
arxiv.org/abs/2506.10197

@arXiv_eessAS_bot@mastoxiv.page
2025-06-12 08:05:21

You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks
\"Unal Ege Gaznepoglu, Anna Leschanowsky, Ahmad Aloradi, Prachi Singh, Daniel Tenbrinck, Emanu\"el A. P. Habets, Nils Peters
arxiv.org/abs/2506.09521

@arXiv_eessSY_bot@mastoxiv.page
2025-06-11 09:07:15

Linguistic Ordered Weighted Averaging based deep learning pooling for fault diagnosis in a wastewater treatment plant
Alicia Beneyto-Rodriguez, Gregorio I. Sainz-Palmero, Marta Galende-Hern\'andez, Mar\'ia J. Fuente
arxiv.org/abs/2506.08676

@arXiv_csCV_bot@mastoxiv.page
2025-06-10 19:09:11

This arxiv.org/abs/2506.03589 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCV_…

@arXiv_csCY_bot@mastoxiv.page
2025-06-06 07:17:42

Early linguistic fingerprints of online users who engage with conspiracy communities
Francesco Corso, Giuseppe Russo, Francesco Pierri, Gianmarco De Francisci Morales
arxiv.org/abs/2506.05086

@arXiv_csMM_bot@mastoxiv.page
2025-06-04 07:23:10

StarVC: A Unified Auto-Regressive Framework for Joint Text and Speech Generation in Voice Conversion
Fengjin Li, Jie Wang, Yadong Niu, Yongqing Wang, Meng Meng, Jian Luan, Zhiyong Wu
arxiv.org/abs/2506.02414

@tschfflr@fediscience.org
2025-06-04 07:19:14

Students, THIS is how generative AI (ChatGPT et alia) "reads" papers and "understands" content. It's a bullshit machine, a gaslighting machine. It shows the linguistic behavior of a psychopath (is this what us humans average to, if one trains on all our "content" and online behavior?). Yikes.
amandaguinzburg.substack.com/p

@arXiv_csSD_bot@mastoxiv.page
2025-06-09 07:54:52

Voice Impression Control in Zero-Shot TTS
Keinichi Fujita, Shota Horiguchi, Yusuke Ijima
arxiv.org/abs/2506.05688 arx…

@hakona@im.alstadheim.no
2025-06-03 20:32:38

We have in fact encountered similar things to "linguistic sequence processing". Bullshit artists, politicians with the "truthiness" of Ronald Reagan (coined by a comedian at the time). I have had personal experience being a rabbit in a spotlight where this kind of "thinking" kicks in. Got out of that game, thankfully.

@arXiv_csAI_bot@mastoxiv.page
2025-06-05 09:36:37

This arxiv.org/abs/2502.00698 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 08:21:31

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Mustafa Shukor, Dana Aubakirova, Francesco Capuano, Pepijn Kooijmans, Steven Palma, Adil Zouitine, Michel Aractingi, Caroline Pascal, Martino Russi, Andres Marafioti, Simon Alibert, Matthieu Cord, Thomas Wolf, Remi Cadene
arxiv.org/abs/2506.018…

@tanyakaroli@expressional.social
2025-05-29 11:33:07

Den fŸlelse når man er fan af en ny podcast - og så selv bliver inviteret på den! 🤓🎉
podcasts.apple.com/dk/podcast/

@arXiv_csCL_bot@mastoxiv.page
2025-06-03 08:20:50

From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation
Serry Sibaee, Omer Nacar, Adel Ammar, Yasser Al-Habashi, Abdulrahman Al-Batati, Wadii Boulila
arxiv.org/abs/2506.01920

@arXiv_csSD_bot@mastoxiv.page
2025-06-11 08:08:45

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Ailin Huang, Bingxin Li, Bruce Wang, Boyong Wu, Chao Yan, Chengli Feng, Heng Wang, Hongyu Zhou, Hongyuan Wang, Jingbei Li, Jianjian Sun, Joanna Wang, Mingrui Chen, Peng Liu, Ruihang Miao, Shilei Jiang, Tian Fei, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Ge, Zheng Gong, Zhewei Huang, Zixin Zhang, Bin Wang, Bo Li, Buyun Ma, Changxin Miao, Changyi Wan, Chen Xu, Dapeng Shi, Dingyuan Hu, Enle…

@arXiv_eessAS_bot@mastoxiv.page
2025-06-10 16:49:59

This arxiv.org/abs/2410.00527 has been replaced.
initial toot: mastoxiv.page/@arXiv_ees…

@arXiv_csAI_bot@mastoxiv.page
2025-06-05 09:45:09

This arxiv.org/abs/2506.02139 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_astrophIM_bot@mastoxiv.page
2025-06-04 07:45:38

An Exploratory Framework for Future SETI Applications: Detecting Generative Reactivity via Language Models
Po-Chieh Yu
#toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-06-10 08:39:12

Rhythm Features for Speaker Identification
Nick Mehlman, Thomas Thebaud, Dani Byrd, Shri Narayanan
arxiv.org/abs/2506.06834

@arXiv_csMM_bot@mastoxiv.page
2025-06-02 09:58:31

This arxiv.org/abs/2505.23018 has been replaced.
initial toot: mastoxiv.page/@arXiv_csMM_…

@arXiv_csSD_bot@mastoxiv.page
2025-06-04 07:33:42

Breaking the Barriers of Text-Hungry and Audio-Deficient AI
Hamidou Tembine, Issa Bamia, Massa NDong, Bakary Coulibaly, Oumar Issiaka Traore, Moussa Traore, Moussa Sanogo, Mamadou Eric Sangare, Salif Kante, Daryl Noupa Yongueng, Hafiz Tiomoko Ali, Malik Tiomoko, Frejus Laleye, Boualem Djehiche, Wesmanegda Elisee Dipama, Idris Baba Saje, Hammid Mohammed Ibrahim, Moumini Sanogo, Marie Coursel Nininahazwe, Abdul-Latif Siita, Haine Mhlongo, Teddy Nelvy Dieu Merci Kouka, Mariam Serine Jerid…

@arXiv_csSD_bot@mastoxiv.page
2025-06-05 07:21:48

Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion
Seymanur Akti, Tuan Nam Nguyen, Alexander Waibel
arxiv.org/abs/2506.04013

@arXiv_csMM_bot@mastoxiv.page
2025-05-30 09:54:17

This arxiv.org/abs/2503.01879 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_eessAS_bot@mastoxiv.page
2025-06-03 16:15:00

This arxiv.org/abs/2409.03636 has been replaced.
initial toot: mastoxiv.page/@arXiv_ees…

@arXiv_csMM_bot@mastoxiv.page
2025-05-30 07:19:35

EmotionTalk: An Interactive Chinese Multimodal Emotion Dataset With Rich Annotations
Haoqin Sun, Xuechen Wang, Jinghua Zhao, Shiwan Zhao, Jiaming Zhou, Hui Wang, Jiabei He, Aobo Kong, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin
arxiv.org/abs/2505.23018

@arXiv_eessAS_bot@mastoxiv.page
2025-06-02 10:04:35

This arxiv.org/abs/2505.15004 has been replaced.
initial toot: mastoxiv.page/@arXiv_ees…