Tootfinder

Opt-in global Mastodon full text search. Join the index!

@karlauerbach@sfba.social
2025-09-23 00:52:19

I have long shied away from Broadcom networking products.
But I've been using VMWare with the freely available ESXI that was withdrawn when Broadcom bought VMWare.
Well, there is a new free Esxi - 8.0 update 3. I guess Broadcom realized that blocking free use was damaging their sales of paid-products.
Anyway, I just loaded it onto a machine with an Intel Generation 15 Ultra 7.
Well, out of the box this new Esxi won't go beyond Intel's generation 11. Wow,…

@netzschleuder@social.skewed.de
2025-07-23 06:00:04

jdk: Java SE Dev Kit dependencies (1.6.0.7)
A network of class dependencies within the JDK (Java SE Development Kit) 1.6.0.7 framework. Nodes represent classes and a directed edge indicates a dependency of one class on another.
This network has 6434 nodes and 150985 edges.
Tags: Technological, Software, Unweighted, Multigraph

jdk: Java SE Dev Kit dependencies (1.6.0.7). 6434 nodes, 150985 edges. https://networks.skewed.de/net/jdk
@cowboys@darktundra.xyz
2025-07-23 01:34:22

Pic 6: Training Camp Notebook - 7-22-25 dallascowboys.com/news/pic-6-t

@heiseonline@social.heise.de
2025-09-16 12:25:00

Pixel 7 und 7 Pro: Nutzerbeschwerden über aufgeblähte Akkus
Google scheint weitere Smartphone-Modelle mit Akkuproblemen zu haben: Nutzerberichten zufolge kommt es beim Pixel 7 und 7 Pro zu aufgeblähten Batterien.

@raiders@darktundra.xyz
2025-07-23 23:53:40

Training Camp Notebook 7/23: Young guys prepared to compete raiders.com/news/training-camp

@NFL@darktundra.xyz
2025-07-22 02:01:42

D.C. mayor focused on getting Commanders stadium deal done: 'Nobody is waiting in the wings with $2.7 billion'

cbssports.com/nfl/news/d…

@Techmeme@techhub.social
2025-07-23 20:34:14

IBM reports Q2 revenue up 8% YoY to $16.98B, vs. $16.59B est., and software revenue up 10% to $7.39B, vs. $7.49B est.; IBM drops 5% after hours (Brody Ford/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 07:39:10

[2025-07-22 Tue (UTC), 7 new articles found for eess.AS Audio and Speech Processing]
toXiv_bot_toot

@vosje62@mastodon.nl
2025-09-23 17:30:50

Vanaf 1 oktober 30 kmh in Haarlem.
Op de kaart vind je waar.
Bron:
kaart.haarlem.nl/app/map/49?zo

Legenda
Deze kaart laat zien welke wegen van 50 km/u naar 30 km/u gaan en wanneer dat gebeurt. Klik op i (informatie knop) rechtsboven in dit tekst blok voor meer informatie en gebruik van de kaart.

1 okt 2025	

Bij geplande herinrichting	

Nog niet gepland	

Blijft 50km/u
@servelan@newsie.social
2025-07-23 00:41:45

Is Trump preparing to pardon Epstein's notorious accomplice?
dailykos.com/stories/2025/7/22

@rainerzufall_le@mastodon.social
2025-07-22 04:15:07

#pastpuzzle 80
🟩🟨🟥🟥 ( 7)
🟩🟩🟩🟥 (-7)
🟩🟩🟩🟥 (-3)
🟩🟩🟩🟥 (-1)
x/4 🟥
pastpuzzle.de

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-22 08:20:20

Anisotropic Anderson localization in higher-dimensional nonreciprocal lattices
Jinyuan Shang, Haiping Hu
arxiv.org/abs/2507.14523

@arXiv_physicsdataan_bot@mastoxiv.page
2025-07-22 16:39:57

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- High-Performance Data Format for Scientific Data Storage and Analysis
Gagik Gavalian

@stargazer@woof.tech
2025-07-23 14:46:03

Row row
Fight the power
#photo

@cowboys@darktundra.xyz
2025-07-23 00:33:39

Pic 6: Training Camp Notebook - 7-22-25 dallascowboys.com/news/pic-6-t

@raiders@darktundra.xyz
2025-07-23 22:16:53

Training Camp Notebook 7/23: Young guys prepared to compete raiders.com/news/training-camp

@Techmeme@techhub.social
2025-09-23 21:17:05

OpenAI, Oracle, and SoftBank announce five new data center locations in the US, boosting Stargate's planned capacity to nearly 7 GW (Wired)
wired.com/story/openai-oracle-

@servelan@newsie.social
2025-07-23 00:49:59

The military is spying on bathrooms—and you're paying for it
dailykos.com/stories/2025/7/22

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 08:14:20

Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion
Yu Zhang, Baotong Tian, Zhiyao Duan
arxiv.org/abs/2507.14534

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-22 08:19:20

Siamese Neural Network for Label-Efficient Critical Phenomena Prediction in 3D Percolation Models
Shanshan Wang, Dian Xu, Jianmin Shen, Feng Gao, Wei Li, Weibing Deng
arxiv.org/abs/2507.14159

@arXiv_physicsdataan_bot@mastoxiv.page
2025-07-22 08:22:00

[2025-07-22 Tue (UTC), no new articles found for physics.data-an Data Analysis, Statistics and Probability]
toXiv_bot_toot

@stargazer@woof.tech
2025-07-23 08:05:27

When you see a mini-Boss you know you're going the right direction.
>The industry filed false claims against the "Stop Killing Games" initiative
youtube.com/watch?v=fQN_ZA5WRp

@Techmeme@techhub.social
2025-07-22 19:50:57

Sources: dozens of xAI staff voiced concerns over a program granting xAI "perpetual" access to data like their "likeness" for training; several did not consent (Grace Kay/Business Insider)
businessinsider.com/xai-grok-t

@servelan@newsie.social
2025-07-23 15:19:00

Women’s contributions and men’s racism erased from history of national monument
dailykos.com/stories/2025/7/23

@cowboys@darktundra.xyz
2025-07-23 18:39:47

Cowboys writer thinks $6.7 million playmaker won't return to Dallas next season sportingnews.com/us/nfl/dallas

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 07:58:20

Adapting Whisper for Lightweight and Efficient Automatic Speech Recognition of Children for On-device Edge Applications
Satwik Dutta, Shruthigna Chandupatla, John Hansen
arxiv.org/abs/2507.14451

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-22 07:50:00

[2025-07-22 Tue (UTC), 2 new articles found for cond-mat.dis-nn Disordered Systems and Neural Networks]
toXiv_bot_toot

@arXiv_physicsdataan_bot@mastoxiv.page
2025-07-23 08:26:32

[2025-07-23 Wed (UTC), no new articles found for physics.data-an Data Analysis, Statistics and Probability]
toXiv_bot_toot

@stargazer@woof.tech
2025-07-23 06:05:43

068 Вирішення проблем
#comic #ТемнаНаука #переклади

@servelan@newsie.social
2025-07-23 00:53:05

MAGA Death Threats Over Editorial Cartoon — The Week in Editorial Cartoons (Update #12)
dailykos.com/stories/2025/7/13

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 07:50:30

Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling
Xuanru Zhou, Jiachen Lian, Cheol Jun Cho, Tejas Prabhune, Shuhe Li, William Li, Rodrigo Ortiz, Zoe Ezzes, Jet Vonk, Brittany Morin, Rian Bogley, Lisa Wauters, Zachary Miller, Maria Gorno-Tempini, Gopala Anumanchipalli
arxiv.org/abs/2507.14346…

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-23 12:49:50

Replaced article(s) found for cond-mat.dis-nn. arxiv.org/list/cond-mat.dis-nn
[1/1]:
- Signature of glassy dynamics in dynamic modes decompositions
Zachary G. Nicolaou, Hangjun Cho, Yuanzhao Zhang, J. Nathan Kutz, Steven L. Brunton

@arXiv_physicsdataan_bot@mastoxiv.page
2025-08-22 10:46:31

Crosslisted article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Hierarchical Maximum Likelihood Estimation for Time-Resolved NMR Data
Lennart H. Bosch, et al.

@servelan@newsie.social
2025-07-23 14:56:22

Aid groups warn of ‘mass starvation’ in Gaza | Israel-Palestine conflict News | Al Jazeera
aljazeera.com/gallery/2025/7/2

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 17:03:23

Replaced article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/1]:
- Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems
Park, Medennikov, Dhawan, Wang, Huang, Koluguri, Puvvada, Balam, Ginsburg

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-23 08:22:32

Building Intuition for Dynamical Mean-Field Theory: A Simple Model and the Cavity Method
Emmy Blumenthal
arxiv.org/abs/2507.16654

@arXiv_physicsdataan_bot@mastoxiv.page
2025-08-22 07:50:41

[2025-08-22 Fri (UTC), no new articles found for physics.data-an Data Analysis, Statistics and Probability]
toXiv_bot_toot

@servelan@newsie.social
2025-09-23 20:21:46

'This man is stark raving mad': Senior diplomat reacts in real-time to Trump UN speech - Raw Story
rawstory.com/trump-un-26740256

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 09:21:40

Binaural Signal Matching with Wearable Arrays for Near-Field Sources
Sapir Goldring, Zamir Ben Hur, David Lou Alon, Boaz Rafaely
arxiv.org/abs/2507.15517

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-23 08:19:42

False signatures of non-ergodic behavior in disordered quantum many-body systems
Adith Sai Aramthottil, Ali Emami Kopaei, Piotr Sierant, Lev Vidmar, Jakub Zakrzewski
arxiv.org/abs/2507.16567

@arXiv_physicsdataan_bot@mastoxiv.page
2025-08-22 11:55:00

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Multi-Exit Kolmogorov-Arnold Networks: enhancing accuracy and parsimony
James Bagrow, Josh Bongard

@servelan@newsie.social
2025-07-23 22:17:14

ACA Subsidies Expire: Annual Health Insurance Costs to Rise up to $1K - Business Insider
businessinsider.com/aca-subsid

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 09:13:30

Mixture to Beamformed Mixture: Leveraging Beamformed Mixture as Weak-Supervision for Speech Enhancement and Noise-Robust ASR
Zhong-Qiu Wang, Ruizhe Pang
arxiv.org/abs/2507.15229

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-23 08:02:42

Anomalous thermal activation of the electron glass dynamics in a-InOx and granular aluminum
Thierry Grenet, Julien Delahaye
arxiv.org/abs/2507.16016

@arXiv_physicsdataan_bot@mastoxiv.page
2025-09-23 09:57:40

Particle Identification with MLPs and PINNs Using HADES Data
Marvin Kohls
arxiv.org/abs/2509.17685 arxiv.org/pdf/2509.17685

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 09:01:00

DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis
Yinghao Aaron Li, Xilin Jiang, Fei Tao, Cheng Niu, Kaifeng Xu, Juntong Song, Nima Mesgarani
arxiv.org/abs/2507.14988

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-23 07:51:12

[2025-07-23 Wed (UTC), 3 new articles found for cond-mat.dis-nn Disordered Systems and Neural Networks]
toXiv_bot_toot

@arXiv_physicsdataan_bot@mastoxiv.page
2025-09-23 08:56:10

Comment on Frank Porter, "Confidence intervals for the Poisson distribution"
Robert D. Cousins
arxiv.org/abs/2509.17339 arxiv.org…

@arXiv_eessAS_bot@mastoxiv.page
2025-07-22 08:53:00

Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification
Susmita Bhattacharjee, Jagabandhu Mishra, H. S. Shekhawat, S. R. Mahadeva Prasanna
arxiv.org/abs/2507.14898

@arXiv_condmatdisnn_bot@mastoxiv.page
2025-07-22 16:41:06

Replaced article(s) found for cond-mat.dis-nn. arxiv.org/list/cond-mat.dis-nn
[1/1]:
- How to Train an Oscillator Ising Machine using Equilibrium Propagation
Alex Gower

@arXiv_physicsdataan_bot@mastoxiv.page
2025-09-23 08:00:30

[2025-09-23 Tue (UTC), 2 new articles found for physics.data-an Data Analysis, Statistics and Probability]
toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-07-23 12:43:32

Replaced article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/1]:
- ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting
Yu Zhang, Wenxiang Guo, Changhao Pan, Zhiyuan Zhu, Tao Jin, Zhou Zhao

@arXiv_physicsdataan_bot@mastoxiv.page
2025-09-22 12:35:35

Replaced article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Decomposing Interventional Causality into Synergistic, Redundant, and Unique Components
Abel Jansma

@arXiv_eessAS_bot@mastoxiv.page
2025-07-23 08:23:52

An approach to measuring the performance of Automatic Speech Recognition (ASR) models in the context of Large Language Model (LLM) powered applications
Sujith Pulikodan, Sahapthan K, Prasanta Kumar Ghosh, Visruth Sanka, Nihar Desai
arxiv.org/abs/2507.16456

@arXiv_physicsdataan_bot@mastoxiv.page
2025-09-22 11:22:35

Crosslisted article(s) found for physics.data-an. arxiv.org/list/physics.data-an
[1/1]:
- Detail Across Scales: Multi-Scale Enhancement for Full Spectrum Neural Representations
Yuan Ni, Zhantao Chen, Cheng Peng, Rajan Plumley, Chun Hong Yoon, Jana B. Thayer, Jo…

@arXiv_eessAS_bot@mastoxiv.page
2025-07-23 08:23:22

Distributed Asynchronous Device Speech Enhancement via Windowed Cross-Attention
Gene-Ping Yang, Sebastian Braun
arxiv.org/abs/2507.16104

@arXiv_physicsdataan_bot@mastoxiv.page
2025-09-22 07:56:31

[2025-09-22 Mon (UTC), no new articles found for physics.data-an Data Analysis, Statistics and Probability]
toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-07-23 07:49:02

[2025-07-23 Wed (UTC), 2 new articles found for eess.AS Audio and Speech Processing]
toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-08-22 11:31:23

Replaced article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/1]:
- Versatile Framework for Song Generation with Prompt-based Control
Zhang, Guo, Pan, Zhu, Li, Lu, Huang, Zhang, Hong, Jiang, Zhao

@arXiv_eessAS_bot@mastoxiv.page
2025-08-22 10:52:55

Crosslisted article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/1]:
- Denoising by neural network for muzzle blast detection
Hadrien Pujol, Matteo Bevillacqua, Christophe Thirard, Thierry Mazoyer

@arXiv_eessAS_bot@mastoxiv.page
2025-08-22 07:42:20

Mitigating Hallucinations in LM-Based TTS Models via Distribution Alignment Using GFlowNets
Chenlin Liu, Minghui Fang, Patrick Zhang, Wei Zhou, Jie Gao, Jiqing Han
arxiv.org/abs/2508.15442

@arXiv_eessAS_bot@mastoxiv.page
2025-08-22 07:39:10

Transsion Multilingual Speech Recognition System for MLC-SLM 2025 Challenge
Xiaoxiao Li, An Zhu, Youhai Jiang, Fengjie Zhu
arxiv.org/abs/2508.14916

@arXiv_eessAS_bot@mastoxiv.page
2025-08-22 07:37:20

A Chinese Heart Failure Status Speech Database with Universal and Personalised Classification
Yue Pan, Liwei Liu, Changxin Li, Xinyao Wang, Yili Xia, Hanyue Zhang, Ming Chu
arxiv.org/abs/2508.14908

@arXiv_eessAS_bot@mastoxiv.page
2025-08-22 07:37:10

[2025-08-22 Fri (UTC), 4 new articles found for eess.AS Audio and Speech Processing]
toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 12:30:00

Replaced article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/1]:
- Rethinking Speaker Embeddings for Speech Generation: Sub-Center Modeling for Capturing Intra-Spea...
Ismail Rasim Ulgen, John H. L. Hansen, Carlos Busso, Berrak Sisman

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 11:36:07

Crosslisted article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[2/2]:
- TISDiSS: A Training-Time and Inference-Time Scalable Framework for Discriminative Source Separation
Yongsheng Feng, Yuetonghui Xu, Jiehui Luo, Hongjia Liu, Xiaobing Li, Feng Yu, Wei Li

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 11:35:50

Crosslisted article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/2]:
- Emotion-Aware Speech Generation with Character-Specific Voices for Comics
Zhiwen Qian, Jinhua Liang, Huan Zhang

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:36:01

Are Multimodal Foundation Models All That Is Needed for Emofake Detection?
Mohd Mujtaba Akhtar, Girish, Orchid Chetia Phukan, Swarup Ranjan Behera, Pailla Balakrishna Reddy, Ananda Chandra Nayak, Sanjib Kumar Nayak, Arun Balaji Buduru
arxiv.org/abs/2509.16193

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:31:01

Rethinking Cross-Corpus Speech Emotion Recognition Benchmarking: Are Paralinguistic Pre-Trained Representations Sufficient?
Orchid Chetia Phukan, Mohd Mujtaba Akhtar, Girish, Swarup Ranjan Behera, Parabattina Bhagath, Pailla Balakrishna Reddy, Arun Balaji Buduru
arxiv.org/abs/2509.16182

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:25:31

Interpreting the Role of Visemes in Audio-Visual Speech Recognition
Aristeidis Papadopoulos, Naomi Harte
arxiv.org/abs/2509.16023 arxiv.org…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:19:11

VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency
Nikita Torgashov, Gustav Eje Henter, Gabriel Skantze
arxiv.org/abs/2509.15969

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:16:21

Sound Separation and Classification with Object and Semantic Guidance
Younghoo Kwon, Jung-Woo Choi
arxiv.org/abs/2509.15899 arxiv.org/pdf/2…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:01:21

Deep Dubbing: End-to-End Auto-Audiobook System with Text-to-Timbre and Context-Aware Instruct-TTS
Ziqi Dai, Yiting Chen, Jiacheng Xu, Liufei Xie, Yuchen Wang, Zhenchuan Yang, Bingsong Bai, Yangsheng Gao, Wenjiang Zhou, Weifeng Zhao, Ruohua Zhou
arxiv.org/abs/2509.15845

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 09:01:01

A Steered Response Power Method for Sound Source Localization With Generic Acoustic Models
Kaspar M\"uller, Markus Buck, Simon Doclo, Jan {\O}stergaard, Tobias Wolff
arxiv.org/abs/2509.15702

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 08:57:01

Rec-RIR: Monaural Blind Room Impulse Response Identification via DNN-based Reverberant Speech Reconstruction in STFT Domain
Pengyu Wang, Xiaofei Li
arxiv.org/abs/2509.15628

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 08:50:41

MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection
Jun-Wei Yeow, Ee-Leng Tan, Santi Peksi, Woon-Seng Gan
arxiv.org/abs/2509.15599

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 08:35:21

AFT: An Exemplar-Free Class Incremental Learning Method for Environmental Sound Classification
Xinyi Chen, Xi Chen, Zhenyu Weng, Yang Xiao
arxiv.org/abs/2509.15523

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 08:10:01

State-of-the-Art Dysarthric Speech Recognition with MetaICL for on-the-fly Personalization
Dhruuv Agarwal, Harry Zhang, Yang Yu, Quan Wang
arxiv.org/abs/2509.15516

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 08:02:21

Breathing and Semantic Pause Detection and Exertion-Level Classification in Post-Exercise Speech
Yuyu Wang, Wuyue Xia, Huaxiu Yao, Jingping Nie
arxiv.org/abs/2509.15473

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 07:37:31

Pre-training Autoencoder for Acoustic Event Classification via Blinky
Xiaoyang Liu, Yuma Kinoshita
arxiv.org/abs/2509.15261 arxiv.org/pdf/2…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-22 07:37:21

[2025-09-22 Mon (UTC), 13 new articles found for eess.AS Audio and Speech Processing]
toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 08:31:30

Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments
Anup Singh, Kris Demuynck
arxiv.org/abs/2509.16480 a…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 08:05:10

Sound field estimation with moving microphones using kernel ridge regression
Jesper Brunnstr\"om, Martin Bo M{\o}ller, Jan {\O}stergaard, Shoichi Koyama, Toon van Waterschoot, Marc Moonen
arxiv.org/abs/2509.16358

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 07:49:20

Similarity-Guided Diffusion for Long-Gap Music Inpainting
Sean Turland, Eloi Moliner, Vesa V\"alim\"aki
arxiv.org/abs/2509.16342

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 07:41:09

Investigating Polyglot Speech Foundation Models for Learning Collective Emotion from Crowds
Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Panchal Nayak, Priyabrata Mallick, Swarup Ranjan Behera, Parabattina Bhagath, Pailla Balakrishna Reddy, Arun Balaji Buduru
arxiv.org/abs/2509.16329

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 07:38:59

[2025-09-23 Tue (UTC), 27 new articles found for eess.AS Audio and Speech Processing]
toXiv_bot_toot

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 17:10:26

Replaced article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[2/2]:
- Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement
Gang Yang, Yue Lei, Wenxin Tai, Jin Wu, Jia Chen, Ting Zhong, Fan Zhou

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 17:09:42

Replaced article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/2]:
- Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement
Yudong Yang, Zhan Liu, Wenyi Yu, Guangzhi Sun, Qiuqiang Kong, Chao Zhang

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 14:48:06

Crosslisted article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[2/2]:
- STAR: Speech-to-Audio Generation via Representation Learning
Zeyu Xie, Xuenan Xu, Yixuan Li, Mengyue Wu, Yuexian Zou

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 14:47:35

Crosslisted article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/2]:
- LenslessMic: Audio Encryption and Authentication via Lensless Computational Imaging
Petr Grinberg, Eric Bezzam, Paolo Prandoni, Martin Vetterli

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 11:22:20

Nord-Parl-TTS: Finnish and Swedish TTS Dataset from Parliament Speech
Zirui Li, Jens Edlund, Yicheng Gu, Nhan Phan, Lauri Juvela, Mikko Kurimo
arxiv.org/abs/2509.17988

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 11:20:20

Benchmarking Humans and Machines on Complex Multilingual Speech Understanding Tasks
Sai Samrat Kankanala, Ram Chandra, Sriram Ganapathy
arxiv.org/abs/2509.17965

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 11:18:41

GAN-Based Multi-Microphone Spatial Target Speaker Extraction
Shrishti Saha Shetu, Emanu\"el A. P. Habets, Andreas Brendel
arxiv.org/abs/2509.17741

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 11:01:51

Comparator Loss: An Ordinal Contrastive Loss to Derive a Severity Score for Speech-based Health Monitoring
Jacob J Webber, Oliver Watts, Lovisa Wihlborg, David Wheatley, Johnny Tam, Christine Weaver, Suvankar Pal, Siddharthan Chandran, Cassia Valentini-Botinhao
arxiv.org/abs/2509.17661

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 10:57:41

Audiobook-CC: Controllable Long-context Speech Generation for Multicast Audiobook
Min Liu, JingJing Yin, Xiang Zhang, Siyu Hao, Yanni Hu, Bin Lin, Yuan Feng, Hongbin Zhou, Jianhao Ye
arxiv.org/abs/2509.17516

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 10:50:01

FUN-SSL: Full-band Layer Followed by U-Net with Narrow-band Layers for Multiple Moving Sound Source Localization
Yuseon Choi, Hyeonseung Kim, Jewoo Jun, Jong Won Shin
arxiv.org/abs/2509.17490

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 10:37:40

Neural acoustic multipole splatting for room impulse response synthesis
Geonwoo Baek, Jung-Woo Choi
arxiv.org/abs/2509.17410 arxiv.org/pdf/…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 10:35:30

SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription
Wei Tan, Shun Lei, Huaicheng Zhang, Guangzheng Li, Yixuan Zhang, Hangting Chen, Jianwei Yu, Rongzhi Gu, Dong Yu
arxiv.org/abs/2509.17404

@arXiv_eessAS_bot@mastoxiv.page
2025-09-23 10:31:41

Improving Active Learning for Melody Estimation by Disentangling Uncertainties
Aayush Jaiswal, Parampreet Singh, Vipul Arora
arxiv.org/abs/2509.17375