Tootfinder

Opt-in global Mastodon full text search. Join the index!

@izzychambers@vivaldi.net
2025-06-24 14:00:18

@… don't use it

@izzychambers@vivaldi.net
2025-06-24 13:54:04

@… You have a bright future ahead of you!

@shriramk@mastodon.social
2025-06-24 17:39:10

Edmund Burke, describing DOGE.

 Young men (boys almost) govern there, without society, and without
sympathy with the natives. They have no more social habits with the people, than
if they still resided in England; nor indeed any species of intercourse but that
which is necessary to making a sudden fortune, with a view to a remote
settlement. Animated with all the avarice of age, and all the impetuosity of youth,
they roll in one after another; wave after wave; and there is nothing before the
eyes of the natives but an endles…
@_tillwe_@mastodon.social
2025-07-24 20:03:00

Good night!

A black cat sleeping curled up, holding its paws in the air.
@weddingweiser@berlin.social
2025-06-23 07:42:27

Good morning, Wedding! Usedomer Straße

Tree-lined street in Berlin-Wedding with a view of the spire of the Sebastiankirche rising above the trees.
@Erikmitk@mastodon.gamedev.place
2025-07-24 10:08:08

#nytimes introduced Midi crosswords via their newsletter this week. So far, so cool!
But how do I find the current one rather than adjusting the URL from the first one via newsletter to today’s date? I cannot see Midi anywhere on the #nytgames website nor in the app.

URL bar in mobile Safari showing the URL for the first Midi crossword on the NYT website, with the path ‘crosswords/game/paid/midi-07-24-25’ highlighted.
Menu of the NYT Games section, shown as a list with no entry for the Midi. Visible are:

The Crossword
The Mini
Connections
Spelling Bee
Wordle
Letter Boxed
Strands
Tiles
@adrianco@mastodon.social
2025-07-23 19:01:39

The dove pair that nest in our eaves just successfully raised these two chicks. They were alone on the nest yesterday and were gone today. They raised a single chick on the same nest earlier this year. Very clean, we see the family return every year. #naturephotography #birdphotography

Two dove chicks on a nest with white painted beam behind.
@shriramk@mastodon.social
2025-07-23 13:07:33

Oh lordy, what a timeline.

search engine completion for "can you sell a book" with "written by ai"
@toxi@mastodon.thi.ng
2025-06-22 13:01:26

Various thi.ng updates, bug fixes, additions and new version of github.com/thi-ng/zig-thing/ — now fully compatible with current Zig v0.14.1
On a more diary/devlog note: I also updated several of my Zig based work-in-progress art pieces to the latest version (some of them not touc…

Still image/poster of DANZA, an abstract, generative physics-based realtime animation. The composition shows multiple overlapping patches of cloth sims, each represented by tens of thousands of small dots, each patch in different colors.
Still image of S-TRACE, an abstract generative realtime animation based on omnidirectional sphere tracing and multiple agents exploring the constantly changing positive & negative spaces
@bici@mastodon.social
2025-08-23 00:01:05

The refurbished sign (2006) for The Only Sea Foods Restaurant at 20 East Hastings Street.
#yvr #vancouver #Neon

@gardenscorpion@osna.social
2025-06-23 10:06:34

🤡
mobil.osnabrueck.de/de/aktuell…

@nohillside@smnn.ch
2025-07-21 17:36:33

„LLMs promise that you can be a part of a community and all the status and clout that comes with without putting in the work. And I do not see that there is a path to mend that foundational issue.“
tante.cc/2025/07/21/but-will-t

@aral@mastodon.ar.al
2025-07-21 10:22:10

🥳 New Kitten Release
• Improved Markdown parser
Kitten’s JavaScript tagged template strings (`kitten.html`) no longer fail to render as expected when interpolated values are used inside of Markdown where the Markdown render changes source order.
So, for example, the following will now work correctly, whereas, previously, the link source and link text would have been erroneously flipped:
kitten.html`
<markdown>
[${linkText}](${linkSource})
…

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 07:59:22

A new XML conversion process for mensural music encoding : CMME_to_MEI (via Verovio)
David Fiala (CESR), Laurent Pugin (KNAW), Marnix van Berchum (KNAW), Martha Thomae (NOVA), Kévin Roger (CESR, UL, CRULH)
arxiv.org/abs/2507.15991

@cjust@infosec.exchange
2025-07-14 12:23:16

#ShamelesslyStolenFromTumblr

ashestoashesic
me, a sensible boy, feeling a tickle: just your leg hair, calm down
caveman brain: it is so many spiders
@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 16:46:02

Replaced article(s) found for cond-mat.stat-mech. arxiv.org/list/cond-mat.stat-m
[1/1]:
- Quantum Potts Models on the Sierpiński Pyramid
Krčmár, Zelenayová, Genzor, Caha, Rapčan, Nishino, Gendiar

@Erikmitk@mastodon.gamedev.place
2025-08-22 19:05:14

Pretty happy with this Spongebob I drew from memory! 🧽

@adrianco@mastodon.social
2025-08-22 07:03:32

A personal update, I’m going to be based in Europe for most of the next two months. I’m currently in Weymouth UK setting up a long term rental home for us (we got tired of random AirBnBs) close to my aging parents. My father is in a care home now, and my mother is adjusting to it…

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 07:48:12

Nonlinear Framework for Speech Bandwidth Extension
Tarikul Islam Tamiti, Nursad Mamun, Anomadarshi Barua
arxiv.org/abs/2507.15970

@cjust@infosec.exchange
2025-07-14 20:11:51

There is no spoon.


Do Not try to get them to release the list. That's impossible. Instead only try to realize the truth.

What truth?

They're all on the list.
@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:52:30

Tensor network calculation of boundary and corner magnetization
Roman Krcmar, Jozef Genzor, Andrej Gendiar, Tomotoshi Nishino
arxiv.org/abs/2506.17194

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 07:40:02

[2025-07-23 Wed (UTC), 9 new articles found for cs.SD Sound]
toXiv_bot_toot

@cjust@infosec.exchange
2025-07-14 18:32:21

Steve Irwin died as he lived - with animals in his heart.
#ShamelesslyStolenFromTheOtherSite

Connor Stone

@connorstonehere
God: Welcome to heaven.
Me: Why is only Steve Irwin here?
God: Steve's the only person who
made the cut.
Me: Can he teach me about
animals?
God: No, you didn't make the cut.
I just like to show off Steve Irwin
to people before I drop them into
Hell.
10:17 AM - 22 Feb 20 - Twitter for Android
@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:40:10

Partition function for several Ising model interface structures
Alessio Squarcini, Piotr Nowakowski, Douglas B. Abraham, Anna Maciołek
arxiv.org/abs/2506.17170

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 12:41:59

Replaced article(s) found for cs.SD. arxiv.org/list/cs.SD/new
[1/1]:
- ReMi: A Random Recurrent Neural Network Approach to Music Production
Hugo Chateau-Laurent, Tara Vanhatalo, Wei-Tung Pan, Xavier Hinaut

@cjust@infosec.exchange
2025-07-14 18:05:25

from
https[:]//bsky.app/profile/sadiston.bsky.social/post/3kopghk5ljl23
#ShamelesslyStolenFromBlueSky

VIZUAL NINJA / PESTY LOVER
@sadiston.bsky.social
Excellent example of an armadillo getting a jump scare.
@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:37:00

Phase Transition of the Ising Model on a 3-Dimensional Fractal Lattice
Jozef Genzor, Roman Krčmár, Hiroshi Ueda, Denis Kochan, Andrej Gendiar, Tomotoshi Nishino
arxiv.org/abs/2506.17053

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:31:22

SALM: Spatial Audio Language Model with Structured Embeddings for Understanding and Editing
Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Jun Yang
arxiv.org/abs/2507.16724

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:35:30

Mean-field and Monte Carlo Analysis of Multi-Species Dynamics of agents
Eduardo Velasco Stock, Roberto da Silva, Sebastian Gonçalves
arxiv.org/abs/2506.16717

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:29:02

TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Yuxuan He, Xiaoran Yang, Ningning Pan, Gongping Huang
arxiv.org/abs/2507.16564

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:35:20

Crystal Nucleation Kinetics and Mechanism: Influence of Interaction Potential
Porhouy Minh, Steven W. Hall, Ryan S. DeFever, Sapna Sarupria
arxiv.org/abs/2506.16541

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:19:52

Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Pengfei Cai, Yan Song, Qing Gu, Nan Jiang, Haoyu Song, Ian McLoughlin
arxiv.org/abs/2507.16343

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:27:30

Transfer-matrix approach to the Blume-Capel model on the triangular lattice
Dimitrios Mataragkas, Alexandros Vasilopoulos, Nikolaos G. Fytas, Dong-Hee Kim
arxiv.org/abs/2506.16483

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:18:42

Robust Bioacoustic Detection via Richly Labelled Synthetic Soundscape Augmentation
Kaspar Soltero, Tadeu Siqueira, Stefanie Gutschmidt
arxiv.org/abs/2507.16235

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:19:00

A General Framework for Linking Free and Forced Fluctuations via Koopmanism
Valerio Lucarini, Manuel Santos Gutierrez, John Moroney, Niccolò Zagli
arxiv.org/abs/2506.16446

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:15:42

LENS-DF: Deepfake Detection and Temporal Localization for Long-Form Noisy Speech
Xuechen Liu, Wanying Ge, Xin Wang, Junichi Yamagishi
arxiv.org/abs/2507.16220

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:07:10

Method of canonical transformations in the theory of quantum gases interacting with radiation
M. S. Bulakhov, A. S. Peletminskii, P. P. Kostrobij, I. A. Ryzha, Yu. V. Slyusarenko
arxiv.org/abs/2506.16439

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:12:12

LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhancement
Haoyin Yan, Jie Zhang, Chengqian Jiang, Shuang Zhang
arxiv.org/abs/2507.16190

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:03:50

Endoreversible Stirling cycles: plasma engines at maximal power
Gregory Behrendt, Sebastian Deffner
arxiv.org/abs/2506.16303

@arXiv_csSD_bot@mastoxiv.page
2025-07-23 09:08:52

SDBench: A Comprehensive Benchmark Suite for Speaker Diarization
Eduardo Pacheco, Atila Orhon, Berkin Durmus, Blaise Munyampirwa, Andrey Leonov
arxiv.org/abs/2507.16136

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:03:00

Correcting systematic errors in the likelihood optimization of underdamped Langevin models of molecular dynamics trajectories
David Daniel Girardier, Hadrien Vroylandt, Sara Bonella, Fabio Pietrucci
arxiv.org/abs/2506.16272

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 08:52:00

Audio-Vision Contrastive Learning for Phonological Class Recognition
Daiqi Liu, Tomás Arias-Vergara, Jana Hutter, Andreas Maier, Paula Andrea Pérez-Toro
arxiv.org/abs/2507.17682

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 10:01:30

Microcanonical simulated annealing: Massively parallel Monte Carlo simulations with sporadic random-number generation
M. Bernaschi, L. A. Fernandez, I. González-Adalid Pemartín, E. Marinari, V. Martin-Mayor, G. Parisi, F. Ricci-Tersenghi, J. J. Ruiz-Lorenzo, D. Yllanes
arxiv.org/abs/2506.16240

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 08:41:10

BoSS: Beyond-Semantic Speech
Qing Wang, Zehan Li, Hang Lv, Hongjie Chen, Yaodong Song, Jian Kang, Jie Lian, Jie Li, Yongxiang Li, Zhongjiang He, Xuelong Li
arxiv.org/abs/2507.17563

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 09:53:40

Validity of generalized Gibbs ensemble in a random matrix model with a global $\mathbb{Z}_2$-symmetry
Adway Kumar Das
arxiv.org/abs/2506.16176

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 08:11:19

Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task
Milena Davudova, Ziyuan Cai, Valentina Giunchiglia, Dragos C. Gruia, Giulia Sanguedolce, Adam Hampshire, Fatemeh Geranmayeh
arxiv.org/abs/2507.17326

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 09:45:30

Microstates : Do the outliers worth
Olivier Sire
arxiv.org/abs/2506.16080 arxiv.org/pdf/2506.16080

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 08:02:49

On Temporal Guidance and Iterative Refinement in Audio Source Separation
Tobias Morocutti, Jonathan Greif, Paul Primus, Florian Schmid, Gerhard Widmer
arxiv.org/abs/2507.17297

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 09:34:10

Unifying renormalized and bare viscosity in two-dimensional molecular dynamics simulations
Kazuma Yokota, Masato Itami, Shin-ichi Sasa
arxiv.org/abs/2506.16002

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 07:38:09

Weak Supervision Techniques towards Enhanced ASR Models in Industry-level CRM Systems
Zhongsheng Wang, Sijie Wang, Jia Wang, Yung-I Liang, Yuxi Zhang, Jiamou Liu
arxiv.org/abs/2507.16843

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 09:10:50

Generalized Spectral Statistics in the Kicked Ising model
Divij Gupta, Brian Swingle
arxiv.org/abs/2506.15816 arxiv.o…

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 07:37:59

[2025-07-24 Thu (UTC), 5 new articles found for cs.SD Sound]
toXiv_bot_toot

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 08:53:20

Breakdown of the thermodynamic limit in quantum spin and dimer models
Jeet Shah, Laura Shou, Jeremy Shuler, Victor Galitski
arxiv.org/abs/2506.15769

@arXiv_csSD_bot@mastoxiv.page
2025-07-24 12:44:43

Replaced article(s) found for cs.SD. arxiv.org/list/cs.SD/new
[1/1]:
- Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Gui...
Hussain, Neekhara, Yang, Casanova, Ghosh, Desta, Fejgin, Valle, Li

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 08:47:50

Two Types of Temporal Symmetry in the Laws of Nature
A. Y. Klimenko
arxiv.org/abs/2506.15730 arxiv.org/pdf/2506.15730…

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:33:30

Universal Music Representations? Evaluating Foundation Models on World Music Corpora
Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos
arxiv.org/abs/2506.17055

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 08:38:10

Thermodynamics and Legendre Duality in Optimal Networks
Amilcare Porporato, Shashank Kumar Anand, Salvatore Calabrese, Luca Ridolfi, Lamberto Rondoni
arxiv.org/abs/2506.15727

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:33:20

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors
Junghyun Koo, Marco A. Martinez-Ramirez, Wei-Hsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji
arxiv.org/abs/2506.16889

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-23 08:17:00

[2025-06-23 Mon (UTC), 18 new articles found for cond-mat.stat-mech Statistical Mechanics]
toXiv_bot_toot

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:26:40

Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training
Jianyuan Feng, Guangzheng Li, Yangfei Xu
arxiv.org/abs/2506.16833

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 17:33:23

Replaced article(s) found for cond-mat.stat-mech. arxiv.org/list/cond-mat.stat-m
[1/1]:
- Lectures on Statistical Mechanics
Allan N. Kaufman, Bruce I. Cohen, Alain J. Brizard

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:23:10

Learning Magnitude Distribution of Sound Fields via Conditioned Autoencoder
Shoichi Koyama, Kenji Ishizuka
arxiv.org/abs/2506.16729 …

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:21:30

Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ
Yunkee Chae, Kyogu Lee
arxiv.org/abs/2506.16538

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 10:41:10

Nonequilibrium orders in parametrically driven field theories
Carl Philipp Zelle, Romain Daviet, Andrew J. Millis, Sebastian Diehl
arxiv.org/abs/2506.18622

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:14:30

AeroGPT: Leveraging Large-Scale Audio Model for Aero-Engine Bearing Fault Diagnosis
Jiale Liu, Dandan Peng, Huan Wang, Chenyu Liu, Yan-Fu Li, Min Xie
arxiv.org/abs/2506.16225

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 10:29:00

Stability of universal properties against perturbations of the Markov Chain Monte Carlo algorithm
Matteo Bacci, Claudio Bonati
arxiv.org/abs/2506.18561

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:08:00

Improved Intelligibility of Dysarthric Speech using Conditional Flow Matching
Shoutrik Das, Nishant Singh, Arjun Gangwar, S Umesh
arxiv.org/abs/2506.16127

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 10:24:20

Two dimensional Coulomb gas in a non-conservative trap
David S. Dean, Rashed Aljasmi, Satya N. Majumdar
arxiv.org/abs/2506.18551

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 09:20:20

VS-Singer: Vision-Guided Stereo Singing Voice Synthesis with Consistency Schrödinger Bridge
Zijing Zhao, Kai Wang, Hao Huang, Ying Hu, Liang He, Jichen Yang
arxiv.org/abs/2506.16020

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 09:54:00

An Extended Model of Fractional-Dimensional Space for Anisotropic Solids with Deformed Derivatives
José Weberszpil
arxiv.org/abs/2506.18127

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 09:08:30

Sonic4D: Spatial Audio Generation for Immersive 4D Scene Exploration
Siyi Xie, Hanxin Zhu, Tianyu He, Xin Li, Zhibo Chen
arxiv.org/abs/2506.15759

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 09:37:40

Non-algebraic first return probability of a stretched random walk near a convex boundary and its effect on adsorption
Daniil Fedotov, Sergei Nechaev
arxiv.org/abs/2506.17829

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 08:48:10

Explainable speech emotion recognition through attentive pooling: insights from attention-based temporal localization
Tahitoa Leygue (DIASI), Astrid Sabourin (DIASI), Christian Bolzmacher (DIASI), Sylvain Bouchigny (DIASI), Margarita Anastassova (DIASI), Quoc-Cuong Pham (DIASI)
arxiv.org/abs/2506.15754

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 09:01:09

Wealth Thermalization Hypothesis
Klaus M. Frahm, Dima L. Shepelyansky
arxiv.org/abs/2506.17720 arxiv.org/pdf/2506.177…

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 08:15:20

[2025-06-23 Mon (UTC), 10 new articles found for cs.SD Sound]
toXiv_bot_toot

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 08:57:10

The free propagator of strongly anisotropic systems with free surfaces
M. A. Shpot
arxiv.org/abs/2506.17595 arxiv.org…

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 17:00:18

Replaced article(s) found for cs.SD. arxiv.org/list/cs.SD/new
[1/1]:
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative ...
Dominik Wagner, Ilja Baumann, Tobias Bocklet

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 08:49:50

Finite time path field theory and a new type of universal quantum spin chain quench behaviour
Domagoj Kuić, Alemka Knapp, Diana Šaponja-Milutinović
arxiv.org/abs/2506.17402

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 11:11:50

USAD: Universal Speech and Audio Representation via Distillation
Heng-Jui Chang, Saurabhchand Bhati, James Glass, Alexander H. Liu
arxiv.org/abs/2506.18843

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-06-24 08:34:20

[2025-06-24 Tue (UTC), 8 new articles found for cond-mat.stat-mech Statistical Mechanics]
toXiv_bot_toot

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 11:03:20

MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners
Fang-Duo Tsai, Shih-Lun Wu, Weijaw Lee, Sheng-Ping Yang, Bo-Rui Chen, Hao-Chung Cheng, Yi-Hsuan Yang
arxiv.org/abs/2506.18729

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 11:01:00

Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement
Nasser-Eddine Monir, Paul Magron, Romain Serizel
arxiv.org/abs/2506.18714

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:58:20

Evaluating Multichannel Speech Enhancement Algorithms at the Phoneme Scale Across Genders
Nasser-Eddine Monir, Paul Magron, Romain Serizel
arxiv.org/abs/2506.18691

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:57:20

TCDiff : An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography
Yuqin Dai, Wanlu Zhu, Ronghui Li, Xiu Li, Zhenyu Zhang, Jun Li, Jian Yang
arxiv.org/abs/2506.18671

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:55:20

Smooth Operators: LLMs Translating Imperfect Hints into Disfluency-Rich Transcripts
Duygu Altinok
arxiv.org/abs/2506.18510

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:43:20

AI-Generated Song Detection via Lyrics Transcripts
Markus Frohmann, Elena V. Epure, Gabriel Meseguer-Brocal, Markus Schedl, Romain Hennequin
arxiv.org/abs/2506.18488

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:39:10

Selecting N-lowest scores for training MOS prediction models
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
arxiv.org/abs/2506.18326

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:38:50

Large-Scale Training Data Attribution for Music Generative Models via Unlearning
Woosung Choi, Junghyun Koo, Kin Wai Cheuk, Joan Serrà, Marco A. Martínez-Ramírez, Yukara Ikemiya, Naoki Murata, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji
arxiv.org/abs/2506.18312

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:35:00

Rethinking Mean Opinion Scores in Speech Quality Assessment: Aggregation through Quantized Distribution Fitting
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
arxiv.org/abs/2506.18307

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:26:40

JIS: A Speech Corpus of Japanese Idol Speakers with Various Speaking Styles
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
arxiv.org/abs/2506.18296

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:26:20

Human Voice is Unique
Rita Singh, Bhiksha Raj
arxiv.org/abs/2506.18182 arxiv.org/pdf/2506.18182

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:19:20

GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models
Julien Guinot, Elio Quinton, György Fazekas
arxiv.org/abs/2506.17886

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 10:13:20

CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning
Angelos-Nikolaos Kanatas, Charilaos Papaioannou, Alexandros Potamianos
arxiv.org/abs/2506.17818

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 09:24:00

SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding
Julien Guinot, Alain Riou, Elio Quinton, György Fazekas
arxiv.org/abs/2506.17815

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 09:18:00

Algebraic Structures in Microtonal Music
Veronica Flynn, Carmen Rovi
arxiv.org/abs/2506.17778 arxiv.org/pdf/2506.1777…

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 09:06:40

From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training
Mingyang Yao, Ke Chen
arxiv.org/abs/2506.17497

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 08:23:40

Adaptive Control Attention Network for Underwater Acoustic Localization and Domain Adaptation
Quoc Thinh Vo, Joe Woods, Priontu Chowdhury, David K. Han
arxiv.org/abs/2506.17409

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 08:17:20

Zero-Shot Cognitive Impairment Detection from Speech Using AudioLLM
Mostafa Shahin, Beena Ahmed, Julien Epps
arxiv.org/abs/2506.17351

@arXiv_csSD_bot@mastoxiv.page
2025-06-24 08:10:30

[2025-06-24 Tue (UTC), 19 new articles found for cs.SD Sound]
toXiv_bot_toot

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 16:10:10

Replaced article(s) found for cs.SD. arxiv.org/list/cs.SD/new
[1/1]:
- TAPS: Throat and Acoustic Paired Speech Dataset for Deep Learning-Based Speech Enhancement
Yunsik Kim, Yonghun Song, Yoonyoung Chung