
2025-06-24 14:00:18
@… don't use it
@… don't use it
@… You have a bright future ahead of you!
The dove pair that nest in our eaves just successfully raised these two chicks. They were alone on the nest yesterday and were gone today. They raised a single chick on the same nest earlier this year. Very clean, we see the family return every year. #naturephotography #birdphotography
Various thi.ng updates, bug fixes, additions and new version of https://github.com/thi-ng/zig-thing/ — now fully compatible with current Zig v0.14.1
On a more diary/devlog note: I also updated several of my Zig based work-in-progress art pieces to the latest version (some of them not touc…
The refurbished sign (2006) for The Only Sea Foods Restaurant at 20 East Hastings Street.
#yvr #vancouver #Neon
„LLMs promise that you can be a part of a community and all the status and clout that comes with without putting in the work. And I do not see that there is a path to mend that foundational issue.“
https://tante.cc/2025/07/21/but-will-they/
🥳 New Kitten Release
• Improved Markdown parser
Kitten’s JavaScript tagged template strings (`kitten.html`) no longer fail to render as expected when interpolated values are used inside of Markdown where the Markdown render changes source order.
So, for example, the following will now work correctly, whereas, previously, the link source and link text would have been erroneously flipped:
kitten.html`
<markdown>
[${linkText}](${linkSource})
…
A new XML conversion process for mensural music encoding : CMME\_to\_MEI (via Verovio)
David Fiala (CESR), Laurent Pugin (KNAW), Marnix van Berchum (KNAW), Martha Thomae (NOVA), K\'evin Roger (CESR, UL, CRULH)
https://arxiv.org/abs/2507.15991
Replaced article(s) found for cond-mat.stat-mech. https://arxiv.org/list/cond-mat.stat-mech/new
[1/1]:
- Quantum Potts Models on the Sierpi\'nski Pyramid
Kr\v{c}m\'ar, Zelenayov\'a, Genzor, Caha, Rap\v{c}an, Nishino, Gendiar
A personal update, I’m going to be based in Europe for most of the next two months. I’m currently in Weymouth UK setting up a long term rental home for us (we got tired of random AirBnBs) close to my aging parents. My father is in a care home now, and my mother is adjusting to it…
Nonlinear Framework for Speech Bandwidth Extension
Tarikul Islam Tamiti, Nursad Mamun, Anomadarshi Barua
https://arxiv.org/abs/2507.15970 https://
Tensor network calculation of boundary and corner magnetization
Roman Krcmar, Jozef Genzor, Andrej Gendiar, Tomotoshi Nishino
https://arxiv.org/abs/2506.17194
[2025-07-23 Wed (UTC), 9 new articles found for cs.SD Sound]
toXiv_bot_toot
Steve Irwin died as he lived - with animals in his heart.
#ShamelesslyStolenFromTheOtherSite
Partition function for several Ising model interface structures
Alessio Squarcini, Piotr Nowakowski, Douglas B. Abraham, Anna Macio{\l}ek
https://arxiv.org/abs/2506.17170
Replaced article(s) found for cs.SD. https://arxiv.org/list/cs.SD/new
[1/1]:
- ReMi: A Random Recurrent Neural Network Approach to Music Production
Hugo Chateau-Laurent, Tara Vanhatalo, Wei-Tung Pan, Xavier Hinaut
from
https[:]//bsky.app/profile/sadiston.bsky.social/post/3kopghk5ljl23
#ShamelesslyStolenFromBlueSky
Phase Transition of the Ising Model on a 3-Dimensional Fractal Lattice
Jozef Genzor, Roman Kr\v{c}m\'ar, Hiroshi Ueda, Denis Kochan, Andrej Gendiar, Tomotoshi Nishino
https://arxiv.org/abs/2506.17053
SALM: Spatial Audio Language Model with Structured Embeddings for Understanding and Editing
Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Jun Yang
https://arxiv.org/abs/2507.16724
Mean-field and Monte Carlo Analysis of Multi-Species Dynamics of agents
Eduardo Velasco Stock, Roberto da Silva, Sebastian Gon\c{c}alves
https://arxiv.org/abs/2506.16717
TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Yuxuan He, Xiaoran Yang, Ningning Pan, Gongping Huang
https://arxiv.org/abs/2507.16564 ht…
Crystal Nucleation Kinetics and Mechanism: Influence of Interaction Potential
Porhouy Minh, Steven W. Hall, Ryan S. DeFever, Sapna Sarupria
https://arxiv.org/abs/2506.16541
Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Pengfei Cai, Yan Song, Qing Gu, Nan Jiang, Haoyu Song, Ian McLoughlin
https://arxiv.org/abs/2507.16343
Transfer-matrix approach to the Blume-Capel model on the triangular lattice
Dimitrios Mataragkas, Alexandros Vasilopoulos, Nikolaos G. Fytas, Dong-Hee Kim
https://arxiv.org/abs/2506.16483
Robust Bioacoustic Detection via Richly Labelled Synthetic Soundscape Augmentation
Kaspar Soltero, Tadeu Siqueira, Stefanie Gutschmidt
https://arxiv.org/abs/2507.16235
A General Framework for Linking Free and Forced Fluctuations via Koopmanism
Valerio Lucarini, Manuel Santos Gutierrez, John Moroney, Niccol\`o Zagli
https://arxiv.org/abs/2506.16446
LENS-DF: Deepfake Detection and Temporal Localization for Long-Form Noisy Speech
Xuechen Liu, Wanying Ge, Xin Wang, Junichi Yamagishi
https://arxiv.org/abs/2507.16220
Method of canonical transformations in the theory of quantum gases interacting with radiation
M. S. Bulakhov, A. S. Peletminskii, P. P. Kostrobij, I. A. Ryzha, Yu. V. Slyusarenko
https://arxiv.org/abs/2506.16439
LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhancement
Haoyin Yan, Jie Zhang, Chengqian Jiang, Shuang Zhang
https://arxiv.org/abs/2507.16190
Endoreversible Stirling cycles: plasma engines at maximal power
Gregory Behrendt, Sebastian Deffner
https://arxiv.org/abs/2506.16303 https://
SDBench: A Comprehensive Benchmark Suite for Speaker Diarization
Eduardo Pacheco, Atila Orhon, Berkin Durmus, Blaise Munyampirwa, Andrey Leonov
https://arxiv.org/abs/2507.16136
Correcting systematic errors in the likelihood optimization of underdamped Langevin models of molecular dynamics trajectories
David Daniel Girardier, Hadrien Vroylandt, Sara Bonella, Fabio Pietrucci
https://arxiv.org/abs/2506.16272
Audio-Vision Contrastive Learning for Phonological Class Recognition
Daiqi Liu, Tom\'as Arias-Vergara, Jana Hutter, Andreas Maier, Paula Andrea P\'erez-Toro
https://arxiv.org/abs/2507.17682
Microcanonical simulated annealing: Massively parallel Monte Carlo simulations with sporadic random-number generation
M. Bernaschi, L. A. Fernandez, I. Gonz\'alez-Adalid Pemart\'in, E. Marinari, V. Martin-Mayor, G. Parisi, F. Ricci-Tersenghi, J. J. Ruiz-Lorenzo, D. Yllanes
https://arxiv.org/abs/2506.16240
BoSS: Beyond-Semantic Speech
Qing Wang, Zehan Li, Hang Lv, Hongjie Chen, Yaodong Song, Jian Kang, Jie Lian, Jie Li, Yongxiang Li, Zhongjiang He, Xuelong Li
https://arxiv.org/abs/2507.17563
Validity of generalized Gibbs ensemble in a random matrix model with a global $\mathbb{Z}_2$-symmetry
Adway Kumar Das
https://arxiv.org/abs/2506.16176 http…
Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task
Milena Davudova, Ziyuan Cai, Valentina Giunchiglia, Dragos C. Gruia, Giulia Sanguedolce, Adam Hampshire, Fatemeh Geranmayeh
https://arxiv.org/abs/2507.17326
Microstates : Do the outliers worth
Olivier Sire
https://arxiv.org/abs/2506.16080 https://arxiv.org/pdf/2506.16080
On Temporal Guidance and Iterative Refinement in Audio Source Separation
Tobias Morocutti, Jonathan Greif, Paul Primus, Florian Schmid, Gerhard Widmer
https://arxiv.org/abs/2507.17297
Unifying renormalized and bare viscosity in two-dimensional molecular dynamics simulations
Kazuma Yokota, Masato Itami, Shin-ichi Sasa
https://arxiv.org/abs/2506.16002
Weak Supervision Techniques towards Enhanced ASR Models in Industry-level CRM Systems
Zhongsheng Wang, Sijie Wang, Jia Wang, Yung-I Liang, Yuxi Zhang, Jiamou Liu
https://arxiv.org/abs/2507.16843
Generalized Spectral Statistics in the Kicked Ising model
Divij Gupta, Brian Swingle
https://arxiv.org/abs/2506.15816 https://arxiv.o…
[2025-07-24 Thu (UTC), 5 new articles found for cs.SD Sound]
toXiv_bot_toot
Breakdown of the thermodynamic limit in quantum spin and dimer models
Jeet Shah, Laura Shou, Jeremy Shuler, Victor Galitski
https://arxiv.org/abs/2506.15769
Replaced article(s) found for cs.SD. https://arxiv.org/list/cs.SD/new
[1/1]:
- Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Gui...
Hussain, Neekhara, Yang, Casanova, Ghosh, Desta, Fejgin, Valle, Li
Two Types of Temporal Symmetry in the Laws of Nature
A. Y. Klimenko
https://arxiv.org/abs/2506.15730 https://arxiv.org/pdf/2506.15730…
Universal Music Representations? Evaluating Foundation Models on World Music Corpora
Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos
https://arxiv.org/abs/2506.17055
Thermodynamics and Legendre Duality in Optimal Networks
Amilcare Porporato, Shashank Kumar Anand, Salvatore Calabrese, Luca Ridolfi, Lamberto Rondoni
https://arxiv.org/abs/2506.15727
ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors
Junghyun Koo, Marco A. Martinez-Ramirez, Wei-Hsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji
https://arxiv.org/abs/2506.16889
[2025-06-23 Mon (UTC), 18 new articles found for cond-mat.stat-mech Statistical Mechanics]
toXiv_bot_toot
Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training
Jianyuan Feng, Guangzheng Li, Yangfei Xu
https://arxiv.org/abs/2506.16833
Replaced article(s) found for cond-mat.stat-mech. https://arxiv.org/list/cond-mat.stat-mech/new
[1/1]:
- Lectures on Statistical Mechanics
Allan N. Kaufman, Bruce I. Cohen, Alain J. Brizard
Learning Magnitude Distribution of Sound Fields via Conditioned Autoencoder
Shoichi Koyama, Kenji Ishizuka
https://arxiv.org/abs/2506.16729 https://…
Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ
Yunkee Chae, Kyogu Lee
https://arxiv.org/abs/2506.16538 https://
Nonequilibrium orders in parametrically driven field theories
Carl Philipp Zelle, Romain Daviet, Andrew J. Millis, Sebastian Diehl
https://arxiv.org/abs/2506.18622
AeroGPT: Leveraging Large-Scale Audio Model for Aero-Engine Bearing Fault Diagnosis
Jiale Liu, Dandan Peng, Huan Wang, Chenyu Liu, Yan-Fu Li, Min Xie
https://arxiv.org/abs/2506.16225
Stability of universal properties against perturbations of the Markov Chain Monte Carlo algorithm
Matteo Bacci, Claudio Bonati
https://arxiv.org/abs/2506.18561
Improved Intelligibility of Dysarthric Speech using Conditional Flow Matching
Shoutrik Das, Nishant Singh, Arjun Gangwar, S Umesh
https://arxiv.org/abs/2506.16127
Two dimensional Coulomb gas in a non-conservative trap
David S. Dean, Rashed Aljasmi, Satya N. Majumdar
https://arxiv.org/abs/2506.18551 https://
VS-Singer: Vision-Guided Stereo Singing Voice Synthesis with Consistency Schr\"odinger Bridge
Zijing Zhao, Kai Wang, Hao Huang, Ying Hu, Liang He, Jichen Yang
https://arxiv.org/abs/2506.16020
An Extended Model of Fractional-Dimensional Space for Anisotropic Solids with Deformed Derivatives
Jos\'e Weberszpil
https://arxiv.org/abs/2506.18127 h…
Sonic4D: Spatial Audio Generation for Immersive 4D Scene Exploration
Siyi Xie, Hanxin Zhu, Tianyu He, Xin Li, Zhibo Chen
https://arxiv.org/abs/2506.15759 h…
Non-algebraic first return probability of a stretched random walk near a convex boundary and its effect on adsorption
Daniil Fedotov, Sergei Nechaev
https://arxiv.org/abs/2506.17829
Explainable speech emotion recognition through attentive pooling: insights from attention-based temporal localization
Tahitoa Leygue (DIASI), Astrid Sabourin (DIASI), Christian Bolzmacher (DIASI), Sylvain Bouchigny (DIASI), Margarita Anastassova (DIASI), Quoc-Cuong Pham (DIASI)
https://arxiv.org/abs/2506.15754
Wealth Thermalization Hypothesis
Klaus M. Frahm, Dima L. Shepelyansky
https://arxiv.org/abs/2506.17720 https://arxiv.org/pdf/2506.177…
[2025-06-23 Mon (UTC), 10 new articles found for cs.SD Sound]
toXiv_bot_toot
The free propagator of strongly anisotropic systems with free surfaces
M. A. Shpot
https://arxiv.org/abs/2506.17595 https://arxiv.org…
Replaced article(s) found for cs.SD. https://arxiv.org/list/cs.SD/new
[1/1]:
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative ...
Dominik Wagner, Ilja Baumann, Tobias Bocklet
Finite time path field theory and a new type of universal quantum spin chain quench behaviour
Domagoj Kui\'c, Alemka Knapp, Diana \v{S}aponja-Milutinovi\'c
https://arxiv.org/abs/2506.17402
USAD: Universal Speech and Audio Representation via Distillation
Heng-Jui Chang, Saurabhchand Bhati, James Glass, Alexander H. Liu
https://arxiv.org/abs/2506.18843
[2025-06-24 Tue (UTC), 8 new articles found for cond-mat.stat-mech Statistical Mechanics]
toXiv_bot_toot
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners
Fang-Duo Tsai, Shih-Lun Wu, Weijaw Lee, Sheng-Ping Yang, Bo-Rui Chen, Hao-Chung Cheng, Yi-Hsuan Yang
https://arxiv.org/abs/2506.18729
Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement
Nasser-Eddine Monir, Paul Magron, Romain Serizel
https://arxiv.org/abs/2506.18714
Evaluating Multichannel Speech Enhancement Algorithms at the Phoneme Scale Across Genders
Nasser-Eddine Monir, Paul Magron, Romain Serizel
https://arxiv.org/abs/2506.18691
TCDiff : An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography
Yuqin Dai, Wanlu Zhu, Ronghui Li, Xiu Li, Zhenyu Zhang, Jun Li, Jian Yang
https://arxiv.org/abs/2506.18671
Smooth Operators: LLMs Translating Imperfect Hints into Disfluency-Rich Transcripts
Duygu Altinok
https://arxiv.org/abs/2506.18510 https://
AI-Generated Song Detection via Lyrics Transcripts
Markus Frohmann, Elena V. Epure, Gabriel Meseguer-Brocal, Markus Schedl, Romain Hennequin
https://arxiv.org/abs/2506.18488
Selecting N-lowest scores for training MOS prediction models
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
https://arxiv.org/abs/2506.18326 htt…
Large-Scale Training Data Attribution for Music Generative Models via Unlearning
Woosung Choi, Junghyun Koo, Kin Wai Cheuk, Joan Serr\`a, Marco A. Mart\'inez-Ram\'irez, Yukara Ikemiya, Naoki Murata, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji
https://arxiv.org/abs/2506.18312
Rethinking Mean Opinion Scores in Speech Quality Assessment: Aggregation through Quantized Distribution Fitting
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
https://arxiv.org/abs/2506.18307
JIS: A Speech Corpus of Japanese Idol Speakers with Various Speaking Styles
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko
https://arxiv.org/abs/2506.18296
Human Voice is Unique
Rita Singh, Bhiksha Raj
https://arxiv.org/abs/2506.18182 https://arxiv.org/pdf/2506.18182
GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models
Julien Guinot, Elio Quinton, Gy\"orgy Fazekas
https://arxiv.org/abs/2506.17886
CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning
Angelos-Nikolaos Kanatas, Charilaos Papaioannou, Alexandros Potamianos
https://arxiv.org/abs/2506.17818
SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding
Julien Guinot, Alain Riou, Elio Quinton, Gy\"orgy Fazekas
https://arxiv.org/abs/2506.17815
Algebraic Structures in Microtonal Music
Veronica Flynn, Carmen Rovi
https://arxiv.org/abs/2506.17778 https://arxiv.org/pdf/2506.1777…
From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training
Mingyang Yao, Ke Chen
https://arxiv.org/abs/2506.17497 ht…
Adaptive Control Attention Network for Underwater Acoustic Localization and Domain Adaptation
Quoc Thinh Vo, Joe Woods, Priontu Chowdhury, David K. Han
https://arxiv.org/abs/2506.17409
Zero-Shot Cognitive Impairment Detection from Speech Using AudioLLM
Mostafa Shahin, Beena Ahmed, Julien Epps
https://arxiv.org/abs/2506.17351 https://
[2025-06-24 Tue (UTC), 19 new articles found for cs.SD Sound]
toXiv_bot_toot
Replaced article(s) found for cs.SD. https://arxiv.org/list/cs.SD/new
[1/1]:
- TAPS: Throat and Acoustic Paired Speech Dataset for Deep Learning-Based Speech Enhancement
Yunsik Kim, Yonghun Song, Yoonyoung Chung