Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_qbioQM_bot@mastoxiv.page
2025-07-24 12:53:58

Replaced article(s) found for q-bio.QM. arxiv.org/list/q-bio.QM/new
[1/1]:
- Comparative analysis of computational approaches for predicting Transthyretin (TTR) transcription...
Mariya L. Ivanova, Nicola Russo, Gueorgui Mihaylov, Konstantin Nikolic

@arXiv_eessAS_bot@mastoxiv.page
2025-07-24 08:00:59

Towards Robust Speech Recognition for Jamaican Patois Music Transcription
Jordan Madden, Matthew Stone, Dimitri Johnson, Daniel Geddez
arxiv.org/abs/2507.16834

@arXiv_physicshistph_bot@mastoxiv.page
2025-07-24 08:50:00

Slow neutrons in Palermo: a forgotten conference by Enrico Fermi
Emanuele Goldoni, Ledo Stefanini
arxiv.org/abs/2507.16928 arxiv.org/pdf/25…

@arXiv_csSD_bot@mastoxiv.page
2025-08-20 07:44:00

Is Transfer Learning Necessary for Violin Transcription?
Yueh-Po Peng, Ting-Kang Wang, Li Su, Vincent K. M. Cheung
arxiv.org/abs/2508.13516

@arXiv_csCL_bot@mastoxiv.page
2025-08-20 07:46:40

Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT
Zeeshan Ahmed, Frank Seide, Niko Moritz, Ju Lin, Ruiming Xie, Simone Merello, Zhe Liu, Christian Fuegen
arxiv.org/abs/2508.13358

@arXiv_csCC_bot@mastoxiv.page
2025-08-20 08:25:00

Analog computation with transcriptional networks
David Doty, Mina Latifi, David Soloveichick
arxiv.org/abs/2508.14017 arxiv.org/pdf/2508.14…

@arXiv_mathNA_bot@mastoxiv.page
2025-07-18 07:56:52

Keep the beat going: Automatic drum transcription with momentum
Alisha L. Foster, Robert J. Webber
arxiv.org/abs/2507.12596

@arXiv_csRO_bot@mastoxiv.page
2025-08-18 09:28:50

A Comparative Study of Floating-Base Space Parameterizations for Agile Whole-Body Motion Planning
Evangelos Tsiatsianas, Chairi Kiourt, Konstantinos Chatzilygeroudis
arxiv.org/abs/2508.11520

@arXiv_csSD_bot@mastoxiv.page
2025-06-19 08:35:53

Exploiting Music Source Separation for Automatic Lyrics Transcription with Whisper
Jaza Syed, Ivan Meresman Higgs, Ond\v{r}ej C\'ifka, Mark Sandler
arxiv.org/abs/2506.15514

@arXiv_csHC_bot@mastoxiv.page
2025-07-08 12:28:30

Dude, where's my utterance? Evaluating the effects of automatic segmentation and transcription on CPS detection
Videep Venkatesha, Mariah Bradford, Nathaniel Blanchard
arxiv.org/abs/2507.04454

@arXiv_csSD_bot@mastoxiv.page
2025-06-18 08:45:12

Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription
Anna Hamberger, Sebastian Murgul, Jochen Schmidt, Michael Heizmann
arxiv.org/abs/2506.14223

@arXiv_csDL_bot@mastoxiv.page
2025-07-08 07:49:20

An HTR-LLM Workflow for High-Accuracy Transcription and Analysis of Abbreviated Latin Court Hand
Joshua D. Isom
arxiv.org/abs/2507.04132

@arXiv_csCL_bot@mastoxiv.page
2025-07-14 09:59:12

The Impact of Automatic Speech Transcription on Speaker Attribution
Cristina Aggazzotti, Matthew Wiesner, Elizabeth Allyn Smith, Nicholas Andrews
arxiv.org/abs/2507.08660

@arXiv_csCY_bot@mastoxiv.page
2025-06-11 07:28:33

Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia
Katelyn Xiaoying Mei, Anna Seo Gyeong Choi, Hilke Schellmann, Mona Sloane, Allison Koenecke
arxiv.org/abs/2506.08846

@arXiv_eessAS_bot@mastoxiv.page
2025-08-20 10:32:06

Crosslisted article(s) found for eess.AS. arxiv.org/list/eess.AS/new
[1/1]:
- Is Transfer Learning Necessary for Violin Transcription?
Yueh-Po Peng, Ting-Kang Wang, Li Su, Vincent K. M. Cheung

@arXiv_csCL_bot@mastoxiv.page
2025-08-14 09:50:22

Assessing the Feasibility of Lightweight Whisper Models for Low-Resource Urdu Transcription
Abdul Rehman Antall, Naveed Akhtar
arxiv.org/abs/2508.09865

@arXiv_physicsbioph_bot@mastoxiv.page
2025-06-12 14:29:51

Replaced article(s) found for physics.bio-ph. arxiv.org/list/physics.bio-ph/
[1/1]:
Design principles of transcription factors with intrinsically disordered regions

@arXiv_csAI_bot@mastoxiv.page
2025-07-29 18:02:41

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[7/8]:
- Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering
Chowdhury, Aukkapinyo, Fujimura, Woo, Wasusatein, Ghourabi

@arXiv_csSD_bot@mastoxiv.page
2025-07-17 09:03:20

RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
Sungkyun Chang, Simon Dixon, Emmanouil Benetos
arxiv.org/abs/2507.12175

@arXiv_csFL_bot@mastoxiv.page
2025-07-01 07:34:53

Programmable Co-Transcriptional Splicing: Realizing Regular Languages via Hairpin Deletion
Da-Jung Cho, Szil\'ard Zsolt Fazekas, Shinnosuke Seki, Max Wiedenh\"oft
arxiv.org/abs/2506.23384

@arXiv_qbioMN_bot@mastoxiv.page
2025-07-08 09:45:50

Fast decisions with biophysically constrained gene promoter architectures
Tarek Tohme, Massimo Vergassola, Thierry Mora, Aleksandra M. Walczak
arxiv.org/abs/2507.03720

@arXiv_csSD_bot@mastoxiv.page
2025-08-12 10:35:23

Joint Transcription of Acoustic Guitar Strumming Directions and Chords
Sebastian Murgul, Johannes Schimper, Michael Heizmann
arxiv.org/abs/2508.07973

@arXiv_csSD_bot@mastoxiv.page
2025-06-16 08:04:09

Enabling automatic transcription of child-centered audio recordings from real-world environments
Daniil Kocharov, Okko R\"as\"anen
arxiv.org/abs/2506.11747

@arXiv_qbioQM_bot@mastoxiv.page
2025-06-03 07:58:28

Comparative analysis of computational approaches for predicting Transthyretin transcription activators and human dopamine D1 receptor antagonists
Mariya L. Ivanova, Nicola Russo, Konstantin Nikolic
arxiv.org/abs/2506.01137

@arXiv_eessAS_bot@mastoxiv.page
2025-08-12 10:12:53

Score-Informed BiLSTM Correction for Refining MIDI Velocity in Automatic Piano Transcription
Zhanhong He (David), Roberto Togneri (David), Defeng (David), Huang
arxiv.org/abs/2508.07757

@arXiv_physicsbioph_bot@mastoxiv.page
2025-07-04 09:18:51

Modelling transcriptional silencing and its coupling to 3D genome organisation
Massimiliano Semeraro, Giuseppe Negro, Davide Marenduzzo, Giada Forte
arxiv.org/abs/2507.02150

@arXiv_csSD_bot@mastoxiv.page
2025-08-12 10:37:23

Exploring Procedural Data Generation for Automatic Acoustic Guitar Fingerpicking Transcription
Sebastian Murgul, Michael Heizmann
arxiv.org/abs/2508.07987

@arXiv_csHC_bot@mastoxiv.page
2025-07-25 07:41:11

A Custom-Built Ambient Scribe Reduces Cognitive Load and Documentation Burden for Telehealth Clinicians
Justin Morse, Kurt Gilbert, Kyle Shin, Rick Cooke, Peyton Rose, Jack Sullivan, Angelo Sisante
arxiv.org/abs/2507.17754

@arXiv_csSD_bot@mastoxiv.page
2025-06-17 10:10:41

Methods for pitch analysis in contemporary popular music: multiple pitches from harmonic tones in Vitalic's music
Emmanuel Deruty, David Meredith, Maarten Grachten, Pascal Arbez-Nicolas, Andreas Hasselholt J{\o}rgensen, Oliver S{\o}nderm{\o}lle Hansen, Magnus Stensli, Christian N{\o}rk{\ae}r Petersen
arxiv.org/abs/25…

@arXiv_csDL_bot@mastoxiv.page
2025-07-28 07:59:01

Comparing OCR Pipelines for Folkloristic Text Digitization
Octavian M. Machidon, Alina L. Machidon
arxiv.org/abs/2507.19092 arxiv.org/pdf/2…

@arXiv_csSD_bot@mastoxiv.page
2025-06-16 07:53:59

Assessing the Impact of Anisotropy in Neural Representations of Speech: A Case Study on Keyword Spotting
Guillaume Wisniewski (LLF - UMR7110), S\'everine Guillaume (LACITO), Clara Rosina Fern\'andez (LACITO)
arxiv.org/abs/2506.11096

@arXiv_physicsbioph_bot@mastoxiv.page
2025-07-02 08:38:40

Topological weight and structural diversity of polydisperse chromatin loop networks
Andrea Bonato, Enrico Carlon, Sergey Kitaev, Davide Marenduzzo, Enzo Orlandini
arxiv.org/abs/2507.00520

@arXiv_eessAS_bot@mastoxiv.page
2025-08-07 08:30:24

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness
Zongli Ye, Jiachen Lian, Akshaj Gupta, Xuanru Zhou, Krish Patel, Haodong Li, Hwi Joo Park, Chenxu Guo, Shuhe Li, Sam Wang, Cheol Jun Cho, Zoe Ezzes, Jet M. J. Vonk, Brittany T. Morin, Rian Bogley, Lisa Wauters, Zachary A. Miller, Maria Luisa Gorno-Tempini, Gopala Anumanchipalli

@arXiv_csSD_bot@mastoxiv.page
2025-08-15 09:07:22

Motive-level Analysis of Form-functions Association in Korean Folk song
Danbinaerin Han, Dasaem Jeong, Juhan Nam
arxiv.org/abs/2508.10472 a…

@arXiv_csSD_bot@mastoxiv.page
2025-07-10 08:13:31

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation
Wenxiang Guo, Yu Zhang, Changhao Pan, Zhiyuan Zhu, Ruiqi Li, Zhetao Chen, Wenhao Xu, Fei Wu, Zhou Zhao
arxiv.org/abs/2507.06670

@arXiv_eessAS_bot@mastoxiv.page
2025-08-12 09:39:03

A Survey on Non-Intrusive ASR Refinement: From Output-Level Correction to Full-Model Distillation
Mohammad Reza Peyghan, Fatemeh Rajabi, Saman Soleimani Roudi, Saeedreza Zouashkiani, Sajjad Amini, Shahrokh Ghaemmaghami
arxiv.org/abs/2508.07285

@arXiv_csSD_bot@mastoxiv.page
2025-08-08 09:07:32

SPGISpeech 2.0: Transcribed multi-speaker financial audio for speaker-tagged transcription
Raymond Grossman, Taejin Park, Kunal Dhawan, Andrew Titus, Sophia Zhi, Yulia Shchadilova, Weiqing Wang, Jagadeesh Balam, Boris Ginsburg
arxiv.org/abs/2508.05554

@arXiv_physicsbioph_bot@mastoxiv.page
2025-05-27 13:48:26

This arxiv.org/abs/2404.19158 has been replaced.
initial toot: mastoxiv.page/@arX…

@arXiv_eessAS_bot@mastoxiv.page
2025-06-04 07:34:43

Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss
Jiawen Huang, Felipe Sousa, Emir Demirel, Emmanouil Benetos, Igor Gadelha
arxiv.org/abs/2506.02339

@arXiv_csSD_bot@mastoxiv.page
2025-08-12 08:26:13

Whisfusion: Parallel ASR Decoding via a Diffusion Transformer
Taeyoun Kwon, Junhyuk Ahn, Taegeun Yun, Heeju Jwa, Yoonchae Choi, Siwon Park, Nam-Joon Kim, Jangchan Kim, Hyun Gon Ryu, Hyuk-Jae Lee
arxiv.org/abs/2508.07048

@arXiv_csSD_bot@mastoxiv.page
2025-08-11 09:37:39

Improved Dysarthric Speech to Text Conversion via TTS Personalization
P\'eter Mihajlik, \'Eva Sz\'ekely, Piroska Barta, M\'at\'e Soma K\'ad\'ar, Gergely Dobsinszki, L\'aszl\'o T\'oth
arxiv.org/abs/2508.06391

@arXiv_eessAS_bot@mastoxiv.page
2025-06-09 08:03:02

Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models
Yuke Lin, Ming Cheng, Ze Li, Beilong Tang, Ming Li
arxiv.org/abs/2506.05796

@arXiv_csSD_bot@mastoxiv.page
2025-08-11 09:33:19

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
Han Yin, Yafeng Chen, Chong Deng, Luyao Cheng, Hui Wang, Chao-Hong Tan, Qian Chen, Wen Wang, Xiangang Li
arxiv.org/abs/2508.06372

@arXiv_csSD_bot@mastoxiv.page
2025-07-11 09:29:21

Edge-ASR: Towards Low-Bit Quantization of Automatic Speech Recognition Models
Chen Feng, Yicheng Lin, Shaojie Zhuo, Chenzheng Su, Ramchalam Kinattinkara Ramakrishnan, Zhaocong Yuan, Xiaopeng Zhang
arxiv.org/abs/2507.07877

@arXiv_eessAS_bot@mastoxiv.page
2025-06-26 09:28:50

Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR
Ale\v{s} Pra\v{z}\'ak, Marie Kune\v{s}ov\'a, Josef Psutka
arxiv.org/abs/2506.20288

@arXiv_eessAS_bot@mastoxiv.page
2025-06-03 07:59:36

DNCASR: End-to-End Training for Speaker-Attributed ASR
Xianrui Zheng, Chao Zhang, Philip C. Woodland
arxiv.org/abs/2506.01916

@arXiv_csSD_bot@mastoxiv.page
2025-06-03 07:27:19

Improving Code Switching with Supervised Fine Tuning and GELU Adapters
Linh Pham
arxiv.org/abs/2506.00291 arxiv.org/p…

@arXiv_csSD_bot@mastoxiv.page
2025-07-02 08:27:39

Beat and Downbeat Tracking in Performance MIDI Using an End-to-End Transformer Architecture
Sebastian Murgul, Michael Heizmann
arxiv.org/abs/2507.00466

@arXiv_eessAS_bot@mastoxiv.page
2025-07-01 08:30:33

Investigating an Overfitting and Degeneration Phenomenon in Self-Supervised Multi-Pitch Estimation
Frank Cwitkowitz, Zhiyao Duan
arxiv.org/abs/2506.23371

@arXiv_csSD_bot@mastoxiv.page
2025-07-01 09:47:03

You Sound a Little Tense: L2 Tailored Clear TTS Using Durational Vowel Properties
Paige Tutt\"os\'i, H. Henny Yeung, Yue Wang, Jean-Julien Aucouturier, Angelica Lim
arxiv.org/abs/2506.23367