NFL news roundup: Texans, guard Wyatt Teller agree to terms on two-year deal https://www.nfl.com/news/nfl-news-roundup-latest-league-updates-from-tuesday-march-17
- AWS Textract: $0.0015 (Detect Text) / $0.015 (Tables/Forms)
- Azure Doc Intelligence: $0.00125 (Read) / $0.01 (structured)
That's ~23x cheaper than cloud OCR for basic text extraction — and up to 140x cheaper compared to structured extraction tiers. 📊
Processing 1,000 PDFs (32,000 pages): $2.32 vs ~$48 (cloud basic OCR) vs $320 (cloud structured).
The Substance III 🧪
某种物质 III 🧪
📷 Nikon F4E
🎞️ Harman Switch Azure (FF)
If you like my work, Support by buying me a coffee or a roll of film from PayPal https://www.paypal.com/paypalme/ydcdingsite
Eagles land speedy WR Brown, source says https://www.espn.com/nfl/story/_/id/48231996/source-eagles-wr-hollywood-brown-agree-one-year-deal
Sources: Amazon is in advanced talks to acquire satellite operator Globalstar in a deal that could be announced as soon as Tuesday; GSAT jumps 15% pre-market (Bloomberg)
https://www.bloomberg.com/news/articles/2026-04-14/am…
Digital unterstützte Parkraumkontrolle: Von der Vorbereitung zum Regelbetrieb - #Scancars
von @…
Replaced article(s) found for eess.AS. https://arxiv.org/list/eess.AS/new
[1/1]:
- Unifying Diarization, Separation, and ASR with Multi-Speaker Encoder
Muhammad Shakeel, Yui Sudo, Yifan Peng, Chyi-Jiunn Lin, Shinji Watanabe
https://arxiv.org/abs/2508.20474 https://mastoxiv.page/@arXiv_eessAS_bot/115110974009150613
- CALM: Joint Contextual Acoustic-Linguistic Modeling for Personalization of Multi-Speaker ASR
Muhammad Shakeel, Yosuke Fukumoto, Chikara Maeda, Chyi-Jiunn Lin, Shinji Watanabe
https://arxiv.org/abs/2601.22792 https://mastoxiv.page/@arXiv_eessAS_bot/116000207024295325
- How Much Does Machine Identity Matter in Anomalous Sound Detection at Test Time?
Kevin Wilkinghoff, Keisuke Imoto, Zheng-Hua Tan
https://arxiv.org/abs/2602.16253 https://mastoxiv.page/@arXiv_eessAS_bot/116096185732811365
- LMU-Based Sequential Learning and Posterior Ensemble Fusion for Cross-Domain Infant Cry Classific...
Niloofar Jazaeri, Hilmi R. Dajani, Marco Janeczek, Martin Bouchard
https://arxiv.org/abs/2603.02245 https://mastoxiv.page/@arXiv_eessAS_bot/116169771215037748
- Adapting a Text-to-Audio Model for Room Impulse Response Generation
Kirak Kim, Sungyoung Kim
https://arxiv.org/abs/2603.09708 https://mastoxiv.page/@arXiv_eessAS_bot/116209762413602825
- Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-spectrograms
Heehwan Wang, Joonwoo Kwon, Sooyoung Kim, Jungwoo Seo, Shinjae Yoo, Yuewei Lin, Jiook Cha
https://arxiv.org/abs/2411.15913 https://mastoxiv.page/@arXiv_csSD_bot/113548024475383386
- DeePen: Penetration Testing for Audio Deepfake Detection
M\"uller, Kawa, Stan, Doan, Jung, Choong, Sperl, B\"ottinger
https://arxiv.org/abs/2502.20427 https://mastoxiv.page/@arXiv_csCR_bot/114097333876265997
- Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition
Yuu Jinnai
https://arxiv.org/abs/2510.19471 https://mastoxiv.page/@arXiv_csCL_bot/115422969877240889
- Aliasing-Free Neural Audio Synthesis
Yicheng Gu, Junan Zhang, Chaoren Wang, Jerry Li, Zhizheng Wu, Lauri Juvela
https://arxiv.org/abs/2512.20211 https://mastoxiv.page/@arXiv_csSD_bot/115773521971327576
- TiCo: Time-Controllable Spoken Dialogue Model
Kai-Wei Chang, Wei-Chih Chen, En-Pei Hu, Hung-yi Lee, James Glass
https://arxiv.org/abs/2603.22267 https://mastoxiv.page/@arXiv_csCL_bot/116283643505371784
toXiv_bot_toot
The Substance II 🧪
某种物质 II 🧪
📷 Nikon F4E
🎞️ Harman Switch Azure (FF)
If you like my work, Support by buying me a coffee or a roll of film from PayPal https://www.paypal.com/paypalme/ydcdingsite
Eagles, TE Dallas Goedert agree to terms on one-year deal https://www.nfl.com/news/eagles-te-dallas-goedert-agree-to-terms-on-one-year-deal
The Substance 🧪
某种物质 🧪
📷 Nikon F4E
🎞️ Harman Switch Azure (FF)
If you like my work, Support by buying me a coffee or a roll of film from PayPal https://www.paypal.com/paypalme/ydcdingsite