Efficient Scaling for LLM-based ASR
Bingshen Mu, Yiwen Shao, Kun Wei, Dong Yu, Lei Xie
https://arxiv.org/abs/2508.04096 https://arxiv.org/pdf/2508.04096
Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems
Bo Ren, Yu Shi, Jinyu Li
https://arxiv.org/abs/2506.06252 https://
Satellite-based Rabi rice paddy field mapping in India: a case study on Telangana state
Prashanth Reddy Putta (University of Pavia), Fabio Dell'Acqua (University of Pavia)
https://arxiv.org/abs/2507.05189
Deepfakes in Criminal Investigations: Interdisciplinary Research Directions for CMC Research
Lorenz Meinen, Astrid Schomäcker, Stefanie Wiedemann, Markus Hartmann, Timo Speith, Lena Kästner, Niklas Kühl, Christian Rückert
https://arxiv.org/abs/2507.03457
A Deep Unfolding Framework for Diffractive Snapshot Spectral Imaging
Zhengyue Zhuge, Jiahui Xu, Shiqi Chen, Hao Xu, Yueting Chen, Zhihai Xu, Huajun Feng
https://arxiv.org/abs/2507.04622
LUST: A Multi-Modal Framework with Hierarchical LLM-based Scoring for Learned Thematic Significance Tracking in Multimedia Content
Anderson de Lima Luiz
https://arxiv.org/abs/2508.04353
Mind the Gap: From Resolving Theoretical Foundations of Chiral(ity)-Induced Spin Selectivity to Pioneering Implementations in Quantum Sensing
Yan Xi Foo, Aisha Kermiche, Farhan T. Chowdhury, Clarice D. Aiello, Luke D. Smith
https://arxiv.org/abs/2508.05611
SAGE-HLS: Syntax-Aware AST-Guided LLM for High-Level Synthesis Code Generation
M Zafir Sadik Khan, Nowfel Mashnoor, Mohammad Akyash, Kimia Azar, Hadi Kamali
https://arxiv.org/abs/2508.03558
JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering
Renmiao Chen, Shiyao Cui, Xuancheng Huang, Chengwei Pan, Victor Shea-Jay Huang, QingLin Zhang, Xuan Ouyang, Zhexin Zhang, Hongning Wang, Minlie Huang
https://arxiv.org/abs/2508.05087
Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models
Yuke Lin, Ming Cheng, Ze Li, Beilong Tang, Ming Li
https://arxiv.org/abs/2506.05796