
2025-09-17 10:55:20
TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
Rohan Kumar, Jyothi Swaroopa Jinka, Ravi Kiran Sarvadevabhatla
https://arxiv.org/abs/2509.13151
TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
Rohan Kumar, Jyothi Swaroopa Jinka, Ravi Kiran Sarvadevabhatla
https://arxiv.org/abs/2509.13151
RawTFNet: A Lightweight CNN Architecture for Speech Anti-spoofing
Yang Xiao, Ting Dang, Rohan Kumar Das
https://arxiv.org/abs/2507.08227 https://
ESDD 2026: Environmental Sound Deepfake Detection Challenge Evaluation Plan
Han Yin, Yang Xiao, Rohan Kumar Das, Jisheng Bai, Ting Dang
https://arxiv.org/abs/2508.04529 https://…
Machine-Learning-Assisted Photonic Device Development: A Multiscale Approach from Theory to Characterization
Yuheng Chen, Alexander Montes McNeil, Taehyuk Park, Blake A. Wilson, Vaishnavi Iyer, Michael Bezick, Jae-Ik Choi, Rohan Ojha, Pravin Mahendran, Daksh Kumar Singh, Geetika Chitturi, Peigang Chen, Trang Do, Alexander V. Kildishev, Vladimir M. Shalaev, Michael Moebius, Wenshan Cai, Yongmin Liu, Alexandra Boltasseva
Multilingual Source Tracing of Speech Deepfakes: A First Benchmark
Xi Xuan, Yang Xiao, Rohan Kumar Das, Tomi Kinnunen
https://arxiv.org/abs/2508.04143 https://
Replaced article(s) found for eess.AS. https://arxiv.org/list/eess.AS/new
[1/1]:
- DG-SED: Domain Generalization for Sound Event Detection with Heterogeneous Training Data
Yang Xiao, Han Yin, Jisheng Bai, Rohan Kumar Das