Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modelingWenmiao Gao, Yang Xiaohttps://arxiv.org/abs/2506.13455
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modelingPre-training methods have achieved significant performance improvements in sound event localization and detection (SELD) tasks, but existing Transformer-based models suffer from high computational complexity. In this work, we propose a stereo sound event localization and detection system based on pre-trained PSELDnet and bidirectional Mamba sequence modeling. We replace the Conformer module with a BiMamba module and introduce asymmetric convolutions to more effectively model the spatiotemporal …