Pandey, G.Koolagudi, S.G.2026-02-032025Circuits, Systems, and Signal Processing, 2025, , , pp. -0278081Xhttps://doi.org/10.1007/s00034-025-03314-3https://idr.nitk.ac.in/handle/123456789/20631Acoustic event detection (AED) or sound event detection (SED) is a problem that focuses on automatically detecting acoustic events in an audio recording along with their onset and offset times. Rare acoustic event detection in AED is a challenging problem. Rare AED aims to detect rare but significant sound events in an audio signal. Traditional methods used for SED often struggle to accurately detect rare sound events due to their infrequent occurrence and diverse characteristics. This paper introduces novel features named as multi-resolution cochleagrams (MRCGs) for rare SED tasks. Different cochleagrams with different resolutions are extracted from the audio recording and stacked to get the MRCG feature vector. The equivalent rectangular bandwidth (ERB) scale used in the cochleagram simulates the human auditory filter. The classifier used is a convolutional recurrent neural network (CRNN) embedded with an attention module. This work considers the Task 2 DCASE 2017 dataset for detecting rare sound events. Results show that the proposed MRCG and CRNN with attention combination improves the performance. The proposed method achieved an average error rate of 0.11 and an average F1 score of 94.3%. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.Acoustic measuring instrumentsAudio acousticsAudio recordingsAuditionDeep neural networksFeature extractionGearsAcoustic event detectionsCochleagramDCASEDeep learningMulti-resolution cochleagramNeural-networksRare sound event detectionSound event detectionTime-frequency AnalysisConvolutionRare Sound Event Detection Using Multi-resolution Cochleagram Features and CRNN with Attention Mechanism