Polyphonic sound event detection using transposed convolutional recurrent neural network

dc.contributor.author: Chatterjee, C.C.
dc.contributor.author: Mulimani, M.
dc.contributor.author: Koolagudi, S.G.
dc.date.accessioned: 2026-02-06T06:36:48Z
dc.date.issued: 2020
dc.description.abstract: In this paper, we propose a Transposed Convolutional Recurrent Neural Network (TCRNN) architecture for polyphonic sound event recognition. A transposed convolution layer, which carries out a regular convolution operation but reverses the spatial transformation, is combined with a bidirectional Recurrent Neural Network (RNN) to form the TCRNN. Instead of the traditional mel-spectrogram features, the proposed methodology incorporates mel-IFgram (Instantaneous Frequency spectrogram) features. The performance of the proposed approach is evaluated on sound events of the publicly available TUT-SED 2016 and Joint sound scene and polyphonic sound event recognition datasets. Results show that the proposed approach outperforms state-of-the-art methods. © 2020 Institute of Electrical and Electronics Engineers Inc. All rights reserved.
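The abstract describes a transposed convolution as a regular convolution whose spatial transformation is reversed, i.e. it upsamples a feature map rather than downsampling it. A minimal NumPy sketch of that operation is shown below; it is only an illustration of the layer type, not the authors' implementation, and the kernel size and stride are chosen arbitrarily:

```python
import numpy as np

def conv_transpose2d(x, kernel, stride=2):
    """Transposed 2D convolution: each input value "stamps" a scaled copy
    of the kernel onto the output, growing the spatial size by ~stride x
    (the reverse of a strided convolution's downsampling)."""
    h, w = x.shape
    kh, kw = kernel.shape
    out_h = (h - 1) * stride + kh
    out_w = (w - 1) * stride + kw
    out = np.zeros((out_h, out_w))
    for i in range(h):
        for j in range(w):
            out[i * stride:i * stride + kh,
                j * stride:j * stride + kw] += x[i, j] * kernel
    return out

# A 4x4 feature map upsampled with a 3x3 kernel and stride 2:
# output size is (4 - 1) * 2 + 3 = 9 along each axis.
y = conv_transpose2d(np.ones((4, 4)), np.ones((3, 3)))
print(y.shape)  # (9, 9)
```

In the paper's architecture, the output of such layers is fed to a bidirectional RNN to produce frame-level event activity predictions.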
dc.identifier.citation: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2020, Vol. 2020-May, p. 661-665
dc.identifier.issn: 0736-7791; 1520-6149
dc.identifier.uri: https://doi.org/10.1109/ICASSP40776.2020.9054628
dc.identifier.uri: https://idr.nitk.ac.in/handle/123456789/30696
dc.publisher: Institute of Electrical and Electronics Engineers Inc.
dc.subject: Convolutional Neural Networks (CNN)
dc.subject: Deep Neural Networks (DNN)
dc.subject: Instantaneous Frequency spectrogram (IFgram)
dc.subject: Recurrent Neural Networks (RNN)
dc.subject: Sound Event Detection (SED)
dc.subject: Transposed CNN (TCNN)
dc.title: Polyphonic sound event detection using transposed convolutional recurrent neural network
