Acoustic scene classification using projection Kervolutional neural network

No Thumbnail Available

Date

2023

Journal Title

Journal ISSN

Volume Title

Publisher

Springer

Abstract

In this paper, a novel Projection Kervolutional Neural Network (ProKNN) is proposed for Acoustic Scene Classification (ASC). ProKNN is a combination of two special filters known as the left and right projection layers and Kervolutional Neural Network (KNN). KNN replaces the linearity of the Convolutional Neural Network (CNN) with a non-linear polynomial kernel. We extend the ProKNN to learn from the features of two channels of audio recordings in the initial stage. The performance of the ProKNN is evaluated on the two publicly available datasets: TUT Urban Acoustic Scenes 2018 and TUT Urban Acoustic Scenes Mobile 2018 development datasets. Results show that the proposed ProKNN outperforms the existing systems with an absolute improvement of accuracy of 8% and 14% on TUT Urban Acoustic Scenes 2018 and TUT Urban Acoustic Scenes Mobile 2018 development datasets respectively, as compared to the baseline model of Detection and Classification of Acoustic Scene and Events (DCASE) - 2018 challenge. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

Description

Keywords

Audio recordings, Classification (of information), Convolutional neural networks, Acoustic scene classification, Convolutional neural network, Kervolutional neural network, Neural-networks, Non linear, Projection kervolutional neural network, Projection layer, Scene classification, Urban acoustics, Multilayer neural networks

Citation

Multimedia Tools and Applications, 2023, 82, 6, pp. 9447-9457

Collections

Endorsement

Review

Supplemented By

Referenced By