Robust acoustic event classification using bag-of-visual-words

No Thumbnail Available

Date

2018

Authors

Mulimani, M.
Koolagudi, S.G.

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

This paper presents a novel Bag-of-Visual-Words (BoVW) approach, to represent the grayscale spectrograms of acoustic events. Such, BoVW representations are referred as histograms of visual features, used for Acoustic Event Classification (AEC). Further, Chi-square distance between histograms of visual features evaluated, which generates kernel to Support Vector Machines (Chi-square SVM) classifier. Evaluation of the proposed histograms of visual features together with Chi-square SVM classifier is conducted on different categories of acoustic events from UPC-TALP corpora in clean and different noise conditions. Results show that proposed approach is more robust to noise and achieves improved recognition accuracy compared to other methods. � 2018 International Speech Communication Association. All rights reserved.

Description

Keywords

Citation

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, Vol.2018-September, , pp.3319-3322

Endorsement

Review

Supplemented By

Referenced By