Robust acoustic event classification using bag-of-visual-words

dc.contributor.authorMulimani, M.
dc.contributor.authorKoolagudi, S.G.
dc.date.accessioned2026-02-06T06:38:24Z
dc.date.issued2018
dc.description.abstractThis paper presents a novel Bag-of-Visual-Words (BoVW) approach, to represent the grayscale spectrograms of acoustic events. Such, BoVW representations are referred as histograms of visual features, used for Acoustic Event Classification (AEC). Further, Chi-square distance between histograms of visual features evaluated, which generates kernel to Support Vector Machines (Chi-square SVM) classifier. Evaluation of the proposed histograms of visual features together with Chi-square SVM classifier is conducted on different categories of acoustic events from UPC-TALP corpora in clean and different noise conditions. Results show that proposed approach is more robust to noise and achieves improved recognition accuracy compared to other methods. © 2018 International Speech Communication Association. All rights reserved.
dc.identifier.citationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, Vol.2018-September, , p. 3319-3322
dc.identifier.issn2308457X
dc.identifier.urihttps://doi.org/10.21437/Interspeech.2018-1905
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/31628
dc.publisherInternational Speech Communication Association publication@isca-speech.org 4 Rue des Fauvettes - Lous Tourils Baixas 66390
dc.subjectAcoustic Event Classification (AEC)
dc.subjectBag-of-Visual-Words (BoVW)
dc.subjectChi-square kernel SVM
dc.subjectHistograms of visual features
dc.titleRobust acoustic event classification using bag-of-visual-words

Files