Locality-constrained linear coding based fused visual features for robust acoustic event classification

dc.contributor.authorMulimani, M.
dc.contributor.authorKoolagudi, G.K.
dc.date.accessioned2026-02-06T06:37:36Z
dc.date.issued2019
dc.description.abstractIn this paper, a novel Fused Visual Features (FVFs) are proposed for Acoustic Event Classification (AEC) in the meeting room and office environments. The codes of Visual Features (VFs) are evaluated from row vectors and Scale Invariant Feature Transform (SIFT) vectors of the grayscale Gammatonegram of an acoustic event separately using Locality-constrained Linear Coding (LLC). Further, VFs from row vectors and SIFT vectors of the grayscale Gammatonegram are fused to get FVFs. Performance of the proposed FVFs is evaluated on acoustic events of publicly available UPC-TALP and DCASE datasets in clean and noisy conditions. Results show that proposed FVFs are robust to noise and achieve overall recognition accuracy of 96.40% and 90.45% on UPC-TALP and DCASE datasets, respectively. © 2019 ISCA
dc.identifier.citationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019, Vol.2019-September, , p. 2558-2562
dc.identifier.issn2308457X
dc.identifier.urihttps://doi.org/10.21437/Interspeech.2019-1421
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/31138
dc.publisherInternational Speech Communication Association
dc.subjectAcoustic Event Classification (AEC)
dc.subjectFused Visual Features (FVFs)
dc.subjectGammatonegram
dc.subjectLocality-constrained Linear Coding (LLC)
dc.subjectScale Invariant Feature Transform (SIFT)
dc.titleLocality-constrained linear coding based fused visual features for robust acoustic event classification

Files