Efficient audio segmentation in soccer videos

Raghuram, M.A.; Chavan, N.R.; Koolagudi, S.G.; Ramteke, P.B.

dc.contributor.author	Raghuram, M.A.
dc.contributor.author	Chavan, N.R.
dc.contributor.author	Koolagudi, S.G.
dc.contributor.author	Ramteke, P.B.
dc.date.accessioned	2026-02-06T06:39:02Z
dc.date.issued	2016
dc.description.abstract	Identifying different audio segments in videos is the first step for many important tasks such as event detection and speech transcription. Approaches using Mel-frequency cepstral coefficients (MFCCs) with Gaussian mixture models (GMMs) and hidden Markov models (HMMs) perform reasonably well in stationary conditions but do not scale to a broad range of environmental conditions. This paper focuses on segmenting the audio in broadcast soccer videos into classes such as Silence, Speech Only, Speech Over Crowd, Crowd Only and Excited, using an alternative feature set that is simple as well as robust to changes in environmental conditions. Support Vector Machines (SVMs), Neural Networks and Random Forest are used for classification. The accuracies achieved with SVMs, Neural Networks and Random Forest are 83.80%, 86.07%, and 88.35%, respectively. The proposed features with the Random Forest classifier achieve better accuracy than the other classifiers. © 2016 IEEE.
dc.identifier.citation	Canadian Conference on Electrical and Computer Engineering, 2016, Vol.2016-October
dc.identifier.issn	0840-7789
dc.identifier.uri	https://doi.org/10.1109/CCECE.2016.7726616
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/32029
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.subject	Audio Segmentation
dc.subject	Machine learning
dc.subject	Random Forest Classifier
dc.subject	Soccer Videos
dc.subject	Spectral Features
dc.title	Efficient audio segmentation in soccer videos

Collections

Conference Papers