Feature selection and model optimization for semi-supervised speaker spotting

Chetupalli, S.R.; Gopalakrishnan, A.; Sreenivas, T.V.

Feature selection and model optimization for semi-supervised speaker spotting

dc.contributor.author	Chetupalli, S.R.
dc.contributor.author	Gopalakrishnan, A.
dc.contributor.author	Sreenivas, T.V.
dc.date.accessioned	2026-02-06T06:39:01Z
dc.date.issued	2016
dc.description.abstract	We explore, experimentally, feature selection and optimization of stochastic model parameters for the problem of speaker spotting. Based on an initially identified segment of speech of a speaker, an iterative model refinement method is developed along with a latent variable mixture model so that segments of the same speaker are identified in a long speech record. It is found that a GMM with moderate number of mixtures is better suited for the task than a large number mixture model as used in speaker identification. Similarly, a PCA based low-dimensional projection of MFCC based feature vector provides better performance. We show that about 6 seconds of initially identified speaker data is sufficient to achieve > 90% performance of speaker segment identification. Â© 2016 IEEE.
dc.identifier.citation	European Signal Processing Conference, 2016, Vol.2016-November, , p. 135-139
dc.identifier.issn	22195491
dc.identifier.uri	https://doi.org/10.1109/EUSIPCO.2016.7760225
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/32011
dc.publisher	European Signal Processing Conference, EUSIPCO
dc.subject	Gaussian mixture model (GMM)
dc.subject	Mel-frequency cepstral coefficients (MFCCs)
dc.subject	Speaker diarization
dc.subject	Speaker spotting
dc.subject	Speaker verification
dc.title	Feature selection and model optimization for semi-supervised speaker spotting

Collections

Conference Papers

Feature selection and model optimization for semi-supervised speaker spotting

Files

Collections