Feature selection and model optimization for semi-supervised speaker spotting

dc.contributor.authorChetupalli, S.R.
dc.contributor.authorGopalakrishnan, A.
dc.contributor.authorSreenivas, T.V.
dc.date.accessioned2026-02-06T06:39:01Z
dc.date.issued2016
dc.description.abstractWe explore, experimentally, feature selection and optimization of stochastic model parameters for the problem of speaker spotting. Based on an initially identified segment of speech of a speaker, an iterative model refinement method is developed along with a latent variable mixture model so that segments of the same speaker are identified in a long speech record. It is found that a GMM with moderate number of mixtures is better suited for the task than a large number mixture model as used in speaker identification. Similarly, a PCA based low-dimensional projection of MFCC based feature vector provides better performance. We show that about 6 seconds of initially identified speaker data is sufficient to achieve > 90% performance of speaker segment identification. © 2016 IEEE.
dc.identifier.citationEuropean Signal Processing Conference, 2016, Vol.2016-November, , p. 135-139
dc.identifier.issn22195491
dc.identifier.urihttps://doi.org/10.1109/EUSIPCO.2016.7760225
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/32011
dc.publisherEuropean Signal Processing Conference, EUSIPCO
dc.subjectGaussian mixture model (GMM)
dc.subjectMel-frequency cepstral coefficients (MFCCs)
dc.subjectSpeaker diarization
dc.subjectSpeaker spotting
dc.subjectSpeaker verification
dc.titleFeature selection and model optimization for semi-supervised speaker spotting

Files