Feature selection and model optimization for semi-supervised speaker spotting
| dc.contributor.author | Chetupalli, S.R. | |
| dc.contributor.author | Gopalakrishnan, A. | |
| dc.contributor.author | Sreenivas, T.V. | |
| dc.date.accessioned | 2026-02-06T06:39:01Z | |
| dc.date.issued | 2016 | |
| dc.description.abstract | We explore, experimentally, feature selection and optimization of stochastic model parameters for the problem of speaker spotting. Based on an initially identified segment of speech of a speaker, an iterative model refinement method is developed along with a latent variable mixture model so that segments of the same speaker are identified in a long speech record. It is found that a GMM with moderate number of mixtures is better suited for the task than a large number mixture model as used in speaker identification. Similarly, a PCA based low-dimensional projection of MFCC based feature vector provides better performance. We show that about 6 seconds of initially identified speaker data is sufficient to achieve > 90% performance of speaker segment identification. © 2016 IEEE. | |
| dc.identifier.citation | European Signal Processing Conference, 2016, Vol.2016-November, , p. 135-139 | |
| dc.identifier.issn | 22195491 | |
| dc.identifier.uri | https://doi.org/10.1109/EUSIPCO.2016.7760225 | |
| dc.identifier.uri | https://idr.nitk.ac.in/handle/123456789/32011 | |
| dc.publisher | European Signal Processing Conference, EUSIPCO | |
| dc.subject | Gaussian mixture model (GMM) | |
| dc.subject | Mel-frequency cepstral coefficients (MFCCs) | |
| dc.subject | Speaker diarization | |
| dc.subject | Speaker spotting | |
| dc.subject | Speaker verification | |
| dc.title | Feature selection and model optimization for semi-supervised speaker spotting |
