Please use this identifier to cite or link to this item:
https://idr.nitk.ac.in/jspui/handle/123456789/8006
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chetupalli, S.R. | |
dc.contributor.author | Sreenivas, T.V. | |
dc.contributor.author | Gopalakrishnan, A. | |
dc.date.accessioned | 2020-03-30T10:03:19Z | - |
dc.date.available | 2020-03-30T10:03:19Z | - |
dc.date.issued | 2019 | |
dc.identifier.citation | 25th National Conference on Communications, NCC 2019, 2019, Vol., , pp.- | en_US |
dc.identifier.uri | http://idr.nitk.ac.in/jspui/handle/123456789/8006 | - |
dc.description.abstract | Segment clustering is a crucial step in unsupervised speaker diarization. Bottom-up approaches, such as, hierarchical agglomerative clustering technique are used traditionally for segment clustering. In this paper, we consider the top-down approach to clustering, in which a speaker sensitive, low-dimensional representation of segments (speaker space) is obtained first, followed by Gaussian mixture model (GMM) based clustering. We explore three methods of obtaining the low dimension segment representation: (i) multi-dimensional scaling (MDS) based on segment to segment stochastic distances; (ii) traditional principal component analysis (PCA), and (iii) factor analysis (i-vectors), of GMM mean super-vectors. We found that, MDS based embeddings result in better representation and hence result in better diarization performance compared to PCA and even i-vector embeddings. � 2019 IEEE. | en_US |
dc.title | Comparison of low-dimension speech segment embeddings: Application to speaker diarization | en_US |
dc.type | Book chapter | en_US |
Appears in Collections: | 2. Conference Papers |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.