NISP: A multi-lingual multi-accent dataset for speaker profiling

dc.contributor.author: Kalluri, S.B.
dc.contributor.author: Vijayasenan, D.
dc.contributor.author: Ganapathy, S.
dc.contributor.author: Rajan, M.
dc.contributor.author: Krishnan, P.
dc.date.accessioned: 2026-02-06T06:36:15Z
dc.date.issued: 2021
dc.description.abstract: Many commercial and forensic applications of speech demand the extraction of information about speaker characteristics, a task that falls into the broad category of speaker profiling. The speaker characteristics needed for profiling include physical traits such as height, age, and gender, along with the speaker's native language. Many of the available datasets have only partial information for speaker profiling. In this paper, we attempt to overcome this limitation by developing a new dataset which has speech data from five different Indian languages along with English. Metadata for speaker profiling applications, including linguistic information, regional information, and physical characteristics of each speaker, is also collected. We call this the NITK-IISc Multilingual Multi-accent Speaker Profiling (NISP) dataset. The description of the dataset, potential applications, and baseline results for speaker profiling on this dataset are provided in this paper. © 2021 IEEE.
dc.identifier.citation: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2021, Vol. 2021-June, p. 6953-6957
dc.identifier.issn: 0736-7791; 1520-6149
dc.identifier.uri: https://doi.org/10.1109/ICASSP39728.2021.9414349
dc.identifier.uri: https://idr.nitk.ac.in/handle/123456789/30332
dc.publisher: Institute of Electrical and Electronics Engineers Inc.
dc.subject: NISP dataset
dc.subject: Physical parameters
dc.subject: Speaker profiling
dc.subject: Voice forensics
dc.title: NISP: A multi-lingual multi-accent dataset for speaker profiling
