NITK-KLESC: Kannada Language Emotional Speech Corpus for Speaker Recognition

dc.contributor.authorTomar, S.
dc.contributor.authorGupta, P.
dc.contributor.authorKoolagudi, S.G.
dc.date.accessioned2026-02-06T06:34:28Z
dc.date.issued2023
dc.description.abstractThis work introduces an emotional speech dataset for Speaker Recognition (SR) task. The proposed dataset is recorded in the Kannada language from the people of Karnataka state of India. The speech dataset is collected by simulating five different emotions, such as Fear, Sad, Anger, Happy, and Neutral. The dataset is named as National Institute of Technology Karnataka, India- Kannada Language Emotional Speech Corpus (NITK-KLESC). The proposed dataset will be useful for SR tasks in various emotions. The proposed emotional speech dataset will be useful for emotion recognition, analysis of emotional speech, speech recognition, gender identification, and age identification of the age group 20 to 50 years. The proposed work describes the development, processing, analysis, acquisition, and evaluation of the proposed emotional speech dataset (NITK-KLESC). The analysis of emotional speech was done by considering various basic speech parameters like Pitch, Tempo, Intensity, and Zero Crossing Rate (ZCR). The characteristics of the dataset are reported using MFCC feature extraction and considered the CNN model as a classifier, compared with the existing EmoDB dataset. The average accuracy of the Emotional Speech Speaker Recognition (ESSR) task was measured at 84.44% with the EmoDB dataset and 95.2% with the proposed NITK-KLESC dataset. © 2023 IEEE.
dc.identifier.citationProceedings of 2023 26th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2023, 2023, Vol., , p. -
dc.identifier.urihttps://doi.org/10.1109/O-COCOSDA60357.2023.10482961
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/29265
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.subjectMel Frequency Cepstral Coefficient
dc.subjectPitch
dc.subjectSpeaker Recognition
dc.subjectSpeaker Recognition in Emotional Environment
dc.subjectTempo
dc.subjectZero Crossing Rate
dc.titleNITK-KLESC: Kannada Language Emotional Speech Corpus for Speaker Recognition

Files