Analysis of Speaker Recognition in Blended Emotional Environment Using Deep Learning Approaches

dc.contributor.author: Tomar, S.
dc.contributor.author: Koolagudi, S.G.
dc.date.accessioned: 2026-02-06T06:34:41Z
dc.date.issued: 2023
dc.description.abstract: Human conversation generally carries some emotion, and natural emotions are often blended. Today’s Speaker Recognition systems lack an emotion component. This work proposes a Speaker Recognition in Blended Emotion Environment (SRBEE) system to enhance Speaker Recognition (SR) in an emotional context. Speaker Recognition algorithms nearly always achieve perfect performance on neutral speech, but not on emotional speech. This work attempts to recognize speakers in blended emotion using Mel-Frequency Cepstral Coefficients (MFCC) feature extraction and a Conv2D classifier. In the blended emotional environment, calculating the accuracy of the Speaker Recognition task is complex. Utterances blending four basic natural emotions (happy, sad, angry, and fearful) are tested in the proposed system to reduce SR’s complexity in a blended emotional environment. The proposed system achieves an average accuracy of 99.3% for blended emotion with neutral speech and 92.8% for the four basic blended natural emotions (happy, sad, angry, and fearful). The dataset was prepared by blending two emotions in one utterance. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.
dc.identifier.citation: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, Vol. 14301 LNCS, p. 691-698
dc.identifier.issn: 0302-9743
dc.identifier.uri: https://doi.org/10.1007/978-3-031-45170-6_72
dc.identifier.uri: https://idr.nitk.ac.in/handle/123456789/29400
dc.publisher: Springer Science and Business Media Deutschland GmbH
dc.subject: Blended emotion
dc.subject: Convolutional Neural Network
dc.subject: Mel Frequency Cepstral Coefficients
dc.subject: Speaker Recognition
dc.subject: Speaker Recognition in Blended Emotion Environment
dc.subject: Valence
dc.title: Analysis of Speaker Recognition in Blended Emotional Environment Using Deep Learning Approaches
