Recognition of emotions from video using acoustic and facial features

Sreenivasa Rao, K.S.; Koolagudi, S.

Recognition of emotions from video using acoustic and facial features

dc.contributor.author	Sreenivasa Rao, K.S.
dc.contributor.author	Koolagudi, S.
dc.date.accessioned	2026-02-05T09:33:41Z
dc.date.issued	2015
dc.description.abstract	In this paper, acoustic and facial features extracted from video are explored for recognizing emotions. The temporal variation of gray values of the pixels within eye and mouth regions is used as a feature to capture the emotion-specific knowledge from the facial expressions. Acoustic features representing spectral and prosodic information are explored for recognizing emotions from the speech signal. Autoassociative neural network models are used to capture the emotion-specific information from acoustic and facial features. The basic objective of this work is to examine the capability of the proposed acoustic and facial features in view of capturing the emotion-specific information. Further, the correlations among the feature sets are analyzed by combining the evidences at different levels. The performance of the emotion recognition system developed using acoustic and facial features is observed to be 85.71 and 88.14 %, respectively. It has been observed that combining the evidences of models developed using acoustic and facial features improved the recognition performance to 93.62 %. The performance of the emotion recognition systems developed using neural network models is compared with hidden Markov models, Gaussian mixture models and support vector machine models. The proposed features and models are evaluated on real-life emotional database, Interactive Emotional Dyadic Motion Capture database, which was recently collected at University of Southern California. © 2013, Springer-Verlag London.
dc.identifier.citation	Signal, Image and Video Processing, 2015, 9, 5, pp. 1029-1045
dc.identifier.issn	18631703
dc.identifier.uri	https://doi.org/10.1007/s11760-013-0522-6
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/26255
dc.publisher	Springer-Verlag London Ltd
dc.subject	Hidden Markov models
dc.subject	Markov processes
dc.subject	Neural networks
dc.subject	Speech recognition
dc.subject	Trellis codes
dc.subject	Acoustic features
dc.subject	Autoassociative neural networks
dc.subject	Emotion recognition
dc.subject	Facial feature
dc.subject	Prosodic features
dc.subject	Face recognition
dc.title	Recognition of emotions from video using acoustic and facial features

Collections

Journal Articles

Recognition of emotions from video using acoustic and facial features

Files

Collections