Conference Papers

Permanent URI for this collection: https://idr.nitk.ac.in/handle/123456789/28506

A novel approach to video copy detection using audio fingerprints and PCA
(Elsevier B.V., 2011) Roopalakshmi, R.; Guddeti, G.R.M.
In the Content-Based Copy Detection (CBCD) literature, numerous state-of-the-art techniques focus primarily on the visual content of video. Exploiting audio fingerprints for the CBCD problem is necessary for the following reasons: audio content constitutes an indispensable information source, and transformations on audio content are limited compared to those on visual content. In this paper, a novel CBCD approach using audio features and PCA is proposed, which includes two stages: first, multiple feature vectors are computed using MFCC and four spectral descriptors; second, the features are further processed using PCA to provide a compact feature description. The results of experiments on the TRECVID-2007 dataset demonstrate the efficiency of the proposed method against various transformations. © 2011 Published by Elsevier Ltd.
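The second stage described in the abstract above (compacting feature vectors with PCA) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the helper name `pca_reduce`, the synthetic input, the feature dimension of 20, and the choice of 5 components are all assumptions made for illustration, and the MFCC and spectral-descriptor extraction is not shown.

```python
import numpy as np

def pca_reduce(features, n_components):
    """Project row-wise feature vectors onto their top principal components.

    Hypothetical helper for illustration; not taken from the paper.
    """
    features = np.asarray(features, dtype=float)
    centered = features - features.mean(axis=0)
    # SVD of the centered data: the rows of vt are the principal directions,
    # ordered by decreasing singular value.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    reduced = centered @ components.T
    # Fraction of the total variance captured by each kept component.
    variance = singular_values ** 2
    explained = variance[:n_components] / variance.sum()
    return reduced, components, explained

# Synthetic stand-in for per-frame audio feature vectors (assumed sizes).
rng = np.random.default_rng(0)
frames = rng.normal(size=(200, 20))

reduced, components, explained = pca_reduce(frames, n_components=5)
print(reduced.shape)  # (200, 5)
```

Each row of `reduced` is a shorter descriptor of the corresponding input row; how many components to keep is a trade-off between compactness and retained variance that the abstract does not specify.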
Multiclass SVM-based language-independent emotion recognition using selective speech features
(Institute of Electrical and Electronics Engineers Inc., 2014) Kokane Amol, T.; Guddeti, G.R.M.
In this paper, we focus on recognizing six basic emotions, viz. Anger, Disgust, Fear, Happiness, Neutral and Sadness, using selective features of speech signals in different languages such as German and Telugu. The feature set includes thirteen Mel-Frequency Cepstral Coefficients (MFCC) and four other features of the speech signal: Energy, Short Term Energy, Spectral Roll-Off and Zero-Crossing Rate (ZCR). The Surrey Audio-Visual Expressed Emotion (SAVEE) Database is used to train the Multiclass Support Vector Machine (SVM) classifier, and the German corpus EMO-DB (Berlin Database of Emotional Speech) and the Telugu corpus IITKGP: SESC are used for emotion recognition. The results are analyzed for each speech emotion separately, and accuracies of 98.3071% and 95.8166% are obtained for the EMO-DB and IITKGP: SESC databases, respectively. © 2014 IEEE.
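Three of the non-MFCC features named in the abstract above (Short Term Energy, Zero-Crossing Rate and Spectral Roll-Off) have common textbook definitions, sketched below with NumPy. The function names, the frame length and hop size, the 0.85 roll-off fraction, and the test tone are assumptions for illustration; the paper's exact definitions and parameters are not given in the abstract, and MFCC extraction and SVM training are omitted.

```python
import numpy as np

def frame_signal(signal, frame_len, hop):
    """Split a 1-D signal into overlapping frames (trailing partial frame dropped)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx]

def short_term_energy(frames):
    """Sum of squared samples in each frame."""
    return np.sum(frames ** 2, axis=1)

def zero_crossing_rate(frames):
    """Fraction of adjacent sample pairs in each frame whose signs differ."""
    signs = np.sign(frames)
    signs[signs == 0] = 1  # treat exact zeros as positive
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)

def spectral_rolloff(frames, sample_rate, fraction=0.85):
    """Frequency below which `fraction` of each frame's spectral energy lies."""
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sample_rate)
    cumulative = np.cumsum(power, axis=1)
    threshold = fraction * cumulative[:, -1:]
    # Index of the first bin where the cumulative energy reaches the threshold.
    bin_idx = np.argmax(cumulative >= threshold, axis=1)
    return freqs[bin_idx]

# Assumed test input: one second of a 440 Hz tone sampled at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440 * t)

frames = frame_signal(tone, frame_len=400, hop=160)
energy = short_term_energy(frames)
zcr = zero_crossing_rate(frames)
rolloff = spectral_rolloff(frames, sample_rate)
print(frames.shape)  # (98, 400)
```

In a pipeline like the one the abstract describes, per-frame values such as these would be combined with the MFCCs into one feature vector per utterance before classification; how that aggregation is done is not stated in the abstract.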
