Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 4 of 4
  • Item
    Choice of a classifier, based on properties of a dataset: case study-speech emotion recognition
    (Springer New York LLC barbara.b.bertram@gsk.com, 2018) Koolagudi, S.G.; Vishnu Srinivasa Murthy, Y.V.S.; Bhaskar, S.P.
    In this paper, the process of selecting a classifier based on the properties of dataset is designed since it is very difficult to experiment the data on n—number of classifiers. As a case study speech emotion recognition is considered. Different combinations of spectral and prosodic features relevant to emotions are explored. The best subset of the chosen set of features is recommended for each of the classifiers based on the properties of chosen dataset. Various statistical tests have been used to estimate the properties of dataset. The nature of dataset gives an idea to select the relevant classifier. To make it more precise, three other clustering and classification techniques such as K-means clustering, vector quantization and artificial neural networks are used for experimentation and results are compared with the selected classifier. Prosodic features like pitch, intensity, jitter, shimmer, spectral features such as mel frequency cepstral coefficients (MFCCs) and formants are considered in this work. Statistical parameters of prosody such as minimum, maximum, mean (?) and standard deviation (?) are extracted from speech and combined with basic spectral (MFCCs) features to get better performance. Five basic emotions namely anger, fear, happiness, neutral and sadness are considered. For analysing the performance of different datasets on different classifiers, content and speaker independent emotional data is used, collected from Telugu movies. Mean opinion score of fifty users is collected to label the emotional data. To make it more accurate, one of the benchmark IIT-Kharagpur emotional database is used to generalize the conclusions. © 2018, Springer Science+Business Media, LLC, part of Springer Nature.
  • Item
    Islanding detection method based on image classification technique using histogram of oriented gradient features
    (Institution of Engineering and Technology jbristow@theiet.org, 2020) Manikonda, S.K.G.; Gaonkar, D.N.
    A new islanding detection method based on image classification with support vector machine is proposed in this study. Histogram of oriented gradient features is extracted from the image for classifying non-islanding and islanding events. In the proposed technique, the time-series signal acquired from the point of common coupling is first converted into an image. Histogram of oriented gradient features is extracted from the image, which is used as an input feature vector for training and testing multiple support vector machine classifiers. Parameters such as voltage, rate of change of voltage, and rate of change of negative sequence voltage are used. Furthermore, a feature for early islanding detection is also presented to detect an islanding event even before it has occurred. The detection accuracy of the proposed method is tested with different kernels. The performance of all the classifiers is tested with 5-fold cross-validation. The classification results show that islanding detection with image classification based on the histogram of oriented gradient feature and multiple support vector machine classifiers can achieve excellent results. © The Institution of Engineering and Technology 2020
  • Item
    HybridCNN based hyperspectral image classification using multiscale spatiospectral features
    (Elsevier B.V., 2020) Mohan, A.; Venkatesan, M.
    Hyperspectral images (HSIs) are contiguous band images widely used in remote sensing applications. The evolution of deep learning techniques made a significant impact on HSI classification. Several HSI processing applications rely on various Convolutional Neural Network (CNN) models. However, the higher dimensionality nature of HSIs increases the computational complexity and leads to the Hughes phenomenon. Therefore most of the CNN models perform dimensionality reduction (DR) as a preprocessing step. Another challenge in HSI classification is the consideration of both spatial and spectral features for obtaining accurate results. A few 3-D-CNN models are designed to overcome this challenge, but it takes more execution time than other methods. This research work proposes a multiscale spatio-spectral feature based hybrid CNN model for hyperspectral image classification. Hybrid DR used for optimal band extraction, which performs linear Gaussian Random Projection (GRP) and non-linear Kernel Principal Component Analysis (KPCA). The proposed hybrid CNN classification technique extracts the spectral and spatial features for different window sizes using 3D-CNN. These features concatenated and fed into a 2D-CNN for further feature extraction and classification. The hybrid model is compared against various state-of-the-art CNN based techniques and found to showcase a satisfactory result with less computational complexity. © 2020 Elsevier B.V.
  • Item
    Singer identification for Indian singers using convolutional neural networks
    (Springer, 2021) Vishnu Srinivasa Murthy, Y.V.S.; Koolagudi, S.G.; Jeshventh Raja, T.K.
    Singer identification is one of the important aspects of music information retrieval (MIR). In this work, traditional feature-based and trending convolutional neural network (CNN) based approaches are considered and compared for identifying singers. Two different datasets, namely artist20 and the Indian popular singers’ database with 20 singers are used in this work to evaluate proposed approaches. Cepstral features such as Mel-frequency cepstral coefficients (MFCCs) and linear prediction cepstral coefficients (LPCCs) are considered to represent timbre information. Shifted delta cepstral (SDC) features are also computed beside the cepstral coefficients to capture temporal information. In addition, chroma features are computed from 12 semitones of a musical octave, overall forming a 46-dimensional feature vector. Experiments are conducted with different feature combinations, and suitable features are selected using the genetic algorithm-based feature selection (GAFS) approach. Two different classification techniques, namely artificial neural networks (ANNs) and random forest (RF), are considered on the features mentioned above. Further, spectrograms and chromagrams of audio clips are directly fed to CNN for classification. The singer identification results obtained using CNNs seem to be better than the traditional isolated and ensemble classifiers. Average accuracy of around 75% is observed with CNN in the case of Indian popular singers database. Whereas, on artist20 dataset, the proposed configuration of feature-based approach and CNN could not give better than 60% accuracy. © 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.