Conference Papers

Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/28506

Browse

Search Results

Now showing 1 - 5 of 5
  • Item
    Spectral Feature Based Kannada Dialect Classification from Stop Consonants
    (Springer, 2019) Chittaragi, N.B.; Hegde, P.; Mothukuri, S.K.P.; Koolagudi, G.K.
    This study focuses on the investigation of the significance of stop consonants in view of the classification of Kannada dialects. Majority of the studies proposed have shown the existence of evidential differences in the pronunciation of vowels across dialects. However, consonant based studies on dialect processing are found to be comparatively lesser. In this work, eight stop consonants are used for characterization of five Kannada dialects. Acoustic characteristics such as cepstral coefficients, formant frequencies, spectral flux, and rolloff features are explored from spectral analysis of stops. The consonant dataset is derived from standard Kannada dialect dataset consisting of 2417 consonants obtained from 16 native speakers from each dialect. Support vector machine (SVM) and decision tree-based extreme gradient boosting (XGB) ensemble classification methods are employed for automatic recognition of Kannada dialects. The research findings show that the stops existing for shorter duration also convey dialectal linguistic cues. Combination of spectral properties has contributed to the identification of distinct dialect-specific information across Kannada dialects. © 2019, Springer Nature Switzerland AG.
  • Item
    Image Colorization Using GANs and Perceptual Loss
    (Institute of Electrical and Electronics Engineers Inc., 2020) Sankar, R.; Nair, A.; Abhinav, P.; Mothukuri, S.K.P.; Koolagudi, S.G.
    Image colorization is of great use for several applications, such as the restoration of old images, as well as enabling the storage of grayscale images, which take up less space, which can later be colorized. But this problem is hard since there exist many possible color combinations for a particular grayscale image. Recent developments have aimed to solve this problem using deep learning. But, for achieving good performance, they require highly processed inputs, along with additional elements, such as semantic maps. In this paper, an attempt has been made for generalizing the procedure of colorization using a conditional Deep Convolutional Generative Adversarial Network (DCGAN) by adding "Perceptual Loss". The network is trained over the CIFAR-100 dataset. The results of the proposed generative model with perceptual loss are compared with the existing state-of-the-art systems normal GAN model and U-Net Convolutional model. © 2020 IEEE.
  • Item
    Kannada Dialect Classification using Artificial Neural Networks
    (Institute of Electrical and Electronics Engineers Inc., 2020) Mothukuri, S.K.P.; Hegde, P.; Chittaragi, N.B.; Koolagudi, S.G.
    In this paper, Automatic Dialect Classification (ADC) system is proposed for dialects of Kannada language (the Dravidian language spoken in Southern Karnataka). ADC system is proposed by extracting spectral Mel Frequency Cepstral Coefficients (MFCCs), and log filter bank features along with Linear predictive coefficients. In addition, prosodic pitch and energy features are extracted to capture dialect specific cues. A Kannada dialect speech corpus consisting of five prominent dialects of Kannada language is used for designing the ADC system. An attempt is made by using Artificial Neural Networks (ANNs) technique for classification of Kannada dialects. As, recently, ANNs and its variants are gaining more popularity in the area of speech processing application. Hyperparameter tuning of ANN has resulted with an increase in performance. © 2020 IEEE.
  • Item
    Efficient Traffic Signboard Recognition System Using Convolutional Networks
    (Springer, 2020) Mothukuri, S.K.P.; Tejas, R.; Patil, S.; Darshan, V.; Koolagudi, S.G.
    In this paper, a smart automatic traffic sign recognition system is proposed. This signboard recognition system plays a vital role in the automated driving system of transport vehicles. The model is built based on convolutional neural network. The German Traffic Sign Detection Benchmark (GTSDB), a standard open-source segmented image dataset with forty-three different signboard classes is considered for experimentation. Implementation of the system is highly focused on processing speed and classification accuracy. These aspects are concentrated, such that the built model is suitable for real-time automated driving systems. Similar experiments are carried in comparison with the pre-trained convolution models. The performance of the proposed model is better in the aspects of fast responsive time. © Springer Nature Singapore Pte Ltd. 2020.
  • Item
    Kannada Dialect Classification Using CNN
    (Springer Science and Business Media Deutschland GmbH, 2020) Hegde, P.; Chittaragi, N.B.; Mothukuri, S.K.P.; Koolagudi, S.G.
    Kannada is one of the prominent languages spoken in southern India. Since the Kannada is a lingua franca and spoken by more than 70 million people, it is evident to have dialects. In this paper, we identified five major dialectal regions in Karnataka state. An attempt is made to classify these five dialects from sentence-level utterances. Sentences are segmented from continuous speech automatically by using spectral centroid and short term energy features. Mel frequency cepstral coefficient (MFCC) features are extracted from these sentence units. These features are used to train the convolutional neural networks (CNN). Along with MFCCs, shifted delta and double delta coefficients are also attempted to train the CNN model. The proposed CNN based dialect recognition system is also tested with internationally known standard Intonation Variation in English (IViE) dataset. The CNN model has resulted in better performance. It is observed that the use of one convolution layer and three fully connected layers balances computational complexity and results in better accuracy with both Kannada and English datasets. © 2020, Springer Nature Switzerland AG.