Kannada Dialect Classification Using CNN

Hegde P.; Chittaragi N.B.; Mothukuri S.K.P.; Koolagudi S.G.

Date

2020

Authors

Hegde P.

Chittaragi N.B.

Mothukuri S.K.P.

Koolagudi S.G.

Abstract

Kannada is one of the prominent languages spoken in southern India. Because Kannada is a lingua franca spoken by more than 70 million people, it naturally has several dialects. In this paper, we identify five major dialectal regions in Karnataka state and attempt to classify these five dialects from sentence-level utterances. Sentences are segmented automatically from continuous speech using spectral centroid and short-term energy features. Mel-frequency cepstral coefficient (MFCC) features are extracted from these sentence units and used to train a convolutional neural network (CNN). Along with MFCCs, shifted delta and double delta coefficients are also tried as inputs for training the CNN model. The proposed CNN-based dialect recognition system is also tested on the internationally known standard Intonation Variation in English (IViE) dataset. The CNN model resulted in better performance. It is observed that using one convolution layer and three fully connected layers balances computational complexity and yields better accuracy on both the Kannada and English datasets. © 2020, Springer Nature Switzerland AG.
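
The segmentation step mentioned in the abstract relies on two frame-level features, short-term energy and spectral centroid. The following is a minimal sketch of how such features can be computed and thresholded to separate speech from silence. It is not the authors' implementation: the abstract does not give the frame length, hop size, thresholds, or decision rule, so the function names, the defaults in `speech_mask`, and the "both features above a threshold" rule are all assumptions made for illustration. A naive DFT is used so the example needs only the standard library.

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split a list of samples into overlapping frames (drops the ragged tail)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_term_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency (Hz) of one frame, via a naive DFT."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        mags.append(abs(acc))
    total = sum(mags)
    if total < 1e-12:
        return 0.0  # silent frame: centroid is undefined, report 0
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total


def speech_mask(samples, sample_rate, frame_len=64, hop=32,
                energy_thresh=0.01, centroid_thresh=100.0):
    """Assumed rule: a frame is speech when both features exceed their
    thresholds. All default values here are illustrative, not from the paper."""
    return [short_term_energy(f) > energy_thresh
            and spectral_centroid(f, sample_rate) > centroid_thresh
            for f in frame_signal(samples, frame_len, hop)]
```

Runs of consecutive speech frames in the resulting mask would then mark candidate sentence units; in practice, an FFT-based library routine would replace the naive DFT for speed.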

Citation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 11987 LNAI, p. 254-259

URI

https://doi.org/10.1007/978-3-030-66187-8_24
https://idr.nitk.ac.in/handle/123456789/14908

Collections

2. Conference Papers
