Acoustic-phonetic feature based Kannada dialect identification from vowel sounds

No Thumbnail Available

Date

2019

Authors

Chittaragi, N.B.
Koolagudi, S.G.

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In this paper, a dialect identification system is proposed for Kannada language using vowels sounds. Dialectal cues are characterized through acoustic parameters such as formant frequencies (F1 F3), and prosodic features [energy, pitch (F0), and duration]. For this purpose, a vowel dataset is collected from native speakers of Kannada belonging to different dialectal regions. Global features representing frame level global statistics such as mean, minimum, maximum, standard deviation and variance are extracted from vowel sounds. Local features representing temporal dynamic properties from the contour level are derived from the steady-state vowel region. Three decision tree-based ensemble algorithms, namely random forest, extreme random forest (ERF) and extreme gradient boosting algorithms are used for classification. Performance of both global and local features is evaluated individually. Further, the significance of every feature in dialect discrimination is analyzed using single factor-ANOVA (analysis of variances) tests. Global features with ERF ensemble model has shown a better average dialect identification performance of around 76%. Also, the contribution of every feature in dialect identification is verified. The role of duration, energy, pitch, and three formant features is found to be evidential in Kannada dialect classification. 2019, Springer Science+Business Media, LLC, part of Springer Nature.

Description

Keywords

Citation

International Journal of Speech Technology, 2019, Vol.22, 4, pp.1099-1113

Endorsement

Review

Supplemented By

Referenced By