Dialect Identification Using Spectral and Prosodic Features on Single and Ensemble Classifiers

No Thumbnail Available

Date

2018

Authors

Chittaragi, N.B.
Prakash, A.
Koolagudi, S.G.

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In this paper, investigation of the significance of spectral and prosodic behaviors of speech signal has been carried out for dialect identification. Spectral features such as cepstral coefficients, spectral flux, and entropy are extracted from shorter frames. Prosodic attributes such as pitch, energy, and duration are derived from longer frames. IViE (Intonational Variations in English) speech corpus covering nine dialectal regions of British Isles has been considered, to evaluate the proposed approach. Since corpus is available in both read and semi-spontaneous modes, the influence of spectral and prosodic behavior over these datasets is distinguishably articulated. Further, two distinct classification algorithms, namely support vector machine (SVM) and an ensemble of decision trees along with the SVM are used for identification of nine dialects. Dialect discriminating information captured from both features are used for constructing feature vectors. Experiments have been conducted on individual and combinations of features. A better dialect recognition performance is observed with ensemble methods over a single independent SVM. 2017, King Fahd University of Petroleum & Minerals.

Description

Keywords

Citation

Arabian Journal for Science and Engineering, 2018, Vol.43, 8, pp.4289-4302

Endorsement

Review

Supplemented By

Referenced By