Chittaragi, N.B.Prakash, A.Koolagudi, S.G.2020-03-312020-03-312018Arabian Journal for Science and Engineering, 2018, Vol.43, 8, pp.4289-4302https://idr.nitk.ac.in/handle/123456789/10577In this paper, investigation of the significance of spectral and prosodic behaviors of speech signal has been carried out for dialect identification. Spectral features such as cepstral coefficients, spectral flux, and entropy are extracted from shorter frames. Prosodic attributes such as pitch, energy, and duration are derived from longer frames. IViE (Intonational Variations in English) speech corpus covering nine dialectal regions of British Isles has been considered, to evaluate the proposed approach. Since corpus is available in both read and semi-spontaneous modes, the influence of spectral and prosodic behavior over these datasets is distinguishably articulated. Further, two distinct classification algorithms, namely support vector machine (SVM) and an ensemble of decision trees along with the SVM are used for identification of nine dialects. Dialect discriminating information captured from both features are used for constructing feature vectors. Experiments have been conducted on individual and combinations of features. A better dialect recognition performance is observed with ensemble methods over a single independent SVM. 2017, King Fahd University of Petroleum & Minerals.Dialect Identification Using Spectral and Prosodic Features on Single and Ensemble ClassifiersArticle