Performance analysis of LPC and MFCC features in voice conversion using artificial neural networks

No Thumbnail Available

Date

2017

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Verlag service@springer.de

Abstract

Voice Conversion is a technique in which source speakers voice is morphed to a target speakers voice by learning source–target relationship from a number of utterances from source and the target. There are many applications which may benefit from this sort of technology for example dubbing movies, TV-shows, TTS systems and so on. In this paper, analysis on the performance of ANN-based Voice Conversion system is done using linear predictive coding (LPC) and mel-frequency cepstral coefficients (MFCCs). Experimental results show that Voice Conversion system based on LPC features is better than the ones based on MFCC features. © Springer Science+Business Media Singapore 2017.

Description

Keywords

Linear predictive coding and neural networks, Mel-frequency cepstral coefficients, Morphing, Voice conversion

Citation

Advances in Intelligent Systems and Computing, 2017, Vol.469, , p. 275-280

Endorsement

Review

Supplemented By

Referenced By