Convolutional Neural Network-Enabling Speech Command Recognition
No Thumbnail Available
Date
2023
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Science and Business Media Deutschland GmbH
Abstract
The speech command recognition system based on deep image classification is the key that would tremendously promise to revolutionize research and development by overcoming the communication barrier between human and machine or computer. We are all aware of challenges in identifying the voice command in noise and variability in speed, pitch, and projection. This paper has developed an efficient and highly accurate speech command recognition for smart and effective speech processing applications like modern telecommunication. In particular, a novel convolutional neural network (CNN) is presented that works with a one-second audio clip consisting of one specific word including ten speech commands and other words labeled as “unknown,” and model implementations were operated in the noisy environment. The CNNs are structurally fully developed in such a way to recognize the speech commands with the utilization of deep learning (DL) for image classification concepts. Thus, this research used the concept of DL for image classification to translate the problem of speech command recognition into the image domain. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Description
Keywords
CNN, Deep learning, Image classification, Spectrogram, Speech command recognition
Citation
Lecture Notes on Data Engineering and Communications Technologies, 2023, Vol.141, , p. 321-332
