Convolutional Neural Network-Enabling Speech Command Recognition

dc.contributor.authorPatra, A.
dc.contributor.authorPandey, C.
dc.contributor.authorPalaniappan, K.
dc.contributor.authorSethy, P.K.
dc.date.accessioned2026-02-08T16:50:06Z
dc.date.issued2023
dc.description.abstractThe speech command recognition system based on deep image classification is the key that would tremendously promise to revolutionize research and development by overcoming the communication barrier between human and machine or computer. We are all aware of challenges in identifying the voice command in noise and variability in speed, pitch, and projection. This paper has developed an efficient and highly accurate speech command recognition for smart and effective speech processing applications like modern telecommunication. In particular, a novel convolutional neural network (CNN) is presented that works with a one-second audio clip consisting of one specific word including ten speech commands and other words labeled as “unknown,” and model implementations were operated in the noisy environment. The CNNs are structurally fully developed in such a way to recognize the speech commands with the utilization of deep learning (DL) for image classification concepts. Thus, this research used the concept of DL for image classification to translate the problem of speech command recognition into the image domain. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
dc.identifier.citationLecture Notes on Data Engineering and Communications Technologies, 2023, Vol.141, , p. 321-332
dc.identifier.issn23674512
dc.identifier.urihttps://doi.org/10.1016/j.jenvman.2025.127946
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/33650
dc.publisherSpringer Science and Business Media Deutschland GmbH
dc.subjectCNN
dc.subjectDeep learning
dc.subjectImage classification
dc.subjectSpectrogram
dc.subjectSpeech command recognition
dc.titleConvolutional Neural Network-Enabling Speech Command Recognition

Files

Collections