Please use this identifier to cite or link to this item: https://idr.nitk.ac.in/jspui/handle/123456789/6907
Title: Voice activity detection from the breathing pattern of the speaker
Authors: Ramakrishnan, A.G.
Krishnan, G.
Srivathsan, S.
Issue Date: 2018
Citation: 2017 14th IEEE India Council International Conference, INDICON 2017, 2018
Abstract: In this paper, we propose a method to perform voice activity detection using only the breathing signal of a speaker. Human breathing and speech production go hand in hand. Normal respiration and respiration during speech have different profiles: the former is generally symmetric, whereas the latter is asymmetric. Impedance pneumography provides a mechanism to capture chest expansions and compressions due to breathing. We have recorded the breathing signal along with the speech audio for 44 subjects while they were speaking and while they were quiet. We have classified cycles of breathing into two classes, namely during speech and normal, using the cycle-synchronous discrete cosine transform coefficients of the breathing signal with different classifiers. The best accuracy of 96.4% is obtained using the k-nearest neighbor classifier. From the classified breathing cycles, we determine the intervals when a subject is quiet and when the subject is speaking. We use the corresponding timeframes on the simultaneously recorded audio and achieve good accuracy in voice activity detection. Compared to the earlier reported time resolution of 30 sec, we obtain a decision for every breathing cycle, which works out to an average resolution of about 3 sec. © 2017 IEEE.
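Illustrative sketch (not from the paper): the abstract gives no implementation details, so the following is only a minimal illustration of the general idea of classifying individual breathing cycles from their discrete cosine transform coefficients with a k-nearest neighbor vote. The triangular toy cycles, the resampling to 32 points, the choice of 8 coefficients, k = 3, and all function names are assumptions made for this example, not the authors' settings or data.

```python
import math

def resample(cycle, n_points=32):
    """Linearly interpolate one breathing cycle onto n_points samples
    (assumed length normalization so cycles of any duration compare)."""
    last = len(cycle) - 1
    out = []
    for i in range(n_points):
        pos = i * last / (n_points - 1)
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(cycle[lo] * (1 - frac) + cycle[hi] * frac)
    return out

def dct_features(cycle, n_coeffs=8, n_points=32):
    """First n_coeffs unnormalized DCT-II coefficients of one resampled cycle."""
    x = resample(cycle, n_points)
    return [
        sum(x[n] * math.cos(math.pi * k * (n + 0.5) / n_points)
            for n in range(n_points))
        for k in range(n_coeffs)
    ]

def knn_predict(train, query, k=3):
    """train is a list of (features, label); majority vote of k nearest."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    labels = [label for _, label in nearest]
    return max(set(labels), key=labels.count)

def synthetic_cycle(n_samples, peak_at):
    """Toy cycle (hypothetical data): rises to its peak at fraction
    peak_at of the cycle, then falls back to zero."""
    peak = peak_at * (n_samples - 1)
    return [
        i / peak if i <= peak else (n_samples - 1 - i) / (n_samples - 1 - peak)
        for i in range(n_samples)
    ]

# Symmetric toy cycles stand in for normal breathing; cycles with a short
# rise and a long fall stand in for breathing during speech.
train = [(dct_features(synthetic_cycle(n, p)), "normal")
         for n, p in [(40, 0.45), (50, 0.50), (60, 0.55)]]
train += [(dct_features(synthetic_cycle(n, p)), "speech")
          for n, p in [(40, 0.15), (50, 0.20), (60, 0.25)]]

print(knn_predict(train, dct_features(synthetic_cycle(45, 0.22))))
print(knn_predict(train, dct_features(synthetic_cycle(55, 0.48))))
```

Because one label is produced per breathing cycle, consecutive cycles labeled as speech could be merged into speaking intervals and mapped onto the time axis of the audio, which is the per-cycle resolution the abstract refers to.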
URI: http://idr.nitk.ac.in/jspui/handle/123456789/6907
Appears in Collections: 2. Conference Papers


