Voice activity detection from the breathing pattern of the speaker

dc.contributor.authorRamakrishnan, A.G.
dc.contributor.authorKrishnan, G.
dc.contributor.authorSrivathsan, S.
dc.date.accessioned2026-02-06T06:37:56Z
dc.date.issued2018
dc.description.abstractIn this paper, we propose a method to perform voice activity detection using only the breathing signal of a speaker. Human breathing and speech production go hand in hand. Normal respiration and respiration during speech have a different profile. The former is generally symmetric as compared to an asymmetric profile in the case of respiration during speech. Impedance pneumography provides a mechanism to capture chest expansions and compressions due to breathing. We have recorded the breathing signal along with the speech audio for 44 subjects while they were speaking and quiet. We have classified cycles of breathing into two classes, namely during speech and normal, using the cycle-synchronous discrete cosine transform coefficients of the breathing signal with different classifiers. The best accuracy of 96.4% is obtained using the k-nearest neighbor classifier. From the classified breathing cycles, we determine the intervals when a subject is quiet and when he is speaking. We use the corresponding timeframes on the simultaneously recorded audio and achieve a good accuracy in voice activity detection. Compared to the earlier reported time resolution of 30 sec, we obtain a decision for every breathing cycle, which works out to an average resolution of about 3 sec. © 2017 IEEE.
dc.identifier.citation2017 14th IEEE India Council International Conference, INDICON 2017, 2018, Vol., , p. -
dc.identifier.urihttps://doi.org/10.1109/INDICON.2017.8487643
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/31315
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.subjectBreathing pattern
dc.subjectCycle-synchronous DCT
dc.subjectImpedance pneumography
dc.subjectSpeech
dc.subjectSupport vector machine
dc.subjectVoice activity detection
dc.titleVoice activity detection from the breathing pattern of the speaker

Files