Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing

Poluboina, V.; Pulikala, A.; Pitchai Muthu, A.N.

dc.contributor.author	Poluboina, V.
dc.contributor.author	Pulikala, A.
dc.contributor.author	Pitchai Muthu, A.N.
dc.date.accessioned	2026-02-04T12:28:15Z
dc.date.issued	2022
dc.description.abstract	This study investigated the effect of proportionally frequency-compressed encoding of temporal fine structure (TFS) information on speech perception in noise, using vocoder simulations of cochlear implant signal processing. A pitch synchronous overlap-add (PSOLA) algorithm was proposed for downward frequency shifting of TFS. Speech recognition scores (SRS) were measured at −10 dB, 0 dB, and +10 dB SNR for eight signal processing conditions: a sine wave vocoder without TFS (NO-TFS); four unshifted TFS conditions (full band TFS and TFS up to 2000, 1000, and 600 Hz); and three PSOLA conditions in which the 2000, 1000, and 600 Hz TFS was shifted to 1000, 500, and 300 Hz, respectively. The original envelope was unchanged across conditions. SRS at +10 dB and −10 dB SNR reached ceiling and floor, respectively, in most conditions, so SRS at 0 dB SNR was compared across conditions. SRS was highest with full band TFS and lowest for the NO-TFS condition. The SRS for 600 Hz TFS shifted to 300 Hz through PSOLA was higher than for the NO-TFS condition. These findings suggest that encoding TFS by proportional frequency compression yields better speech perception in noise than NO-TFS. Notably, speech recognition was better than with the sine wave vocoder for all TFS conditions, including the frequency-compressed 600 Hz TFS. © 2021 Elsevier Ltd
dc.identifier.citation	Applied Acoustics, 2022, 189
dc.identifier.issn	0003682X
dc.identifier.uri	https://doi.org/10.1016/j.apacoust.2021.108616
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/22668
dc.publisher	Elsevier Ltd
dc.subject	Encoding (symbols)
dc.subject	Signal encoding
dc.subject	Signal to noise ratio
dc.subject	Speech
dc.subject	Speech recognition
dc.subject	Vocoders
dc.subject	Cochlear implant signal processing
dc.subject	Condition
dc.subject	Frequency compression
dc.subject	Overlap-add algorithm
dc.subject	Pitch synchronous
dc.subject	Proportional frequency compression
dc.subject	Signal-processing
dc.subject	Speech perception
dc.subject	Temporal fine structure
dc.subject	Vocoder simulation
dc.subject	Cochlear implants
dc.title	Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing

Collections

Journal Articles
