Conference Papers
Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/28506
Browse
2 results
Search Results
Item Speech enhancement using multiple deep neural networks(Institute of Electrical and Electronics Engineers Inc., 2018) Karjol, P.; Kumar, M.A.; Ghosh, P.K.In this work, we present a variant of multiple deep neural network (DNN) based speech enhancement method. We directly estimate clean speech spectrum as a weighted average of outputs from multiple DNNs. The weights are provided by a gating network. The multiple DNNs and the gating network are trained jointly. The objective function is set as the mean square logarithmic error between the target clean spectrum and the estimated spectrum. We conduct experiments using two and four DNNs using the TIMIT corpus with nine noise types (four seen noises and five unseen noises) taken from the AURORA database at four different signal-to-noise ratios (SNRs). We also compare the proposed method with a single DNN based speech enhancement scheme and existing multiple DNN schemes using segmental SNR, perceptual evaluation of speech quality (PESQ) and short-term objective intelligibility (STOI) as the evaluation metrics. These comparisons show the superiority of proposed method over baseline schemes in both seen and unseen noises. Specifically, we observe an absolute improvement of 0.07 and 0.04 in PESQ measure compared to single DNN when averaged over all noises and SNRs for seen and unseen noise cases respectively. © 2018 IEEE.Item Speech Intelligibility Enhancement for Cochlear Implant using Multi-Objective Deep Denoising Autoencoder(Institute of Electrical and Electronics Engineers Inc., 2023) Vishnu, B.U.P.; Poluboina, V.; Sushma, B.; Pulikala, A.This study introduces a novel technique for enhancing the performance of deep denoising autoencoders (DDAE) in speech processing for cochlear implants (CIs). For individuals with hearing loss, cochlear implants are electronic devices that help to restore their ability to hear. However, the performance of CIs speech intelligibility in the noisy environment is limited. One of the most commonly used methods for reducing noise in CIs is through a preprocessing technique called deep denoising autoencoder. DDAE models have shown potential in learning various noise patterns, but their performance in enhancing speech intelligibility is relatively low due to a ineffective objective function. To address this limitation, this study proposes a multi-objective technique to fine-tune the DDAE model. When multiple objectives are optimized simultaneously, the model becomes more robust and better at handling real-time noise. Based on the experimental findings, it has been confirmed that the proposed multi-objective learning technique performs better than other models when it comes to speech intelligibility. Furthermore, the enhanced signal is presented to the acoustic cochlear implant simulator to evaluate the improvement of speech intelligibility in CIs. © 2023 IEEE.
