Conference Papers

Permanent URI for this collection: https://idr.nitk.ac.in/handle/123456789/28506

  • Item
    Enhancing Speech De-Identification with LLM-Based Data Augmentation
    (Institute of Electrical and Electronics Engineers Inc., 2024) Dhingra, P.; Agrawal, S.; Veerappan, C.S.; Chng, E.S.; Tong, R.
    This paper addresses the challenge of data scarcity in speech de-identification by introducing a novel, fully automated data augmentation method leveraging large language models. Our approach overcomes the limitations of human annotation, enabling the creation of extensive training datasets. To enhance de-identification performance, we compare pipeline and end-to-end models. While the pipeline approach sequentially applies speech recognition and named entity recognition, the end-to-end model jointly learns these tasks. Experimental results demonstrate the effectiveness of our data augmentation strategy and the superiority of the end-to-end model in improving PII detection accuracy and robustness. © 2024 IEEE.
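The pipeline approach described in the abstract (speech recognition followed by named entity recognition) can be illustrated with a minimal sketch. This is not the authors' implementation: `transcribe` is a hypothetical stand-in for a real speech recognizer, the regex patterns are a toy substitute for a trained NER model, and the sample utterance is invented for illustration only.

```python
import re

# Hypothetical stand-in for a speech recognizer (assumption, not from the paper):
# it maps an utterance id to a made-up transcript.
def transcribe(audio_id: str) -> str:
    fake_transcripts = {
        "utt1": "my name is john smith and my number is 555 0199",
    }
    return fake_transcripts[audio_id]

# Toy substitute for a trained NER model: regex patterns for two PII types.
PII_PATTERNS = {
    "NAME": re.compile(r"(?<=my name is )\w+ \w+"),
    "PHONE": re.compile(r"\b\d{3} \d{4}\b"),
}

def tag_pii(text: str) -> list[tuple[int, int, str]]:
    """Return (start, end, label) spans for each PII match, sorted by position."""
    spans = []
    for label, pattern in PII_PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), label))
    return sorted(spans)

def redact(text: str, spans: list[tuple[int, int, str]]) -> str:
    """Replace each tagged span with a placeholder such as [NAME]."""
    pieces = []
    cursor = 0
    for start, end, label in spans:
        pieces.append(text[cursor:start])
        pieces.append(f"[{label}]")
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)

def deidentify_pipeline(audio_id: str) -> str:
    """Stage 1: recognize speech. Stage 2: tag PII in the transcript and redact it."""
    transcript = transcribe(audio_id)
    return redact(transcript, tag_pii(transcript))

print(deidentify_pipeline("utt1"))
```

Because the second stage only sees the transcript, any recognition error in the first stage carries over into PII tagging; an end-to-end model, by contrast, learns both tasks jointly.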