Speech de-identification data augmentation leveraging large language model

Dhingra, P.; Agrawal, S.; Veerappan, C.S.; Ho, T.N.; Chng, E.S.; Tong, R.

Speech de-identification data augmentation leveraging large language model

dc.contributor.author	Dhingra, P.
dc.contributor.author	Agrawal, S.
dc.contributor.author	Veerappan, C.S.
dc.contributor.author	Ho, T.N.
dc.contributor.author	Chng, E.S.
dc.contributor.author	Tong, R.
dc.date.accessioned	2026-02-06T06:33:56Z
dc.date.issued	2024
dc.description.abstract	This work addresses the challenge of limited real-world speech data in speech de-identification, the process of removing Personally Identifiable Information (PII). We formulate speech de-identification as a named entity recognition (NER) task specifically for spoken English. To overcome data scarcity and enhance NER performance, we propose a data augmentation approach. This approach leverages a large language model to generate synthetic speech style text data enriched with diverse PII entities. The generated data undergoes an iterative process using a customized NER model for semi-automatic PII annotation. Our analysis demonstrates the effectiveness of this data augmentation strategy in significantly improving NER performance on spoken language text. Furthermore, to gain deeper insights into the specific errors made during NER, we employ performance analysis using alternative evaluation metrics. Â© 2024 IEEE.
dc.identifier.citation	Proceedings of 2024 International Conference on Asian Language Processing, IALP 2024, 2024, Vol., , p. 97-102
dc.identifier.uri	https://doi.org/10.1109/IALP63756.2024.10661176
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/28941
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.subject	data augmentation
dc.subject	large language model
dc.subject	named entity recognition
dc.subject	speech de-identification
dc.title	Speech de-identification data augmentation leveraging large language model

Collections

Conference Papers

Speech de-identification data augmentation leveraging large language model

Files

Collections