Predicting ICD-9 code groups with fuzzy similarity based supervised multi-label classification of unstructured clinical nursing notes

dc.contributor.authorGangavarapu, T.
dc.contributor.authorJayasimha, A.
dc.contributor.authorS. Krishnan, G.S.
dc.contributor.authorKamath S, S.
dc.date.accessioned2026-02-05T09:28:53Z
dc.date.issued2020
dc.description.abstractIn hospitals, caregivers are trained to chronicle subtle changes in a patient’s clinical condition at regular intervals to enable decision-making. Caregivers’ text-based clinical notes are a significant source of rich patient-specific data that can facilitate effective clinical decision support, yet this treasure trove of data remains largely unexplored for predicting clinical outcomes. The application of sophisticated data modeling and prediction algorithms with greater computational capacity has made disease prediction from raw clinical notes a relevant problem. In this paper, we propose an approach based on vector space and topic modeling to structure the raw clinical data by capturing the semantic information in the nursing notes. A fuzzy similarity-based data cleansing approach is used to merge anomalous and redundant patient data. Furthermore, we utilize eight supervised multi-label classification models to facilitate disease (ICD-9 code group) prediction. We present an exhaustive comparative study to evaluate the performance of the proposed approaches using standard evaluation metrics. Experimental validation on MIMIC-III, an open database, underscored the superior performance of the proposed Term weighting of unstructured notes AGgregated using fuzzy Similarity (TAGS) model, which consistently outperformed the state-of-the-art structured data based approach by 7.79% in AUPRC and 1.24% in AUROC. © 2019 Elsevier B.V.
dc.identifier.citationKnowledge-Based Systems, 2020, 190
dc.identifier.issn0950-7051
dc.identifier.urihttps://doi.org/10.1016/j.knosys.2019.105321
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/24043
dc.publisherElsevier B.V.
dc.subjectArtificial intelligence
dc.subjectClassification (of information)
dc.subjectDecision making
dc.subjectDecision support systems
dc.subjectForecasting
dc.subjectFuzzy logic
dc.subjectHospital data processing
dc.subjectLearning algorithms
dc.subjectLearning systems
dc.subjectNatural language processing systems
dc.subjectNursing
dc.subjectSemantics
dc.subjectVector spaces
dc.subjectClinical decision support
dc.subjectClinical decision support systems
dc.subjectComputational capacity
dc.subjectExperimental validations
dc.subjectModeling and predictions
dc.subjectMulti label classification
dc.subjectNatural language processing
dc.subjectSemantic information
dc.subjectCodes (symbols)
dc.titlePredicting ICD-9 code groups with fuzzy similarity based supervised multi-label classification of unstructured clinical nursing notes
