Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 9 of 9
  • Item
    A supervised learning approach for ICU mortality prediction based on unstructured electrocardiogram text reports
    (Springer Verlag service@springer.de, 2018) S. Krishnan, G.S.; Kamath S․, S.
    Extracting patient data documented in text-based clinical records into a structured form is a predominantly manual process, both time and cost-intensive. Moreover, structured patient records often fail to effectively capture the nuances of patient-specific observations noted in doctors’ unstructured clinical notes and diagnostic reports. Automated techniques that utilize such unstructured text reports for modeling useful clinical information for supporting predictive analytics applications can thus be highly beneficial. In this paper, we propose a neural network based method for predicting mortality risk of ICU patients using unstructured Electrocardiogram (ECG) text reports. Word2Vec word embedding models were adopted for vectorizing and modeling textual features extracted from the patients’ reports. An unsupervised data cleansing technique for identification and removal of anomalous data/special cases was designed for optimizing the patient data representation. Further, a neural network model based on Extreme Learning Machine architecture was proposed for mortality prediction. ECG text reports available in the MIMIC-III dataset were used for experimental validation. The proposed model when benchmarked against four standard ICU severity scoring methods, outperformed all by 10–13%, in terms of prediction accuracy. © 2018, Springer International Publishing AG, part of Springer Nature.
  • Item
    A Supervised Approach for Patient-Specific ICU Mortality Prediction Using Feature Modeling
    (Springer Verlag service@springer.de, 2019) S. Krishnan, G.S.; Kamath S․, S.K.
    Intensive Care Units (ICUs) are one of the most essential, but expensive healthcare services provided in hospitals. Modern monitoring machines in critical care units continuously generate huge amount of data, which can be used for intelligent decision-making. Prediction of mortality risk of patients is one such predictive analytics application, which can assist hospitals and healthcare personnel in making informed decisions. Traditional scoring systems currently in use are parametric scoring methods which often suffer from low accuracy. In this paper, an empirical study on the effect of feature selection on the feature set of traditional scoring methods for modeling an optimal feature set to represent each patient’s profile along with a supervised learning approach for ICU mortality prediction have been presented. Experimental evaluation of the proposed approach in comparison to standard severity scores like SAPS-II, SOFA and OASIS showed that the proposed model outperformed them by a margin of 12–16% in terms of prediction accuracy. © 2019, Springer Nature Singapore Pte Ltd.
  • Item
    Diagnostic Code Group Prediction by Integrating Structured and Unstructured Clinical Data
    (Springer Science and Business Media Deutschland GmbH, 2021) Prabhakar, A.; Shidharth, S.; S. Krishnan, G.S.; Kamath S․, S.
    Diagnostic coding is a process by which written, verbal and other patient-case related documentation are used for enabling disease prediction, accurate documentation, and insurance settlements. It is a prevalently manual process even in countries that have successfully adopted Electronic Health Record (EHR) systems. The problem is exacerbated in developing countries where widespread adoption of EHR systems is still not at par with Western counterparts. EHRs contain a wealth of patient information embedded in numerical, text, and image formats. A disease prediction model that exploits all this information, enabling accurate and faster diagnosis would be quite beneficial. We address this challenging task by proposing mixed ensemble models consisting of boosting and deep learning architectures for the task of diagnostic code group prediction. The models are trained on a dataset created by integrating features from structured (lab test reports) as well as unstructured (clinical text) data. We analyze the proposed model’s performance on MIMIC-III, an open dataset of clinical data using standard multi-label metrics. Empirical evaluations underscored the significant performance of our approach for this task, compared to state-of-the-art works which rely on a single data source. Our novelty lies in effectively integrating relevant information from both data sources thereby ensuring larger ICD-9 code coverage, handling the inherent class imbalance, and adopting a novel approach to form the ensemble models. © 2021, Springer Nature Switzerland AG.
  • Item
    A novel GA-ELM model for patient-specific mortality prediction over large-scale lab event data
    (Elsevier Ltd, 2019) S. Krishnan, G.; Kamath S?, S.
    Patient-specific mortality prediction models are an essential component of Clinical Decision Support Systems developed for caregivers in Intensive Care Units (ICUs), that enable timely decisions towards effective patient care and optimized ICU resource management. While high prediction accuracy is a fundamental requirement for any mortality prediction application, being able to so with minimal patient-specific data is a major plus point that can help in improving care delivery and cost optimization. Most existing scoring techniques and prediction models utilize a multitude of lab tests and patient events to predict mortality and also suffer from reduced performance when available patient data is less. In this paper, a Genetic Algorithm based Wrapper Feature Selection technique is proposed for determining most-optimal lab events that contribute predominantly to mortality, even for large-scale patient cohorts. Using this, an Extreme Learning Machine (ELM) based neural network is designed for predicting patient-specific ICU mortality. The proposed GA-ELM model was benchmarked against four popular traditional mortality scores and also state-of-the-art machine learning models for experimental validation. The GA-ELM model achieved promising results as it outperformed the traditional scoring systems by 11%–29% and state-of-the-art models by up to 14%, in terms of AUROC. © 2019 Elsevier B.V.
  • Item
    Predicting ICD-9 code groups with fuzzy similarity based supervised multi-label classification of unstructured clinical nursing notes
    (Elsevier B.V., 2020) Gangavarapu, T.; Jayasimha, A.; S. Krishnan, G.S.; Kamath S?, S.
    In hospitals, caregivers are trained to chronicle the subtle changes in the clinical conditions of a patient at regular intervals, for enabling decision-making. Caregivers’ text-based clinical notes are a significant source of rich patient-specific data, that can facilitate effective clinical decision support, despite which, this treasure-trove of data remains largely unexplored for supporting the prediction of clinical outcomes. The application of sophisticated data modeling and prediction algorithms with greater computational capacity have made disease prediction from raw clinical notes a relevant problem. In this paper, we propose an approach based on vector space and topic modeling, to structure the raw clinical data by capturing the semantic information in the nursing notes. Fuzzy similarity based data cleansing approach was used to merge anomalous and redundant patient data. Furthermore, we utilize eight supervised multi-label classification models to facilitate disease (ICD-9 code group) prediction. We present an exhaustive comparative study to evaluate the performance of the proposed approaches using standard evaluation metrics. Experimental validation on MIMIC-III, an open database, underscored the superior performance of the proposed Term weighting of unstructured notes AGgregated using fuzzy Similarity (TAGS) model, which consistently outperformed the state-of-the-art structured data based approach by 7.79% in AUPRC and 1.24% in AUROC. © 2019 Elsevier B.V.
  • Item
    FarSight: Long-Term Disease Prediction Using Unstructured Clinical Nursing Notes
    (IEEE Computer Society, 2021) Gangavarapu, T.; S. Krishnan, G.S.; Kamath S?, S.; Jeganathan, J.
    Accurate risk stratification using patient data is a vital task in channeling prioritized care. Most state-of-the-art models are predominantly reliant on digitized data in the form of structured Electronic Health Records (EHRs). Those models overlook the valuable patient-specific information embedded in unstructured clinical notes, which is the prevalent medium employed by caregivers to record patients' disease timeline. The availability of such patient-specific data presents an unprecedented opportunity to build intelligent systems that provide exclusive insights into patients' disease physiology. Moreover, very few works have attempted to benchmark the performance of deep neural architectures against the state-of-the-art models on publicly available datasets. This article presents significant observations from our benchmarking experiments on the applicability of deep learning models for the clinical task of ICD-9 code group prediction. We present FarSight, a long-term aggregation mechanism intended to recognize the onset of the disease with the earliest detected symptoms. Vector space and topic modeling approaches are utilized to capture the semantic information in the patient representations. Experiments on MIMIC-III database underscored the superior performance of the proposed models built on unstructured data when compared to structured EHR based state-of-the-art model, achieving an improvement of 19.34 percent in AUPRC and 5.41 percent in AUROC. © 2013 IEEE.
  • Item
    Multi-scale convolutional neural network for accurate corneal segmentation in early detection of fungal keratitis
    (MDPI, 2021) Mayya, V.; Kamath S?, S.; Kulkarni, U.; Hazarika, M.; Barua, P.D.; Acharya, U.R.
    Microbial keratitis is an infection of the cornea of the eye that is commonly caused by prolonged contact lens wear, corneal trauma, pre-existing systemic disorders and other ocular surface disorders. It can result in severe visual impairment if improperly managed. According to the latest World Vision Report, at least 4.2 million people worldwide suffer from corneal opacities caused by infectious agents such as fungi, bacteria, protozoa and viruses. In patients with fungal keratitis (FK), often overt symptoms are not evident, until an advanced stage. Furthermore, it has been reported that clear discrimination between bacterial keratitis and FK is a challenging process even for trained corneal experts and is often misdiagnosed in more than 30% of the cases. However, if diagnosed early, vision impairment can be prevented through early cost-effective interventions. In this work, we propose a multi-scale convolutional neural network (MS-CNN) for accurate segmentation of the corneal region to enable early FK diagnosis. The proposed approach consists of a deep neural pipeline for corneal region segmentation followed by a ResNeXt model to differentiate between FK and non-FK classes. The model trained on the segmented images in the region of interest, achieved a diagnostic accuracy of 88.96%. The features learnt by the model emphasize that it can correctly identify dominant corneal lesions for detecting FK. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
  • Item
    An empirical study of preprocessing techniques with convolutional neural networks for accurate detection of chronic ocular diseases using fundus images
    (Springer, 2023) Mayya, V.; Kamath S․, S.K.; Kulkarni, U.; Surya, D.K.; Acharya, U.R.
    Chronic Ocular Diseases (COD) such as myopia, diabetic retinopathy, age-related macular degeneration, glaucoma, and cataract can affect the eye and may even lead to severe vision impairment or blindness. According to a recent World Health Organization (WHO) report on vision, at least 2.2 billion individuals worldwide suffer from vision impairment. Often, overt signs indicative of COD do not manifest until the disease has progressed to an advanced stage. However, if COD is detected early, vision impairment can be avoided by early intervention and cost-effective treatment. Ophthalmologists are trained to detect COD by examining certain minute changes in the retina, such as microaneurysms, macular edema, hemorrhages, and alterations in the blood vessels. The range of eye conditions is diverse, and each of these conditions requires a unique patient-specific treatment. Convolutional neural networks (CNNs) have demonstrated significant potential in multi-disciplinary fields, including the detection of a variety of eye diseases. In this study, we combined several preprocessing approaches with convolutional neural networks to accurately detect COD in eye fundus images. To the best of our knowledge, this is the first work that provides a qualitative analysis of preprocessing approaches for COD classification using CNN models. Experimental results demonstrate that CNNs trained on the region of interest segmented images outperform the models trained on the original input images by a substantial margin. Additionally, an ensemble of three preprocessing techniques outperformed other state-of-the-art approaches by 30% and 3%, in terms of Kappa and F1 scores, respectively. The developed prototype has been extensively tested and can be evaluated on more comprehensive COD datasets for deployment in the clinical setup. © 2022, The Author(s).
  • Item
    Content-based medical retrieval systems with evidence-based diagnosis for enhanced clinical decision support
    (Elsevier Ltd, 2025) Karthik, K.; S, S.K.; Supreetha, S.; Katlam, A.
    In the medical field, making accurate decisions during treatment is crucial. Incorrect decisions can lead to misdiagnoses, resulting in patient mismanagement and severe consequences. Clinical Decision Support Systems (CDSS) are essential in aiding doctors with critical medical decisions by providing precise and informative diagnostic recommendations. Despite the extensive availability of both textual and graphical electronic health records (EHR), current systems often fail to fully utilize all available data. Most systems rely predominantly on textual patient reports, while integrating findings from medical images is vital for accurate diagnoses. To address this gap, we propose an advanced system that incorporates medical image classification using a Content-Based Medical Image Retrieval (CBMIR) system in CDSSs, to enable evidence-based diagnosis. The proposed system leverages advanced AI algorithms to improve disease localization, recognition, and classification, of specific thoracic diseases using X-ray medical images that can be used for other imaging modalities like MRIs, and CT scans by the CDSS in future. The system also incorporates classification-based image filtering and tree-based similarity matching algorithms for optimized retrieval performance. The system achieved promising performance with a Mean Average Precision of 0.66?0.85 for Top-5 retrieval with time complexity of O(logn). © 2025 Elsevier Ltd