Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 3 of 3
  • Item
    Stroke classification from computed tomography scans using 3D convolutional neural network
    (Elsevier Ltd, 2022) Neethi, A.S.; Niyas, S.; Kannath, S.K.; Mathew, J.; Anzar, A.M.; Rajan, J.
    Stroke is a cerebrovascular condition with a significant morbidity and mortality rate and causes physical disabilities for survivors. Once the symptoms are identified, it requires a time-critical diagnosis with the help of the most commonly available imaging techniques. Computed tomography (CT) scans are used worldwide for preliminary stroke diagnosis. It demands the expertise and experience of a radiologist to identify the stroke type, which is critical for initiating the treatment. This work attempts to gather those domain skills and build a model from CT scans to diagnose stroke. The non-contrast computed tomography (NCCT) scan of the brain comprises volumetric images or a 3D stack of image slices. So, a model that aims to solve the problem by targeting a 2D slice may fail to address the volumetric nature. We propose a 3D-based fully convolutional classification model to identify stroke cases from CT images that take into account the contextual longitudinal composition of volumetric data. We formulate a custom pre-processing module to enhance the scans and aid in improving the classification performance. Some of the significant challenges faced by 3D CNN are the less number of training samples, and the number of scans is mostly biased in favor of normal patients. In this work, the limitation of insufficient training volume and class imbalanced data have been rectified with the help of a strided slicing approach. A block-wise design was used to formulate the proposed network, with the initial part focusing on adjusting the dimensionality, at the same time retaining the features. Later on, the accumulated feature maps were effectively learned utilizing bundled convolutions and skip connections. The results of the proposed method were compared against 3D CNN stroke classification models on NCCT, various 3D CNN architectures on other brain imaging modalities, and 3D extensions of some of the classical CNN architectures. The proposed method achieved an improvement of 14.28% in the F1-score over the state-of-the-art 3D CNN stroke classification model. © 2022 Elsevier Ltd
  • Item
    StrokeViT with AutoML for brain stroke classification
    (Elsevier Ltd, 2023) Raj, R.; Mathew, J.; Kannath, S.K.; Rajan, J.
    Stroke, categorized under cardiovascular and circulatory diseases, is considered the second foremost cause of death worldwide, causing approximately 11% of deaths annually. Stroke diagnosis using a Computed Tomography (CT) scan is considered ideal for identifying whether the stroke is hemorrhagic or ischemic. However, most methods for stroke classification are based on a single slice-level prediction mechanism, meaning that the most imperative CT slice has to be manually selected by the radiologist from the original CT volume. This paper proposes an integration of Convolutional Neural Network (CNN), Vision Transformers (ViT), and AutoML to obtain slice-level predictions as well as patient-wise prediction results. While the CNN with inductive bias captures local features, the transformer captures long-range dependencies between sequences. This collaborative local-global feature extractor improves upon the slice-wise predictions of the CT volume. We propose stroke-specific feature extraction from each slice-wise prediction to obtain the patient-wise prediction using AutoML. While the slice-wise predictions helps the radiologist to verify close and corner cases, the patient-wise predictions makes the outcome clinically relevant and closer to real-world scenario. The proposed architecture has achieved an accuracy of 87% for single slice-level prediction and an accuracy of 92% for patient-wise prediction. For comparative analysis of slice-level predictions, standalone architectures of VGG-16, VGG-19, ResNet50, and ViT were considered. The proposed architecture has outperformed the standalone architectures by 9% in terms of accuracy. For patient-wise predictions, AutoML considers 13 different ML algorithms, of which 3 achieve an accuracy of more than 90%. The proposed architecture helps in reducing the manual effort by the radiologist to manually select the most imperative CT from the original CT volume and shows improvement over other standalone architectures for classification tasks. The proposed architecture can be generalized for volumetric scans aiding in the patient diagnosis of head and neck, lungs, diseases of hepatobiliary tract, genitourinary diseases, women's imaging including breast cancer and various musculoskeletal diseases. Code for proposed stroke-specific feature extraction with the pre-trained weights of the trained model is available at: https://github.com/rishiraj-cs/StrokeViT_With_AutoML. © 2022 Elsevier Ltd
  • Item
    Multilevel Multimodal Framework for Automatic Collateral Scoring in Brain Stroke
    (Institute of Electrical and Electronics Engineers Inc., 2024) Raj, R.; Dayananda, D.; Gupta, A.; Mathew, J.; Kannath, S.K.; Prakash, A.; Rajan, J.
    In patients with ischemic brain stroke, collateral circulation plays a crucial role in selecting patients suitable for endovascular therapy. The presence of well-developed collaterals improves the patient's chances of recovery. In clinical practice, the presence of collaterals is diagnosed on a Computed Tomography Angiography scan. The radiologist grades it on the basis of subjective visual assessment, which is prone to interobserver and intraobserver variability. Computer-based methods of collateral assessment face the challenge of non-uniform scan volume, leading to manual selection of slices, meaning that the most imperative slices have to be manually selected by the radiologist. This paper proposes a multilevel multimodal hierarchical framework for automated collateral scoring. Specifically, we propose deploying a Convolutional Neural Network for image selection based on the visibility of collaterals and a multimodal model for comparing the occluded and contralateral sides of the brain for collateral scoring. We also generate a patient-level prediction by integrating automated machine learning in the proposed framework. While the proposed multimodal predictor contributes to Artificial Intelligence, the proposed end-to-end framework is an application in engineering. The proposed framework has been trained and tested on 116 patients, with five-fold cross-validation, achieving an accuracy of 91.17% for multi-class collateral scores and 94.118% for binary class collateral scores. The proposed multimodal predictor achieved a weighted F1 score of 0.86 and 0.95 on multi-class and binary-class collateral scores, respectively. The proposed framework is fast, efficient, and scalable for real-world deployments. Automated evaluation of collaterals with attention maps for explainability would complement radiologists' efforts. Code for the proposed framework is available at: https://github.com/rishiraj-cs/collaterals_ML_MM. © 2013 IEEE.