Faculty Publications
Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736
Publications by NITK Faculty
Browse
8 results
Search Results
Item A Lightweight Convolutional Neural Network Model for Tuberculosis Bacilli Detection From Microscopic Sputum Smear Images(wiley, 2021) Panicker, R.O.; Pawan, S.J.; Rajan, J.; Sabu, M.K.This chapter describes a lightweight convolutional neural network model that automatically detects Tuberculosis (TB) bacilli from sputum smear microscopic images. According to WHO, about onefourth of the population in the universe is infected with TB, and every day five thousand people are killed due to TB disease. There are well-known recommended diagnostics are available for TB detection, among them sputum smear microscopic examination is a primary and most efficient recommended method for most of the developing and moderately developed countries. However, this manual detection method is highly error-prone and time-consuming. In this chapter, we proposed a lightweight CNN model for classifying Tuberculosis bacilli from non-bacilli objects. We adopted a Convolutional Neural Network (CNN) architecture with a skip connection of variable lengths that can identify TB bacilli from sputum smear microscopic images. The performance of the proposed model in terms of accuracy is close to the state-of-the-art. However, the number of parameters in the proposed model is significantly less than other recently proposed models. © 2021 Scrivener Publishing LLC.Item An Automated Approach for Screening COVID-19 from Thermal Images Using Convolutional Neural Network(Springer Science and Business Media Deutschland GmbH, 2022) Srivastava, D.K.; Pawan, S.J.; Rajan, J.The world has seen the disastrous effect caused by COVID-19 on humankind. The rapidity of COVID-19 transmission, re-infections, post-COVID-19 symptoms, and the emergence of new COVID-19 strands have disrupted the global healthcare systems. Consequently, screening for COVID-19 cases has become of the utmost importance. As temperature and mask checks help significantly to prevent the rapid spread of COVID-19, automating this process in public places has become indispensable. In this work, we propose an end-to-end approach for mask detection followed by temperature for efficient screening. The proposed model achieved 93.5%, 96.7%, and 97.7% precision, recall, and mAP when trained on the thermal surveillance dataset and tested on a lightning dataset consisting of images with varying intensities. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.Item Cross Task Temporal Consistency for Semi-supervised Medical Image Segmentation(Springer Science and Business Media Deutschland GmbH, 2022) Jeevan, G.; Pawan, S.J.; Rajan, J.Semi-supervised deep learning for medical image segmentation is an intriguing area of research as far as the requirement for an adequate amount of labeled data is concerned. In this context, we propose Cross Task Temporal Consistency, a novel Semi-Supervised Learning framework that combines a self-ensembled learning strategy with cross-consistency constraints derived from the implicit perturbations between the incongruous tasks of multi-headed architectures. More specifically, the Signed Distance Map output of a teacher model is transformed to an approximate segmentation map which acts as a pseudo target for the student model. Simultaneously, the teacher’s segmentation task output is utilized as the objective for the student’s Signed Distance Map derived segmentation output. Our proposed framework is intuitively simple and can be plugged into existing segmentation architectures with minimal computational overhead. Our work focuses on improving the segmentation performance in very low-labeled data proportions and has demonstrated marked superiority in performance and stability over existing SSL techniques, as evidenced through extensive evaluations on two standard datasets: ACDC and LA. © 2022, Springer Nature Switzerland AG.Item Multi-Res-Attention UNet: A CNN Model for the Segmentation of Focal Cortical Dysplasia Lesions from Magnetic Resonance Images(Institute of Electrical and Electronics Engineers Inc., 2021) Thomas, E.; Pawan, S.J.; Kumar, S.; Horo, A.; Niyas, S.; Vinayagamani, S.; Kesavadas, C.; Rajan, J.In this work, we have focused on the segmentation of Focal Cortical Dysplasia (FCD) regions from MRI images. FCD is a congenital malformation of brain development that is considered as the most common causative of intractable epilepsy in adults and children. To our knowledge, the latest work concerning the automatic segmentation of FCD was proposed using a fully convolutional neural network (FCN) model based on UNet. While there is no doubt that the model outperformed conventional image processing techniques by a considerable margin, it suffers from several pitfalls. First, it does not account for the large semantic gap of feature maps passed from the encoder to the decoder layer through the long skip connections. Second, it fails to leverage the salient features that represent complex FCD lesions and suppress most of the irrelevant features in the input sample. We propose Multi-Res-Attention UNet; a novel hybrid skip connection-based FCN architecture that addresses these drawbacks. Moreover, we have trained it from scratch for the detection of FCD from 3 T MRI 3D FLAIR images and conducted 5-fold cross-validation to evaluate the model. FCD detection rate (Recall) of 92% was achieved for patient wise analysis. © 2013 IEEE.Item Capsule Network–based architectures for the segmentation of sub-retinal serous fluid in optical coherence tomography images of central serous chorioretinopathy(Springer Science and Business Media Deutschland GmbH, 2021) Pawan, S.J.; Sankar, R.; Jain, A.; Jain, M.; Darshan, D.V.; Anoop, B.N.; Kothari, A.R.; Venkatesan, M.; Rajan, J.Central serous chorioretinopathy (CSCR) is a chorioretinal disorder of the eye characterized by serous detachment of the neurosensory retina at the posterior pole of the eye. CSCR results from the accumulation of subretinal fluid (SRF) due to idiopathic defects at the level of the retinal pigment epithelial (RPE) that allows serous fluid from the choriocapillaris to diffuse into the subretinal space between RPE and neurosensory retinal layers. This condition is presently investigated by clinicians using invasive angiography or non-invasive optical coherence tomography (OCT) imaging. OCT images provide a representation of the fluid underlying the retina, and in the absence of automated segmentation tools, currently only a qualitative assessment of the same is used to follow the progression of the disease. Automated segmentation of the SRF can prove to be extremely useful for the assessment of progression and for the timely management of CSCR. In this paper, we adopt an existing architecture called SegCaps, which is based on the recently introduced Capsule Networks concept, for the segmentation of SRF from CSCR OCT images. Furthermore, we propose an enhancement to SegCaps, which we have termed as DRIP-Caps, that utilizes the concepts of Dilation, Residual Connections, Inception Blocks, and Capsule Pooling to address the defined problem. The proposed model outperforms the benchmark UNet architecture while reducing the number of trainable parameters by 54.21%. Moreover, it reduces the computation complexity of SegCaps by reducing the number of trainable parameters by 37.85%, with competitive performance. The experiments demonstrate the generalizability of the proposed model, as evidenced by its remarkable performance even with a limited number of training samples. [Figure not available: see fulltext.]. © 2021, International Federation for Medical and Biological Engineering.Item Semi-supervised structure attentive temporal mixup coherence for medical image segmentation(Elsevier B.V., 2022) Pawan, S.J.; Jeevan, G.; Rajan, J.Deep convolutional neural networks have shown eminent performance in medical image segmentation in supervised learning. However, this success is predicated on the availability of large volumes of pixel-level labeled data, making these approaches impractical when labeled data is scarce. On the other hand, semi-supervised learning utilizes pertinent information from unlabeled data along with minimal labeled data, alleviating the demand for labeled data. In this paper, we leverage the mixup-based risk minimization operator in a student–teacher-based semi-supervised paradigm along with structure-aware constraints to enforce consistency coherence among the student predictions for unlabeled samples and the teacher predictions for the corresponding mixup sample by significantly diminishing the need for labeled data. Besides, due to the intrinsic simplicity of the linear combination operation used for generating mixup samples, the proposed method stands at a computational advantage over existing consistency regularization-based SSL methods. We experimentally validate the performance of the proposed model on two public benchmark datasets, namely the Left Atrial (LA) and Automatic Cardiac Diagnosis Challenge (ACDC) datasets. Notably, on the LA dataset's lowest labeled data set-up (5%), the proposed method significantly improved the Dice Similarity Coefficient and the Jaccard Similarity Coefficient by 1.08% and 1.46%, respectively. Furthermore, we demonstrate the efficacy of the proposed method with a consistent improvement across various labeled data proportions on the aforementioned datasets. © 2022 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of SciencesItem WideCaps: a wide attention-based capsule network for image classification(Springer Science and Business Media Deutschland GmbH, 2023) Pawan, S.J.; Sharma, R.; Reddy, H.; Vani, M.; Rajan, J.The capsule network is a distinct and promising segment of the neural network family that has drawn attention due to its unique ability to maintain equivariance by preserving spatial relationships among the features. The capsule network has attained unprecedented success in image classification with datasets such as MNIST and affNIST by encoding the characteristic features into capsules and building a parse-tree structure. However, on datasets involving complex foreground and background regions, such as CIFAR-10 and CIFAR-100, the performance of the capsule network is suboptimal due to its naive data routing policy and incompetence in extracting complex features. This paper proposes a new design strategy for capsule network architectures for efficiently dealing with complex images. The proposed method incorporates the optimal placement of the novel wide bottleneck residual block and squeeze and excitation Attention Blocks into the capsule network upheld by the modified factorized machines routing algorithm to address the defined problem. This setup allows channel interdependencies at almost no computational cost, thereby enhancing the representation ability of capsules on complex images. We extensively evaluate the performance of the proposed model on the five publicly available datasets, namely the CIFAR-10, Fashion MNIST, Brain Tumor, SVHN, and the CIFAR-100 datasets. The proposed method outperformed the top-5 capsule network-based methods on Fashion MNIST, CIFAR-10, SVHN, Brain Tumor, and gave a highly competitive performance on the CIFAR-100 datasets. © 2023, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.Item A Dual-Stage Semi-Supervised Pre-Training Approach for Medical Image Segmentation(Institute of Electrical and Electronics Engineers Inc., 2024) Aralikatti, R.C.; Pawan, S.J.; Rajan, J.Deep neural networks have played a vital role in developing automated methods for addressing medical image segmentation. However, their reliance on labeled data impedes the practicability. Semi-Supervised learning is gaining attention for its intrinsic ability to extract valuable information from labeled and unlabeled data with improved performance. Recently, consistency regularization methods have gained interest due to their efficient learning procedures. They are, however, confined to data or network-level perturbations, negating the benefit of having both forms in a single framework. In light of this, we ask an intriguing but unexplored question: Can we have both network-level and data-level perturbation in the semi-supervised framework? To this end, we present a holistic approach that integrates data-level perturbation in the model pre-training stage, followed by implicit network-level perturbation in the fine-tuning stage. Furthermore, we incorporate networks with manifold learning paradigms throughout the training to facilitate the formation of robust data representations by ensuring local and global semantic affinities adhering to the theory of consensus. Notably, this may be the first attempt in the semi-supervised medical image segmentation archetype to use data and network-level perturbation with a model pre-training strategy. We extensively validated the efficacy of the proposed framework on three benchmark datasets, namely the Automated Cardiac Diagnosis Challenge, ISIC-2018, and Left Atrial Segmentation Challenge datasets, subjected to severely low-sampled labeled data. Notably, in ACDC (4%), ISIC-2018 (5%), and LA (6%) labeled cases, the proposed method outperforms the second-best method by 2.95%, 1.31%, and 0.71% in the Dice Similarity Metric. © 2023 IEEE.
