Journal Articles
Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/19884
Browse
9 results
Search Results
Item LiverNet: efficient and robust deep learning model for automatic diagnosis of sub-types of liver hepatocellular carcinoma cancer from H&E stained liver histopathology images(Springer Science and Business Media Deutschland GmbH, 2021) Aatresh, A.A.; Alabhya, K.; Lal, S.; Kini, J.; Saxena, P.P.Purpose: Liver cancer is one of the most common types of cancers in Asia with a high mortality rate. A common method for liver cancer diagnosis is the manual examination of histopathology images. Due to its laborious nature, we focus on alternate deep learning methods for automatic diagnosis, providing significant advantages over manual methods. In this paper, we propose a novel deep learning framework to perform multi-class cancer classification of liver hepatocellular carcinoma (HCC) tumor histopathology images which shows improvements in inference speed and classification quality over other competitive methods. Method: The BreastNet architecture proposed by Togacar et al. shows great promise in using convolutional block attention modules (CBAM) for effective cancer classification in H&E stained breast histopathology images. As part of our experiments with this framework, we have studied the addition of atrous spatial pyramid pooling (ASPP) blocks to effectively capture multi-scale features in H&E stained liver histopathology data. We classify liver histopathology data into four classes, namely the non-cancerous class, low sub-type liver HCC tumor, medium sub-type liver HCC tumor, and high sub-type liver HCC tumor. To prove the robustness and efficacy of our models, we have shown results for two liver histopathology datasets—a novel KMC dataset and the TCGA dataset. Results: Our proposed architecture outperforms state-of-the-art architectures for multi-class cancer classification of HCC histopathology images, not just in terms of quality of classification, but also in computational efficiency on the novel proposed KMC liver data and the publicly available TCGA-LIHC dataset. We have considered precision, recall, F1-score, intersection over union (IoU), accuracy, number of parameters, and FLOPs as metrics for comparison. The results of our meticulous experiments have shown improved classification performance along with added efficiency. LiverNet has been observed to outperform all other frameworks in all metrics under comparison with an approximate improvement of 2 % in accuracy and F1-score on the KMC and TCGA-LIHC datasets. Conclusion: To the best of our knowledge, our work is among the first to provide concrete proof and demonstrate results for a successful deep learning architecture to handle multi-class HCC histopathology image classification among various sub-types of liver HCC tumor. Our method shows a high accuracy of 90.93 % on the proposed KMC liver dataset requiring only 0.5739 million parameters and 1.1934 million floating point operations per second. © 2021, CARS.Item Efficient deep learning architecture with dimension-wise pyramid pooling for nuclei segmentation of histopathology images(Elsevier Ltd, 2021) Aatresh, A.A.; Yatgiri, R.P.; Chanchal, A.K.; Kumar, A.; Ravi, A.; Das, D.; Raghavendra, B.S.; Lal, S.; Kini, J.Image segmentation remains to be one of the most vital tasks in the area of computer vision and more so in the case of medical image processing. Image segmentation quality is the main metric that is often considered with memory and computation efficiency overlooked, limiting the use of power hungry models for practical use. In this paper, we propose a novel framework (Kidney-SegNet) that combines the effectiveness of an attention based encoder-decoder architecture with atrous spatial pyramid pooling with highly efficient dimension-wise convolutions. The segmentation results of the proposed Kidney-SegNet architecture have been shown to outperform existing state-of-the-art deep learning methods by evaluating them on two publicly available kidney and TNBC breast H&E stained histopathology image datasets. Further, our simulation experiments also reveal that the computational complexity and memory requirement of our proposed architecture is very efficient compared to existing deep learning state-of-the-art methods for the task of nuclei segmentation of H&E stained histopathology images. The source code of our implementation will be available at https://github.com/Aaatresh/Kidney-SegNet. © 2021 Elsevier LtdItem Automated Molecular Subtyping of Breast Carcinoma Using Deep Learning Techniques(Institute of Electrical and Electronics Engineers Inc., 2023) Niyas, S.; Bygari, R.; Naik, R.; Viswanath, B.; Ugwekar, D.; Mathew, T.; Kavya, J.; Kini, J.R.; Rajan, J.Objective: Molecular subtyping is an important procedure for prognosis and targeted therapy of breast carcinoma, the most common type of malignancy affecting women. Immunohistochemistry (IHC) analysis is the widely accepted method for molecular subtyping. It involves the assessment of the four molecular biomarkers namely estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2), and antigen Ki67 using appropriate antibody reagents. Conventionally, these biomarkers are assessed manually by a pathologist, who finally combines individual results to identify the molecular subtype. Molecular subtyping necessitates the status of all the four biomarkers together, and to the best of our knowledge, no such automated method exists. This paper proposes a novel deep learning framework for automatic molecular subtyping of breast cancer from IHC images. Methods and procedures: A modified LadderNet architecture is proposed to segment the immunopositive elements from ER, PR, HER2, and Ki67 biomarker slides. This architecture uses long skip connections to pass encoder feature space from different semantic levels to the decoder layers, allowing concurrent learning with multi-scale features. The entire architecture is an ensemble of multiple fully convolutional neural networks, and learning pathways are chosen adaptively based on input data. The segmentation stage is followed by a post-processing stage to quantify the extent of immunopositive elements to predict the final status for each biomarker. Results: The performance of segmentation models for each IHC biomarker is evaluated qualitatively and quantitatively. Furthermore, the biomarker prediction results are also evaluated. The results obtained by our method are highly in concordance with manual assessment by pathologists. Clinical impact: Accurate automated molecular subtyping can speed up this pathology procedure, reduce pathologists' workload and associated costs, and facilitate targeted treatment to obtain better outcomes. © 2013 IEEE.Item A novel dataset and efficient deep learning framework for automated grading of renal cell carcinoma from kidney histopathology images(Nature Research, 2023) Chanchal, A.K.; Lal, S.; Kumar, R.; Kwak, J.T.; Kini, J.Trends of kidney cancer cases worldwide are expected to increase persistently and this inspires the modification of the traditional diagnosis system to respond to future challenges. Renal Cell Carcinoma (RCC) is the most common kidney cancer and responsible for 80–85% of all renal tumors. This study proposed a robust and computationally efficient fully automated Renal Cell Carcinoma Grading Network (RCCGNet) from kidney histopathology images. The proposed RCCGNet contains a shared channel residual (SCR) block which allows the network to learn feature maps associated with different versions of the input with two parallel paths. The SCR block shares the information between two different layers and operates the shared data separately by providing beneficial supplements to each other. As a part of this study, we also introduced a new dataset for the grading of RCC with five different grades. We obtained 722 Hematoxylin & Eosin (H &E) stained slides of different patients and associated grades from the Department of Pathology, Kasturba Medical College (KMC), Mangalore, India. We performed comparable experiments which include deep learning models trained from scratch as well as transfer learning techniques using pre-trained weights of the ImageNet. To show the proposed model is generalized and independent of the dataset, we experimented with one additional well-established data called BreakHis dataset for eight class-classification. The experimental result shows that proposed RCCGNet is superior in comparison with the eight most recent classification methods on the proposed dataset as well as BreakHis dataset in terms of prediction accuracy and computational complexity. © 2023, The Author(s).Item Assessment of land use and land cover change detection and prediction using deep learning techniques for the southwestern coastal region, Goa, India(Springer Science and Business Media Deutschland GmbH, 2024) Naik, N.; Chandrasekaran, K.; Sundaram, V.; Prabhavathy, P.Understanding the connections between human activities and the natural environment depends heavily on information about land use and land cover (LULC) in the form of accurate LULC maps. Environmental monitoring using deep learning (DL) is rapidly growing to preserve a sustainable environment in the long term. For establishing effective policies, regulations, and implementation, DL can be a valuable tool for assessing environmental conditions and natural resources that will positively impact the ecosystem. This paper presents the assessment of land use and land cover change detection (LULCCD) and prediction using DL techniques for the southwestern coastal region, Goa, also known as the tourist destination of India. It consists of three components: (i) change detection (CD), (ii) quantification of LULC changes, and (iii) prediction. A new CD assessment framework, Spatio-Temporal Encoder-Decoder Self Attention Network (STEDSAN), is proposed for the LULCCD process. A dual branch encoder-decoder network is constructed using strided convolution with downsampling for the encoder and transpose convolution with upsampling for the decoder to assess the bitemporal images spatially. The self-attention (SA) mechanism captures the complex global spatial-temporal (ST) interactions between individual pixels over space-time to produce more distinct features. Each branch accepts the LULC map of 2 years as one of its inputs to determine binary and multiclass changes among the bitemporal images. The STEDSAN model determines the patterns, trends, and conversion from one LULC type to another for the assessment period from 2005 to 2018. The binary change maps were also compared with the existing state of the art (SOTA) CD methods, with STEDSAN having an overall accuracy of 94.93%. The prediction was made using an recurrent neural network (RNN) known as long short term memory network (LSTM) for the year 2025. Experiments were conducted to determine area-wise changes in several LULC classes, such as built-up (BU), crops (kharif crop (KC), rabi crop (RC), zaid crop (ZC), double/triple (D/T C)), current fallow (CF), plantation (PL), forests (evergreen forest (EF), deciduous forest (DF), degraded/scurb forest (D/SF)), littoral swamp (LS), grassland (GL), wasteland (WL), waterbodies max (Wmx), and waterbodies min (Wmn). As per the analysis, over the period of 13 years, there has been a net increase in the amount of BU (1.25%), RC (1.17%), and D/TC(2.42%) and a net decrease in DF (3.29%) and WL(1.44%) being the most dominant classes being changed. These findings will offer a thorough description of identifying trends in coastal areas that may incorporate methodological hints for future studies. This study will also promote handling the spatial and temporal complexity of remotely sensed data employed in categorizing the coastal LULC of a heterogeneous landscape. © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2024.Item ADConv-Net: Advanced Deep Convolution Neural Network for COVID-19 Diagnostics Using Chest X-Ray and CT Images(Springer, 2025) Kumar, S.; Bhowmik, B.The worldwide COVID-19 epidemic has emerged as a significant concern, affecting daily lives and underscoring the importance of early diagnosis for effective treatment in medical and healthcare settings. Current diagnostic testing for COVID-19 is sluggish, typically requiring hours to get results. Detection of COVID-19 from medical imaging presents a challenging task that has gained substantial interest from experts worldwide. Essential imaging modalities for diagnosing COVID-19 include chest X-rays and computed tomography (CT) scans. By contrast, most of the chest radiography can be completed in within fifteen minutes. Thus, employing chest radiography gives a possibility for early and reliable diagnosis of COVID-19, intending to relieve therapeutic obstacles for patients and speed up the diagnostic process. Recently, deep learning (DL) techniques have been shown to be effective in image-based diagnostics. This paper proposed an advanced deep convolution neural network (ADConv-Net) for COVID-19 detection and categorization using chest X-ray and CT images. The proposed technique is not only capable of recognizing critical connections and similarities in image classification, but also leads to improved diagnostic accuracy. The proposed model undergoes thorough evaluation for standard performance metrics. After evaluation, the ADConv-Net model achieves high accuracies of 98.84% and 97.25% in training and testing for X-ray images and 99.41% and 98.87% in training and testing for CT images, respectively. Additionally, the proposed model demonstrates strong performance, with AUC values of 0.993 and 0.996 for X-ray and CT images, respectively. Further, the model also introduces a heatmap approach for displaying COVID-19 disease areas. Subsequently, radiologists can find COVID-19 disorders in chest X-ray and CT images with this approach. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2025.Item EffiCOVID-net: A highly efficient convolutional neural network for COVID-19 diagnosis using chest X-ray imaging(Academic Press Inc., 2025) Kumar, S.; Bhowmik, B.The global COVID-19 pandemic has drastically affected daily life, emphasizing the urgent need for early and accurate detection to provide adequate medical treatment, especially with limited antiviral options. Chest X-ray imaging has proven crucial for distinguishing COVID-19 from other respiratory conditions, providing an essential diagnostic tool. Deep learning (DL)-based models have proven highly effective in image diagnostics in recent years. Many of these models are computationally intensive and prone to overfitting, especially when trained on limited datasets. Additionally, conventional models often fail to capture multi-scale features, reducing diagnostic accuracy. This paper proposed a highly efficient convolutional neural network (CNN) called EffiCOVID-Net, incorporating diverse feature learning units. The proposed model consists of a bunch of EffiCOVID blocks that incorporate several layers of convolution containing (3×3) filters and recurrent connections to extract complex features while preserving spatial integrity. The performance of EffiCOVID-Net is rigorously evaluated using standard performance metrics on two publicly available COVID-19 chest X-ray datasets. Experimental results demonstrate that EffiCOVID-Net outperforms existing models, achieving 98.68% accuracy on the COVID-19 radiography dataset (D1), 98.55% on the curated chest X-ray dataset (D2), and 98.87% on the mixed dataset (DMix) in multi-class classification (COVID-19 vs. Normal vs. Pneumonia). For binary classification (COVID-19 vs. Normal), the model attains 99.06%, 99.78%, and 99.07% accuracy, respectively. Integrating Grad-CAM-based visualizations further enhances interpretability by highlighting critical regions influencing model predictions. EffiCOVID-Net's lightweight architecture ensures low computational overhead, making it suitable for deployment in resource-constrained clinical settings. A comparative analysis with existing methods highlights its superior accuracy, efficiency, and robustness performance. However, while the model enhances diagnostic workflows, it is best utilized as an assistive tool rather than a standalone diagnostic method. © 2025 Elsevier Inc.Item Development and evaluation of deep neural networks for the classification of subtypes of renal cell carcinoma from kidney histopathology images(Nature Research, 2025) Chanchal, A.K.; Lal, S.; Suresh, S.Kidney cancer is a leading cause of cancer-related mortality, with renal cell carcinoma (RCC) being the most prevalent form, accounting for 80–85% of all renal tumors. Traditional diagnosis of kidney cancer requires manual examination and analysis of histopathology images, which is time-consuming, error-prone, and depends on the pathologist’s expertise. Recently, deep learning algorithms have gained significant attention in histopathology image analysis. In this study, we developed an efficient and robust deep learning architecture called RenalNet for the classification of subtypes of RCC from kidney histopathology images. The RenalNet is designed to capture cross-channel and inter-spatial features at three different scales simultaneously and combine them together. Cross-channel features refer to the relationships and dependencies between different data channels, while inter-spatial features refer to patterns within small spatial regions. The architecture contains a CNN module called multiple channel residual transformation (MCRT), to focus on the most relevant morphological features of RCC by fusing the information from multiple paths. Further, to improve the network’s representation power, a CNN module called Group Convolutional Deep Localization (GCDL) has been introduced, which effectively integrates three different feature descriptors. As a part of this study, we also introduced a novel benchmark dataset for the classification of subtypes of RCC from kidney histopathology images. We obtained digital hematoxylin and eosin (H&E) stained WSIs from The Cancer Genome Atlas (TCGA) and acquired region of interest (ROIs) under the supervision of experienced pathologists resulted in the creation of patches. To demonstrate that the proposed model is generalized and independent of the dataset, it has experimented on three well-known datasets. Compared to the best-performing state-of-the-art model, RenalNet achieves accuracies of 91.67%, 97.14%, and 97.24% on three different datasets. Additionally, the proposed method significantly reduces the number of parameters and FLOPs, demonstrating computationally efficient with 2.71 × FLOPs & 0.2131 × parameters. © The Author(s) 2025.Item Multi head attention based deep learning framework for waxberry fruit object segmentation from high resolution remote sensing images(Nature Research, 2025) Vaghela, R.; Sravya, N.; Lal, S.; Sarda, J.; Thakkar, A.; Patil, S.In some Asian countries, waxberries are special fruit that demand substantial labour for harvesting each season. To ease this burden, automated fruit-picking equipment has seen extensive development over the past decade. However, accurately segmenting waxberries in orchards remains challenging due to complex environments with overlapping fruits, foliage occlusions, and variable lighting conditions. Most existing segmentation methods are optimized for controlled environments with steady lighting and unobstructed views of the fruit, which limits their effectiveness in real-world scenarios. This paper introduces a fully convolutional neural network namely Multi-Attention Waxberry Network (MAWNet) which effectively addresses challenges such as occlusions, overlapping fruits and variable lighting conditions. The MAWNet is a UNet based architecture and it consist of enhanced residual block, transformer block, Atrous Spatial Pyramid Pooling (ASPP) block and introduced Multiple Dilation Convolutional (MDC) Block. The experimental results validate that the proposed MAWNet model surpasses several State-of-the-Art (SOTA) architectures, in terms of performance with achieving a remarkable accuracy of 99.63%, an Intersection over Union (IoU) of 96.77%, and a Dice coefficient of 98.34%. © The Author(s) 2025.
