Journal Articles

Permanent URI for this collection: https://idr.nitk.ac.in/handle/123456789/19884

Search Results

Now showing 1 - 7 of 7

Semantic Segmentation of Remotely Sensed Images for Land-use and Land-cover Classification: A Comprehensive Review
(Taylor and Francis Ltd., 2025) Putty, A.; Annappa, B.; Pariserum Perumal, S.
Remotely Sensed Image (RSI)-based land-use and land-cover (LULC) mapping facilitates applications such as forest logging, biodiversity protection, and urban topographical kinetics. This process has gained more attention with the widespread availability of geospatial and remote sensing data. With recent advances in machine learning and the possibility of processing nearly real-time information on the computer, LULC mapping methods broadly fall into two categories: (i) framework-dependent algorithms, where mappings are done using the in-built algorithms in Geographical Information System (GIS) software, and (ii) framework-independent algorithms, which are mainly based on deep learning techniques. Both approaches have their unique advantages and challenges. Along with the working patterns and performances of these two methodologies, this comprehensive review thoroughly analyzes deep learning architectures catering to different technical capabilities such as feature extraction, boundary extraction, transformer-based mechanisms, attention mechanisms, pyramid pooling, and lightweight models. To fine-tune these semantic segmentation processes, current technical and domain challenges and insights into future directions for analyzing RSIs of varying spatial and temporal resolutions are summarized. Cross-domain users with application-specific requirements can make use of this study to select appropriate LULC semantic segmentation models. © 2025 IETE.
COVID-19: Automatic detection from X-ray images by utilizing deep learning methods
(Elsevier Ltd, 2021) Nigam, B.; Nigam, A.; Jain, R.; Dodia, S.; Arora, N.; Annappa, B.
In recent months, a novel virus named Coronavirus has emerged to become a pandemic. The virus is spreading not only among humans but is also affecting animals. The first ever case of Coronavirus was registered in the city of Wuhan, Hubei province of China, on 31st of December 2019. Coronavirus-infected patients display symptoms very similar to pneumonia, and the virus attacks the respiratory organs of the body, causing difficulty in breathing. The disease is diagnosed using a Real-Time Reverse Transcriptase Polymerase Chain Reaction (RT-PCR) kit and requires time in the laboratory to confirm the presence of the virus. Due to insufficient availability of the kits, the suspected patients cannot be treated in time, which in turn increases the chance of spreading the disease. To overcome this problem, radiologists observed the changes appearing in radiological images such as X-ray and Computed Tomography (CT) scans. Using deep learning algorithms, a suspected patient's X-ray or CT scan can be used to differentiate between a healthy person and a patient affected by Coronavirus. In this paper, popular deep learning architectures are used to develop Coronavirus diagnostic systems. The architectures used in this paper are VGG16, DenseNet121, Xception, NASNet, and EfficientNet. Multiclass classification is performed in this paper. The classes considered are COVID-19 positive patients, normal patients, and an "other" class. The "other" class includes chest X-ray images of pneumonia, influenza, and other illnesses related to the chest region. The accuracies obtained for VGG16, DenseNet121, Xception, NASNet, and EfficientNet are 79.01%, 89.96%, 88.03%, 85.03% and 93.48%, respectively. Deep learning with radiologic images is necessary in this critical condition, as it provides radiologists with a fast and accurate second opinion. These deep learning Coronavirus detection systems can also be useful in regions where expert physicians and well-equipped clinics are not easily accessible. © 2021 Elsevier Ltd
A novel receptive field-regularized V-net and nodule classification network for lung nodule detection
(John Wiley and Sons Inc, 2022) Dodia, S.; Annappa, B.; Mahesh, M.
Recent advancements in deep learning have achieved great success in building reliable computer-aided diagnosis (CAD) systems. In this work, a novel deep-learning architecture, named receptive field regularized V-net (RFR V-Net), is proposed for detecting lung cancer nodules with reduced false positives (FP). The method applies receptive field regularization to the convolution layers of the encoder block and the deconvolution layers of the decoder block in the V-Net model. Further, nodule classification is performed using a new combination of SqueezeNet and ResNet, named nodule classification network (NCNet). Postprocessing image enhancement is performed on the 2D slice by increasing the image's intensity through the addition of pseudo-color or fluorescence contrast. The proposed RFR V-Net resulted in a dice similarity coefficient of 95.01% and an intersection over union of 0.83. The proposed NCNet achieved a sensitivity of 98.38% and 2.3 FPs/scan for 3D representations. The proposed NCNet resulted in considerable improvements over existing CAD systems. © 2021 Wiley Periodicals LLC.
A comprehensive review of facial expression recognition techniques
(Springer Science and Business Media Deutschland GmbH, 2023) Rashmi Adyapady, R.R.; Annappa, B.
Emotion recognition has opened up many challenges, which have led to various advances in computer vision and artificial intelligence. The rapid development in this field has encouraged the development of automatic systems that can accurately analyze and measure the emotions of human beings via facial expressions. This study mainly focuses on facial expression recognition from visual cues, as visual information is the most prominent channel for social communication. The paper provides a comprehensive review of recent advancements in algorithm development, presents the overall findings of the past decades, and discusses their advantages and constraints. It explores the transition from the laboratory-controlled environment to challenging real-world (in-the-wild) conditions, focusing on essential issues that require further exploration. Finally, the relevant opportunities, challenges, and future directions mentioned in this paper can assist researchers and academicians in designing efficient and robust facial expression recognition systems. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
An ensemble approach using a frequency-based and stacking classifiers for effective facial expression recognition
(Springer, 2023) Adyapady R, R.; Annappa, B.
Facial expressions are an essential aspect of human behavior for communicating effectively. A more profound understanding of human behavior requires accurate analysis and interpretation of the emotional content. Hence, facial features play a crucial role, as they contain beneficial information about facial expressions. A baseline architecture belonging to the EfficientNet family of models is explored for feature extraction. In this work, two novel strategies, the ensemble model using the frequency-based voting approach (FV-EffNet) and the stacking classifier (SC-EffNet), are proposed to enhance classification performance. The proposed system deals with both profile and frontal pose variations. The combination of deep learning models with a stacking classifier gave the best results of 98.35% and 98.06%, and the frequency-based approach used with the ensemble classifier achieved superior performance of 98.71% and 98.56% on the Oulu-CASIA and RaFD datasets, respectively. The experimental results show that the proposed methodology performs better than previous studies on the Oulu-CASIA and RaFD datasets and is more robust to pose variations. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
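As a rough illustration of what a frequency-based voting ensemble can look like, the sketch below takes the label predicted by each model for each image and keeps the most frequent one. This is plain majority voting and only an assumption about the general idea; the abstract does not describe the actual FV-EffNet voting rule, and the function name `frequency_vote` and the tie-breaking choice are hypothetical.

```python
from collections import Counter


def frequency_vote(predictions):
    """Combine per-model label predictions by majority (frequency) vote.

    predictions: list of label lists, one list per model, all the same length.
    Ties are broken in favour of the earliest model in the list (an
    illustrative choice, not taken from the paper).
    """
    voted = []
    for labels in zip(*predictions):  # labels for one sample, one per model
        counts = Counter(labels)
        top = max(counts.values())
        # First label (in model order) that reaches the highest count.
        winner = next(label for label in labels if counts[label] == top)
        voted.append(winner)
    return voted


if __name__ == "__main__":
    model_outputs = [
        ["happy", "sad", "angry"],    # hypothetical model 1
        ["happy", "angry", "angry"],  # hypothetical model 2
        ["sad", "sad", "fear"],       # hypothetical model 3
    ]
    print(frequency_vote(model_outputs))
```

A stacking classifier, by contrast, would train a second-level model on the base models' outputs rather than counting votes.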
KAC SegNet: A Novel Kernel-Based Active Contour Method for Lung Nodule Segmentation and Classification Using Dense AlexNet Framework
(World Scientific, 2024) Dodia, S.; Annappa, B.; Mahesh, P.A.
Lung cancer is known to be one of the leading causes of death worldwide. The survival rate of patients can be increased if the disease is detected at an early stage. Computed Tomography (CT) scans are prominently used to detect and classify lung cancer nodules/tumors in the thoracic region. There is a need to develop an efficient and reliable computer-aided diagnosis model to detect lung cancer nodules accurately from CT scans. This work proposes a novel kernel-based active-contour (KAC) SegNet deep learning model to perform lung cancer nodule detection from CT scans. The active contour uses a snake method to detect internal and external boundaries of the curves, which is used to extract the Region Of Interest (ROI) from the CT scan. From the extracted ROI, the nodules are further classified as benign or malignant using a Dense AlexNet deep learning model. The key contributions of this work are the fusion of an edge detection method with a deep learning segmentation method, which provides enhanced lung nodule segmentation performance, and an ensemble of state-of-the-art deep learning classifiers, which exploits the advantages of both DenseNet and AlexNet to learn better discriminative information from the detected lung nodules. The experimental outcome shows that the proposed segmentation approach achieves a Dice Score Coefficient of 97.8% and an Intersection-over-Union of 92.96%. The classification achieved an accuracy of 95.65%, with False Positive Rate and False Negative Rate values of 0.0572 and 0.0289, respectively. The proposed model is robust compared to the existing state-of-the-art methods. © 2024 World Scientific Publishing Company.
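For readers unfamiliar with the two segmentation metrics reported in the lung nodule abstracts, the following minimal sketch computes the Dice coefficient and Intersection-over-Union for a pair of binary masks using their standard definitions. The function name `dice_and_iou`, the nested-list mask representation, and the convention of returning 1.0 for two empty masks are assumptions made for illustration; the papers' own evaluation code is not shown in this listing.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as nested lists of 0/1.

    Dice = 2|A ∩ B| / (|A| + |B|),  IoU = |A ∩ B| / |A ∪ B|.
    Two empty masks are treated as a perfect match (an assumed convention).
    """
    p = [bool(v) for row in pred for v in row]
    t = [bool(v) for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must contain the same number of pixels")

    intersection = sum(a and b for a, b in zip(p, t))
    union = sum(a or b for a, b in zip(p, t))
    total = sum(p) + sum(t)

    dice = 2.0 * intersection / total if total else 1.0
    iou = intersection / union if union else 1.0
    return dice, iou


if __name__ == "__main__":
    predicted = [[1, 1, 0],
                 [0, 1, 0]]
    ground_truth = [[1, 0, 0],
                    [0, 1, 1]]
    print(dice_and_iou(predicted, ground_truth))
```

For a single pair of masks the two metrics are related by Dice = 2·IoU / (1 + IoU); values averaged over many scans need not satisfy this relation exactly.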
Efficient Kalman filter based deep learning approaches for workload prediction in cloud and edge environments
(Springer, 2025) Kumar, M.R.; Annappa, B.; Yadav, V.
Offering cloud resources to consumers presents several difficulties for cloud service providers. When utilizing resources efficiently in cloud and edge contexts, precisely forecasting workload is a crucial problem. Accurate workload prediction allows intelligent resource allocation, preventing needless waste of computational and storage resources while meeting users’ Quality of Service (QoS) requirements. To address this issue, novel Kalman filter-based hybrid models, including Long Short Term Memory (LSTM), Bi-directional Long Short Term Memory (BI-LSTM), and Gated Recurrent Unit (GRU), are proposed. These models utilize CNN and attention mechanisms to predict workloads at edge servers accurately. The proposed models were extensively evaluated on real-world traces such as the Alibaba_v2018, Materna, Bitbrains, Microsoft Azure_2019 and PlanetLab datasets at various time intervals, with and without the Kalman filter. The experimental comparison shows MSE reductions of 97%, 82% and 90% for Alibaba; 73%, 73% and 63% for Materna; 72%, 63% and 40% for PlanetLab; 95%, 77% and 96% for Microsoft Azure; and 91%, 87% and 91% for Bitbrains, with respect to CPU utilization (%). The effectiveness of the proposed forecasting model is validated through statistical analysis using the Friedman and Nemenyi post-hoc tests. © The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature 2024.
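To give a sense of what a Kalman filter contributes to a workload trace, the sketch below applies a scalar Kalman filter with a random-walk state model to a series of CPU utilization readings, producing a smoothed series. How the paper actually combines the filter with the LSTM, BI-LSTM and GRU models is not stated in the abstract; the function name `kalman_smooth`, the noise variances, and the sample readings are all hypothetical.

```python
def kalman_smooth(measurements, process_var=1e-3, measurement_var=1.0):
    """Smooth a 1-D series with a scalar Kalman filter (random-walk model).

    process_var: assumed variance of the true signal's step-to-step drift.
    measurement_var: assumed variance of the measurement noise.
    Returns a list of filtered estimates, one per measurement.
    """
    if not measurements:
        return []

    estimate = float(measurements[0])  # initialise with the first reading
    error = 1.0                        # initial estimate variance (assumed)
    smoothed = [estimate]

    for z in measurements[1:]:
        # Predict: the state is assumed unchanged, uncertainty grows.
        error += process_var
        # Update: blend prediction and measurement using the Kalman gain.
        gain = error / (error + measurement_var)
        estimate += gain * (z - estimate)
        error *= (1.0 - gain)
        smoothed.append(estimate)
    return smoothed


if __name__ == "__main__":
    cpu_utilization = [42.0, 45.5, 39.0, 60.0, 44.0, 43.5]  # made-up readings
    for raw, filtered in zip(cpu_utilization, kalman_smooth(cpu_utilization)):
        print(f"{raw:6.2f} -> {filtered:6.2f}")
```

A smaller `process_var` relative to `measurement_var` makes the filter trust its running estimate more and the individual readings less, which gives a smoother output.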
