Journal Articles

Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/19884

Browse

Search Results

Now showing 1 - 10 of 54
  • Item
    Recent developments in wireless capsule endoscopy imaging: Compression and summarization techniques
    (Elsevier Ltd, 2022) Sushma, B.; Aparna., P.
    Wireless capsule endoscopy (WCE) can be viewed as an innovative technology introduced in the medical domain to directly visualize the digestive system using a battery-powered electronic capsule. It is considered a desirable substitute for conventional digestive tract diagnostic methods for a comfortable and painless inspection. Despite many benefits, WCE results in poor video quality due to low frame resolution and diagnostic accuracy. Many research groups have presented diversified, low-complexity compression techniques to economize battery power consumed in the radio-frequency transmission of the captured video, which allows for capturing the images at high resolution. Many vision-based computational methods have been developed to improve the diagnostic yield. These methods include approaches for automatically detecting abnormalities and reducing the amount of time needed for video analysis. Though various research works have been put forth in the WCE imaging field, there is still a wide gap between the existing techniques and the current needs. Hence, this article systematically reviews recent WCE video compression and summarization techniques. The review's objectives are as follows: First, to provide the details of the requirement, challenges and design percepts for the low complexity WCE video compressor. Second, to discuss the most recent compression methods, emphasizing simple distributed video coding methods. Next, to review the most recent summarization techniques and the significance of using deep neural networks. Further, this review aims to provide a quantitative analysis of the state-of-the-art methods along with their advantages and drawbacks. At last, to discuss existing problems and possible future directions for building a robust WCE imaging framework. © 2022 Elsevier Ltd
  • Item
    An enhanced protein secondary structure prediction using deep learning framework on hybrid profile based features
    (Elsevier Ltd, 2020) Kumar, P.; Bankapur, S.; Patil, N.
    Accurate protein secondary structure prediction (PSSP) is essential to identify structural classes, protein folds, and its tertiary structure. To identify the secondary structure, experimental methods exhibit higher precision with the trade-off of high cost and time. In this study, we propose an effective prediction model which consists of hybrid features of 42-dimensions with the combination of convolutional neural network (CNN) and bidirectional recurrent neural network (BRNN). The proposed model is accessed on four benchmark datasets such as CB6133, CB513, CASP10, and CAP11 using Q3, Q8, and segment overlap (Sov) metrics. The proposed model reported Q3 accuracy of 85.4%, 85.4%, 83.7%, 81.5%, and Q8 accuracy 75.8%, 73.5%, 72.2%, and 70% on CB6133, CB513, CASP10, and CAP11 datasets respectively. The results of the proposed model are improved by a minimum factor of 2.5% and 2.1% in Q3 and Q8 accuracy respectively, as compared to the popular existing models on CB513 dataset. Further, the quality of the Q3 results is validated by structural class prediction and compared with PSI-PRED. The experiment showed that the quality of the Q3 results of the proposed model is higher than that of PSI-PRED. © 2019 Elsevier B.V.
  • Item
    Automatic detection of students’ affective states in classroom environment using hybrid convolutional neural networks
    (Springer, 2020) Ashwin, A.; Guddeti, R.M.R.
    Predicting the students’ emotional and behavioral engagements using computer vision techniques is a challenging task. Though there are several state-of-the-art techniques for analyzing a student’s affective states in an e-learning environment (single person’s engagement detection in a single image frame), a very few works are available for analyzing the students’ affective states in a classroom environment (multiple people in a single image frame). Hence, in this paper, we propose a novel hybrid convolutional neural network (CNN) architecture for analyzing the students’ affective states in a classroom environment. This proposed architecture consists of two models, the first model (CNN-1) is designed to analyze the affective states of a single student in a single image frame and the second model (CNN-2) uses multiple students in a single image frame. Thus, our proposed hybrid architecture predicts the overall affective state of the entire class. The proposed architecture uses the students’ facial expressions, hand gestures and body postures for analyzing their affective states. Further, due to unavailability of standard datasets for the students’ affective state analysis, we created, annotated and tested on our dataset of over 8000 single face in a single image frame and 12000 multiple faces in a single image frame with three different affective states, namely: engaged, boredom and neutral. The experimental results demonstrate an accuracy of 86% and 70% for posed and spontaneous affective states of classroom data, respectively. © 2019, Springer Science+Business Media, LLC, part of Springer Nature.
  • Item
    Segmentation of focal cortical dysplasia lesions from magnetic resonance images using 3D convolutional neural networks
    (Elsevier Ltd, 2021) Niyas, S.; Chethana Vaisali, S.; Show, I.; Chandrika, T.G.; Vinayagamani, S.; Kesavadas, C.; Rajan, J.
    Computer-aided diagnosis using advanced Artific ial Intelligence (AI) techniques has become much popular over the last few years. This work automates the segmentation of Focal Cortical Dysplasia (FCD) lesions from three-dimensional (3D) Magnetic Resonance (MR) images. FCD is a type of neuronal malformation in the brain cortex and is the leading cause of intractable epilepsy, irrespective of gender or age differences. Since the neuron related abnormalities are usually resistant to drug therapy, surgical resection has been the main treatment approach for patients with intractable epilepsy. Automating the identification and segmentation of FCD is useful for neuroradiologists in pre-surgical evaluations. Convolutional Neural Networks (CNNs) have the ability to learn appropriate features from the training data without any human intervention. But, most of the state-of-the-art FCD segmentation approaches use two-dimensional (2D) CNN models despite the availability of 3D Magnetic resonance imaging (MRI) volumes, and hence fail to leverage the inter-slice information present in the MRI volumes. The major hurdles in considering a 3D CNN model are the need for a large 3D dataset, big memory, and high computation cost. A deep 3D CNN segmentation model, which can extract inter-slice information and overcomes the drawbacks of conventional 3D CNN methods to an extent, is proposed in this paper. The model uses a 3D version of U-Net with residual blocks that works on shallow depth 3D sub-volumes generated from MRI volumes. The proposed method shows superior performance over the state-of-the-art FCD segmentation methods in both qualitative and quantitative analysis. © 2021 Elsevier Ltd
  • Item
    V3O2: hybrid deep learning model for hyperspectral image classification using vanilla-3D and octave-2D convolution
    (Springer Science and Business Media Deutschland GmbH, 2021) Mohan, A.; Sundaram, V.
    Remote sensing image analysis is an emerging area of research and is used for various applications such as climate analysis, crop monitoring and change detection. Hyperspectral image (HSI) is one of the dominant remote sensing imaging modalities that captures information beyond the visible spectrum. The evolution of deep learning has made a significant impact on HSI analysis, mainly for its classification. The spatial–spectral feature-based classification model improves the classification accuracy of hyperspectral images (HSIs). However, these models are computationally expensive, and redundancy exists in the spatial dimension of features. This research work proposes a hybrid convolutional neural network (CNN) for HSI classification. The proposed model uses principal component analysis (PCA) as a preprocessing technique for optimal band extraction from HSIs. The hybrid CNN classification technique extracts the spectral and spatial features using three-dimensional CNN (3D CNN). These features are fed into a two-dimensional CNN (2D CNN) for further feature extraction and classification. The redundancy in spatial features of the hybrid CNN model is reduced by octave convolution (OctConv) instead of standard vanilla convolution. OctConv factorizes the spatial features into lower and higher spatial frequencies, and different convolutions are performed on them based on their frequencies. The hybrid model is compared against various state-of-the-art CNN-based techniques and found that the accuracy is boosted with a lesser computational cost. © 2020, Springer-Verlag GmbH Germany, part of Springer Nature.
  • Item
    An effective feature extraction with deep neural network architecture for protein-secondary-structure prediction
    (Springer, 2021) Jayasimha, A.; Mudambi, R.; Pavan, P.; Lokaksha, B.M.; Bankapur, S.; Patil, N.
    With the increased importance of proteins in day-to-day life, it is imperative to know the protein functions. Deciphering protein structure elucidates protein functions. Experimental approaches for protein-structure analysis are expensive and time-consuming, and require high dexterity. Thus, finding a viable computational approach is vital. Due to the high complexity of predicting protein structure (tertiary structure) directly, research in this field aims at the protein-secondary-structure prediction which is directly related to its tertiary structure. This research aims at exploring a plethora of features, namely position-specific scoring matrices, hidden Markov model alignment matrices, and physicochemical properties, that carry rich information required to predict the secondary structure. Furthermore, it aims at exploring a suitable combination of the features which could capture diverse information about the protein secondary structure. Finally, a cascaded convolutional neural network and bidirectional long short-term memory architecture is fit on the models, and two evaluation metrics, namely, Q8 score and segment overlap score, are benchmarked on various datasets. Our proposed model trained on data of CB6133 dataset and tested on CB513 dataset beats the benchmark models by a minimum of 2.9%. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature.
  • Item
    Cardamom Plant Disease Detection Approach Using EfficientNetV2
    (Institute of Electrical and Electronics Engineers Inc., 2022) Sunil, C.K.; Jaidhar, C.D.; Patil, N.
    Cardamom is a queen of spices. It is indigenously grown in the evergreen forests of Karnataka, Kerala, Tamil Nadu, and the northeastern states of India. India is the third largest producer of cardamom. Plant diseases cause a catastrophic influence on food production safety; they reduce the eminence and quantum of agricultural products. Plant diseases may cause significantly high loss or no harvest in dreadful cases. Various diseases and pests affect the growth of cardamom plants at different stages and crop yields. This study concentrated on two diseases of cardamom plants, Colletotrichum Blight and Phyllosticta Leaf Spot of cardamom and three diseases of grape, Black Rot, ESCA, and Isariopsis Leaf Spot. Various methods have been proposed for plant disease detection, and deep learning has become the preferred method because of its spectacular accomplishment. In this study, U2-Net was used to remove the unwanted background of an input image by selecting multiscale features. This work proposes a cardamom plant disease detection approach using the EfficientNetV2 model. A comprehensive set of experiments was carried out to ascertain the performance of the proposed approach and compare it with other models such as EfficientNet and Convolutional Neural Network (CNN). The experimental results showed that the proposed approach achieved a detection accuracy of 98.26%. © 2013 IEEE.
  • Item
    An empirical study of the impact of masks on face recognition
    (Elsevier Ltd, 2022) Jeevan, G.; Zacharias, G.C.; Nair, M.S.; Rajan, J.
    Face recognition has a wide range of applications like video surveillance, security, access control, etc. Over the past decade, the field of face recognition has matured and grown at par with the latest advancements in technology, particularly deep learning. Convolution Neural Networks have surpassed human accuracy in Face Recognition on popular evaluation tests such as LFW. However, most existing models evaluate their performance with an assumption of the availability of full facial information. The COVID-19 pandemic has laid forth challenges to this assumption, and to the performance of existing methods and leading-edge algorithms in the field of face recognition. This is in the wake of an explosive increase in the number of people wearing face masks. The reduced amount of facial information available to a recognition system from a masked face impacts their discrimination ability. In this context, we design and conduct a series of experiments comparing the masked face recognition performances of CNN architectures available in literature and exploring possible alterations in loss functions, architectures, and training methods that can enable existing methods to fully extract and leverage the limited facial information available in a masked face. We evaluate existing CNN-based face recognition systems for their performance against datasets composed entirely of masked faces, in contrast to the existing standard evaluations where masked or occluded faces are a rare occurrence. The study also presents evidence denoting an increased impact of network depth on performance compared to standard face recognition. Our observations indicate that substantial performance gains can be achieved by the introduction of masked faces in the training set. The study also inferred that various parameter settings determined suitable for standard face recognition are not ideal for masked face recognition. Through empirical analysis we derived new value recommendations for these parameters and settings. © 2021 Elsevier Ltd
  • Item
    Detection of arrhythmia from electrocardiogram signals using a novel gaussian assisted signal smoothing and pattern recognition
    (Elsevier Ltd, 2022) Chandrasekar, A.; Shekar, D.D.; Hiremath, A.C.; Chemmangat, K.
    The electrocardiogram is a widely used measurement for individual heart conditions, and much effort has been put into automatic arrhythmia diagnosis using machine learning. However, the classification performance is hampered by the use of less representative data in conjunction with traditional machine learning models. This paper proposes a novel algorithm for pre-processing raw Electrocardiogram signals via Gaussian Assisted Signal Smoothing. In this method, the ECG signal is modeled as a low pass component and a weighted sum of Gaussians. The Gaussians are used to model the peak characteristics of the signal, effectively preserving its structure and morphology while eliminating the noise, which is evident by the enhanced peak signal-to-noise ratio of the GASS signal. The R peaks obtained from the Pan Tompkins algorithm are used to extract the heartbeats from the filtered signal using a windowing technique. A cascaded combination of a Convolutional Neural Network and a Quadratic Support Vector Machine is then used to classify the heartbeats. The CNN model has 131,661 parameters, making it much lighter than previously reported works. The MIT-BIH Arrhythmia Database was used for our experiments. Across eleven classes, our results reveal that the model has an accuracy of 97.63% and an average F1 score of 0.9263. In contrast, previous works have primarily focused on a one vs. all or a five-class classification. From a signal processing standpoint, the proposed method offers a promising solution for Signal Filtering and Arrhythmia Classification. © 2021 Elsevier Ltd
  • Item
    Crossover based technique for data augmentation
    (Elsevier Ireland Ltd, 2022) Raj, R.; Mathew, J.; Kannath, S.K.; Rajan, J.
    Background and Objective: Medical image classification problems are frequently constrained by the availability of datasets. “Data augmentation” has come as a data enhancement and data enrichment solution to the challenge of limited data. Traditionally data augmentation techniques are based on linear and label preserving transformations; however, recent works have demonstrated that even non-linear, non-label preserving techniques can be unexpectedly effective. This paper proposes a non-linear data augmentation technique for the medical domain and explores its results. Methods: This paper introduces “Crossover technique”, a new data augmentation technique for Convolutional Neural Networks in Medical Image Classification problems. Our technique synthesizes a pair of samples by applying two-point crossover on the already available training dataset. By this technique, we create N new samples from N training samples. The proposed crossover based data augmentation technique, although non-label preserving, has performed significantly better in terms of increased accuracy and reduced loss for all the tested datasets over varied architectures. Results: The proposed method was tested on three publicly available medical datasets with various network architectures. For the mini-MIAS database of mammograms, our method improved the accuracy by 1.47%, achieving 80.15% using VGG-16 architecture. Our method works fine for both gray-scale as well as RGB images, as on the PH2 database for Skin Cancer, it improved the accuracy by 3.57%, achieving 85.71% using VGG-19 architecture. In addition, our technique improved accuracy on the brain tumor dataset by 0.40%, achieving 97.97% using VGG-16 architecture. Conclusion: The proposed novel crossover technique for training the Convolutional Neural Network (CNN) is painless to implement by applying two-point crossover on two images to form new images. The method would go a long way in tackling the challenges of limited datasets and problems of class imbalances in medical image analysis. Our code is available at https://github.com/rishiraj-cs/Crossover-augmentation © 2022