Faculty Publications
Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736
Publications by NITK Faculty
Browse
9 results
Search Results
Item A novel technique of feature selection with relieff and CFS for protein sequence classification(Springer Verlag service@springer.de, 2019) Kaur, K.; Patil, N.Bioinformatics has gained wide importance in research area for the last few decades. The main aim is to store the biological data and analyze it for better understanding. To predict the functions of newly added protein sequences, the classification of existing protein sequence is of great use. The rate at which protein sequence data is getting accumulated is increasing exponentially. So, it emerges as a very challenging task for the researcher, to deal with large number of features obtained by the use of various encoding techniques. Here, a two-stage algorithm is proposed for feature selection that combines ReliefF and CFS technique that takes extracted features as input and provides us with the discriminative set of features. The n-gram sequence encoding technique has been used to extract the feature vector from the protein sequences. In the first stage, ReliefF approach is used to rank the features and obtain candidate feature set. In the second stage, CFS is applied on this candidate feature set to obtain features that have high correlation with the class but less correlation with other features. The classification methods like Naive-Bayes, decision tree, and k-nearest neighbor can be used to analyze the performance of proposed approach. It is observed that this approach has increased accuracy of classification methods in comparison to existing methods. © Springer Nature Singapore Pte Ltd. 2019Item Bayesian optimization and gradient boosting to detect phishing websites(Institute of Electrical and Electronics Engineers Inc., 2021) Pavan, R.; Nara, M.; Gopinath, S.; Patil, N.We propose an Extreme Gradient Boosting framework for classification and regression problems emerging in machine learning for small-sized data sources sampled from a discrete distribution, i.e. data containing discrete or quantized attributes. The model parameters are iteratively refined from a prior belief for specific use cases using Bayesian optimization. We focus the application area of this framework on detecting fraudulent websites. With properly stated reasoning, we empirically test our methodology on a publicly available and bench-marked UCI Phishing dataset to demonstrate the superior performance of this approach as compared to existing methods in the literature. © 2021 IEEE.Item Survey of Leukemia Cancer Cell Detection Using Image Processing(Springer Science and Business Media Deutschland GmbH, 2022) Devi, T.G.; Patil, N.; Rai, S.; Philipose, C.S.Cancer is the development of abnormal cells that divide at an abnormal pace, uncontrollably. Cancerous cells have the ability to destroy other normal tissues and can spread throughout the body. Cancer cells can develop in various parts of the body. The paper focuses on leukemia which is a type of blood cancer. Blood cancer usually start in the bone marrow where the blood is produced in the body. The types of blood cancer are: Leukemia, Non-Hodgkin lymphoma, Hodgkin lymphoma, and Multiple myeloma. Leukemia is a type of blood cancer that originates in the bone marrow. Leukemia is seen when the body produces an abnormal amount of white blood cells that hinder the bone marrow from creating red blood cells and platelets. Several detection methods to identify the cancerous cells have been proposed. Identification of the cancer cells through cell image processing is very complex. The use of computer aided image processing allows the images to be viewed in 2D and 3D making it easier to identify the cancerous cells. The cells have to undergo segmentation and classification in order to identify the cancerous tumours. Several papers propose segmentation methods, classification methods and some propose both. The purpose of this survey is to review various papers that use either conventional methods or machine learning methods to detect the cells as cancerous and non-cancerous. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.Item Sentiment Analysis on Worldwide COVID-19 Outbreak(Springer Science and Business Media Deutschland GmbH, 2024) Vasudev, R.; Dahikar, P.; Jain, A.; Patil, N.Sentiment analysis has proved to be an effective way to easily mine public opinions on issues, products, policies, etc. One of the ways this is achieved is by extracting social media content. Data extracted from the social media has proven time and again to be the most powerful source material for sentiment analysis tasks. Twitter, which is widely used by the general public to express their concerns over daily affairs, can be the strongest tool to provide data for such analysis. In this paper, we intend to use the tweets posted regarding the COVID-19 pandemic for a sentiment analysis study and sentiment classification using BERT model. Due to its transformer architecture and bidirectional approach, this deep learning model can be easily preferred as the best choice for our study. As expected, the model performed very well in all the considered classification metrics and achieved an overall accuracy of 92%. © 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.Item Soil Type Identification via Deep Learning and Machine Learning Methods(Springer Science and Business Media Deutschland GmbH, 2024) Jalapur, S.; Patil, N.Soil type identification stands as a crucial concern in numerous countries, to ensure optimal crop yield, farmers need to accurately identify the suitable soil type for specific crops, which plays a significant role in meeting the heightened global food demand. The objective of this survey paper is to present a thorough and up-to-date overview of prevailing methodologies in soil identification, primarily focusing on image analysis, machine learning, and deep learning techniques. The paper initiates by highlighting the significance of soil identification and the limitations inherent in traditional methods. It then delves into the fundamental principles of image processing, deep learning, and spectroscopy, explaining how these techniques can be applied to soil identification. The survey presents an in-depth analysis of various image processing techniques employed for soil identification, including image segmentation, feature extraction, and classification algorithms. Furthermore, it discusses the application of deep learning models for soil classification based on image data. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.Item Image Analysis of Nuclei Histopathology Using Deep Learning: A Review of Segmentation, Detection, and Classification(Springer, 2023) Kadaskar, M.; Patil, N.Deep learning has recently advanced in its applicability to computer vision challenges, and medical imaging has become the most used technique in histopathology image analysis. Nuclei instance segmentation, detection, and classification are one such task. Reliable analysis of these image slides is critical in cancer identification, treatment, and care. Researchers have recently been interested in this issue. This study reviews the categorization and investigation of strategies utilized in recent works to improve the effectiveness of automated nuclei segmentation, detection, and classification in histopathology images. It critically examines state-of-the-art deep learning techniques, analyzes the trends, identifies the challenges, and highlights and helps with the future directions for research. The taxonomy includes deep learning techniques, enhancement, and optimization methods. The survey findings will help to overcome the challenges of nuclei segmentation, detection, and classification while improving the performance of models and, thus, aid future research plans. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.Item A fast and novel approach based on grouping and weighted mRMR for feature selection and classification of protein sequence data(Inderscience Publishers, 2020) Kaur, K.; Patil, N.The analysis of protein sequences under bioinformatics has gained wide importance in research area. Newly added protein sequences can be analysed using existing proteins and converting them into feature vector form. However, it emerges as a challenging task to deal with huge number of features obtained using sequence encoding techniques. Since all the features obtained are not actually required, a three-stage feature selection approach has been proposed. In the first stage, features are ranked and most irrelevant features are removed; in the second stage, conflicting features are grouped together; and in third stage, a fast approach based on weighted Minimum Redundancy Maximum Relevance (wMRMR) has been proposed and applied on grouped features. Different classification methods are used to analyse the performance of the proposed approach. It is observed that the proposed approach has increased classification accuracy results and reduced time consumption in comparison to the state-of-the-art methods. © 2020 Inderscience Enterprises Ltd.Item Utilizing Deep Learning Models and Transfer Learning for COVID-19 Detection from X-Ray Images(Springer, 2023) Agrawal, S.; Venkatesh, V.; Nara, M.; Patil, N.COVID-19 has been a global pandemic. Flattening the curve requires intensive testing, and the world has been facing a shortage of testing equipment and medical personnel with expertise. There is a need to automate and aid the detection process. Several diagnostic tools are currently being used for COVID-19, including X-Rays and CT-scans. This study focuses on detecting COVID-19 from X-Rays. We pursue two types of problems: binary classification (COVID-19 and No COVID-19) and multi-class classification (COVID-19, No COVID-19 and Pneumonia). We examine and evaluate several classic models, namely VGG19, ResNet50, MobileNetV2, InceptionV3, Xception, DenseNet121, and specialized models such as DarkCOVIDNet and COVID-Net and prove that ResNet50 models perform best. We also propose a simple modification to the ResNet50 model, which gives a binary classification accuracy of 99.20% and a multi-class classification accuracy of 86.13%, hence cementing the ResNet50’s abilities for COVID-19 detection and ability to differentiate pneumonia and COVID-19. The proposed model’s explanations were interpreted via LIME which provides contours, and Grad-CAM, which provides heat-maps over the area(s) of interest of the classifier, i.e., COVID-19 concentrated regions in the lungs, and realize that LIME explains the results better. These explanations support our model’s ability to generalize. The proposed model is intended to be deployed for free use. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.Item ANet: Nuclei Instance Segmentation and Classification with Attention-Based Network(Springer, 2024) Kadaskar, M.; Patil, N.The segmentation and classification of nuclei in haematoxylin and eosin-stained images is critical for diagnosing cancer and other disorders. Developing automated methods is necessary for the quantitative analysis of whole-slide images and further downstream analysis. However, many challenges are to be solved, such as varying morphology and observer differences. To address these concerns, we present ANet, an encoder–decoder structure based on attention mechanisms for nuclear segmentation and classification that makes use of information in high-dimensional features improved by attention. These blocks generate meaningful feature activation and eliminate irrelevant information to produce finer maps. It segments the touching, clustered, and overlapping nuclei and classifies them using upsampling branches. Our method includes components such as PreAct-ResNet50, residual attention, convolutional block attention module, and dense attention unit. We demonstrate how our approach achieves cutting-edge performance on several multi-tissue histopathology datasets such as Kumar, CoNSeP, and CPM17. We also demonstrate our model’s generalization capabilities on other combinations of datasets, including CPM15 and TNBC. ANet demonstrates a notable improvement of 1.14%, 2.70%, 1.41%, and 1.29% in Dice, AJI, SQ, and PQ scores, respectively, for the CPM17 dataset. In addition, it achieves a 1.18% improvement in AJI score for the Kumar dataset. Despite the inherent challenges in nuclei classification within the CoNSeP dataset, ANet yields outstanding results, showcasing a substantial improvement of 9.74%, 3.97%, and 0.80% in F1 scores for the inflammatory, spindle, and miscellaneous classes. Furthermore, ANet exhibits strong generalization across the CPM dataset, TNBC, and Combined CoNSeP, with improvements observed in the majority of metrics. The given improvement is justifiable, as are the interpretable visual results. The proposed method is of great potential for analyzing histopathology images, demonstrated by an increment of performance in multiple metrics. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2024.
