Conference Papers
Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/28506
Browse
7 results
Search Results
Item Feature selection using fast ensemble learning for network intrusion detection(Springer Verlag service@springer.de, 2020) Pasupulety, U.; Adwaith, C.D.; Hegde, S.; Patil, N.Network security plays a critical role in today’s digital system infrastructure. Everyday, there are hundreds of cases of data theft or loss due to the system’s integrity being compromised. The root cause of this issue is the lack of systems in place which are able to foresee the advent of such attacks. Network Intrusion detection techniques are important to prevent any system or network from malicious behavior. By analyzing a dataset with features summarizing the method in which connections are made to the network, any attempt to access it can be classified as malicious or benign. To improve the accuracy of network intrusion detection, various machine learning algorithms and optimization techniques are used. Feature selection helps in finding important attributes in the dataset which have a significant effect on the final classification. This results in the reduction of the size of the dataset, thereby simplifying the task of classification. In this work, we propose using multiple techniques as an ensemble for feature selection. To reduce training time and retain accuracy, the important features of a subset of the KDD Network Intrusion detection dataset were analyzed using this ensemble learning technique. Out of 41 possible features for network intrusion, it was found that host-based statistical features of network flow play an import role in predicting network intrusion. Our proposed methodology provides multiple levels of overall selected features, correlated to the number of individual feature selection techniques that selected them. At the highest level of selected features, our experiments yielded a 6% increase in intrusion detection accuracy, an 81% decrease in dataset size and a 5.4× decrease in runtime using a Multinomial Naive Bayes classifier on the original dataset. © Springer Nature Switzerland AG 2020.Item Designing Scalable Intrusion Detection Systems with Stacking Based Ensemble Learning(Springer Science and Business Media Deutschland GmbH, 2022) Sujan Reddy, A.S.; Akashdeep, S.; Kamath S․, S.; Rudra, B.Network Intrusion Detection Systems monitor the network traffic and reports any malicious activity. In this paper, a combination of feature engineering techniques and Ensemble Learning is proposed to build an effective Intrusion Detection System. The zero importance feature selection method is used to extract 23 features. Random forests, Feed Forward Neural Networks and Auto encoders are used as the base models and the predictions from these base models are combined using Extreme Gradient Boosting (XGB). To ensure that the proposed ensemble model is scalable as well, parallel programming is used for parallel computation of class probabilities from each model of the ensemble. The NSL-KDD dataset is used to train our models. To test our models, we use KDD+test dataset. Experimental results show that the proposed ensemble model outperforms several state-of-the-art works. The proposed parallel programming approach decreases the average prediction time of the model ensuring that the model is scalable. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.Item Fake News Detection Using Genetic Algorithm-Based Feature Selection and Ensemble Learning(Springer Science and Business Media Deutschland GmbH, 2022) Nikitha, K.M.; Rozario, R.; Pradeep, C.; Ananthanarayana, V.S.Since its conception roughly 40 years ago, the Internet has always been an unpoliced area of human interaction. This lawlessness has since been curbed with legislation, making nefarious activities on the web constitutionally punishable. However, in the case of fake news and disinformation campaigns, the responsibility of verification is placed on the reader and the publisher, and there is no easily executable legal recourse for wrongdoers. This lack of policing combined with the power of controlling popular opinion for uses such as election manipulation, slander as a form of blackmail, stock manipulation for insider trading, shielding corporate wrong-doing makes it clear that this is a problem worth solving. Furthermore, we believe that automating the process is crucial as the task requires processing a massive amount of information whilst also being free of all biases, which is not possible by a human team. This paper explores different text properties that can indicate if a newspaper article is likely to be false or real. Our novel approach makes use of an ensemble learner created using weak learners. The weak learners are further trained on selective features to make them moderate learners. Our study shows that training individual models on different sets of features extracted using genetic algorithms performs better than models trained on all features. These become moderate learners and surpass the weak learners on performance. Further, when we ensemble these moderate learners, we achieve superior results than normal ensemble learners. © 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.Item Machine Learning based COVID-19 Mortality Prediction using Common Patient Data(Institute of Electrical and Electronics Engineers Inc., 2022) Agrawal, S.; Patil, N.COVID-19 was declared a pandemic in 2020, and it caused havoc worldwide. The fact that it is unpredictable adds to its lethality. The world has already seen various COVID-19 infection waves, subsequent waves being even more deadly. Many patients are asymptomatic initially but suddenly develop breathing problems. More than four million people have died due to COVID-19. It is necessary to forecast a patient's likelihood of dying so that appropriate precautions can be implemented. In this study, a COVID-19 mortality prediction model which uses machine learning is proposed. Most of the current research work requires several patient features and lab test results to predict mortality. However, we suggest a simpler and more efficient technique that relies solely on X-rays and basic patient information such as age and gender. Several ensemble-based models were evaluated and compared using a variety of metrics, and the best method was able to achieve a classification accuracy of 92.6% and AUPRC of 0.95. © 2022 IEEE.Item Ensemble Learning Approach for Short-term Energy Consumption Prediction(Association for Computing Machinery, 2022) Sujan Reddy, A.; Akashdeep; Harshvardhan; Kamath S․, S.Predicting electricity consumption accurately is crucial for garnering insights and potential trends into energy consumption for effective resource management. Due to the linearity/non-linearity in usage patterns, electricity consumption prediction is challenging and cannot be adequately solved by using a single model. In this paper, we propose ensemble learning based approaches for short-term electricity consumption on an open dataset. The ensemble model is built on the combined predictions of supervised machine learning and deep learning base models. Experimental validation showed that the proposed ensemble model is more accurate and decreases the training time of the second layer of the ensemble by a factor close to ten, compared to the state-of-the-art. We observed a reduction of approximately 34% in the Root mean squared error for the same size of historical window. © 2022 Owner/Author.Item Automatic Abnormality Detection in Musculoskeletal Radiographs Using Ensemble of Pre-trained Networks(Springer Science and Business Media Deutschland GmbH, 2023) Verma, R.; Jain, S.; Saritha, S.K.; Dodia, S.Musculoskeletal disability (MSDs) defined as the injuries that affect the movement or musculoskeletal system of the human body. Over the worldwide, it is the second most cause of physical disability. Musculoskeletal disability worsens over time and can result in long-term discomfort and severe disability. As a result, early detection and diagnosis of these anomalies is essential. But the diagnosis process is very time consuming, error prone and required diagnostic professional. Deep learning algorithms have recently been applied in medical imaging that provides a robust platform with very reliable outcomes. The development of Computer Aided Detection (CAD) system extensively speed up the diagnosis process. In this paper, a weighted ensemble model has been proposed, which is the combination of three pre-trained models (DenseNet169, MobileNet, and XceptionNet). The weighted ensemble model is tested on MURA dataset, a large public dataset provided by Stanford ML Group. Our model achieved a cohen’s kappa score 0.739 with precision of 0.885 and recall of 0.854, which is higher than many existing approaches such as densenet169 and ensemble200 model. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.Item Cardiovascular Diseases Divination using Artificial Neural Network with Ensemble Models(Institute of Electrical and Electronics Engineers Inc., 2023) Pabitha, B.; Sanshi, S.; Karthik, N.Health is wealth, but nowadays, wealth is health, where humans keep running their day-to-day activities without caring about their health for various reasons. Every human being in this world suffers from one or other diseases. Recently, cardiovascular diseases like heart attacks are prevalent in all age groups. Addressing cardiovascular diseases is essential before the disease reaches a crucial stage. Nowadays, artificial intelligence algorithms have been used to detect diseases in their early stages. In this piece of writing, a model of an artificial neural network is utilised to analyze, detect and predict the likelihood of cardiovascular disease in the early stages. In this proposed work, feed forward propagation, forward the input data to learn and map the relationships between inputs and outputs, and backward propagation is used to reduce the errors in the data. Further, an ensemble learning stacked model is used to achieve high accuracy in the prediction of diseases. To verify the correctness of the model, ensemble learning to stack is executed with three different models, namely Model 1, Model 2, and Model 3, with varying sets of feature selections. The experiment results show an accuracy rate of 93% in their predictions. © 2023 IEEE.
