Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 2 of 2
  • Item
    Bayesian optimization and gradient boosting to detect phishing websites
    (Institute of Electrical and Electronics Engineers Inc., 2021) Pavan, R.; Nara, M.; Gopinath, S.; Patil, N.
    We propose an Extreme Gradient Boosting framework for classification and regression problems emerging in machine learning for small-sized data sources sampled from a discrete distribution, i.e. data containing discrete or quantized attributes. The model parameters are iteratively refined from a prior belief for specific use cases using Bayesian optimization. We focus the application area of this framework on detecting fraudulent websites. With properly stated reasoning, we empirically test our methodology on a publicly available and bench-marked UCI Phishing dataset to demonstrate the superior performance of this approach as compared to existing methods in the literature. © 2021 IEEE.
  • Item
    EGA-FMC: Enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data
    (Inderscience Enterprises Ltd., 2018) Narasimhan, M.; Balasubramanian, B.; Kumar, S.D.; Patil, N.
    Categorical data clustering is the unsupervised technique of grouping similar objects which have categorical attributes. We propose a genetic algorithm-based fuzzy k-modes categorical data clustering algorithm using multi-objective rank-based selection with enhanced elitism operation. Compactness of the clusters and inter-cluster separation were chosen as objectives to be optimised. During elitism, in every iteration, the best parent chromosomes were identified. The entire population was passed through the selection, crossover and mutation steps. The worst children were then replaced by the best parents. Our method was evaluated on three real-world datasets and resulted in clusters of better quality as compared to current methods with a significant reduction in computation time. Additionally, statistical significance tests were conducted to show the superiority of our approach over other clustering solutions. © © 2018 Inderscience Enterprises Ltd.