Faculty Publications
Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736
Publications by NITK Faculty
Browse
2 results
Search Results
Item EGA-FMC: Enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data(Inderscience Enterprises Ltd., 2018) Narasimhan, M.; Balasubramanian, B.; Kumar, S.D.; Patil, N.Categorical data clustering is the unsupervised technique of grouping similar objects which have categorical attributes. We propose a genetic algorithm-based fuzzy k-modes categorical data clustering algorithm using multi-objective rank-based selection with enhanced elitism operation. Compactness of the clusters and inter-cluster separation were chosen as objectives to be optimised. During elitism, in every iteration, the best parent chromosomes were identified. The entire population was passed through the selection, crossover and mutation steps. The worst children were then replaced by the best parents. Our method was evaluated on three real-world datasets and resulted in clusters of better quality as compared to current methods with a significant reduction in computation time. Additionally, statistical significance tests were conducted to show the superiority of our approach over other clustering solutions. © © 2018 Inderscience Enterprises Ltd.Item A novel filter–wrapper hybrid greedy ensemble approach optimized using the genetic algorithm to reduce the dimensionality of high-dimensional biomedical datasets(Elsevier Ltd, 2019) Gangavarapu, T.; Patil, N.The predictive accuracy of high-dimensional biomedical datasets is often dwindled by many irrelevant and redundant molecular disease diagnosis features. Dimensionality reduction aims at finding a feature subspace that preserves the predictive accuracy while eliminating noise and curtailing the high computational cost of training. The applicability of a particular feature selection technique is heavily reliant on the ability of that technique to match the problem structure and to capture the inherent patterns in the data. In this paper, we propose a novel filter–wrapper hybrid ensemble feature selection approach based on the weighted occurrence frequency and the penalty scheme, to obtain the most discriminative and instructive feature subspace. The proposed approach engenders an optimal feature subspace by greedily combining the feature subspaces obtained from various predetermined base feature selection techniques. Furthermore, the base feature subspaces are penalized based on specific performance dependent penalty parameters. We leverage effective heuristic search strategies including the greedy parameter-wise optimization and the Genetic Algorithm (GA) to optimize the subspace ensembling process. The effectiveness, robustness, and flexibility of the proposed hybrid greedy ensemble approach in comparison with the base feature selection techniques, and prolific filter and state-of-the-art wrapper methods are justified by empirical analysis on three distinct high-dimensional biomedical datasets. Experimental validation revealed that the proposed greedy approach, when optimized using GA, outperformed the selected base feature selection techniques by 4.17%–15.14% in terms of the prediction accuracy. © 2019 Elsevier B.V.
