An efficient colossal closed itemset mining algorithm for a dataset with high dimensionality

dc.contributor.authorVanahalli, M.K.
dc.contributor.authorPatil, N.
dc.date.accessioned2026-02-04T12:28:02Z
dc.date.issued2022
dc.description.abstractThe greater interest of research in the field of bioinformatics and the ample amount of available data across the different domains paved the way for the generation of the dataset with high dimensionality. The number of features in the dataset with high dimensionality are very high and number of rows are less. The significance of the Frequent Colossal Closed Itemsets (FCCI) is high for diverse applications and also for the field of bioinformatics. FCCI are very prominent in the process of the decision making. Amount of information extraction from the dataset with high dimensionality is huge and this extraction is a non-trivial task. The pruning of all the inadmissible features and rows is not performed by the state-of-the-art algorithms. The proposed work articulates the pruning of all the inadmissible features and rows, an efficient pruning strategy to snip the row enumeration mining search space and closure method for checking the closeness of the rowset. An efficient row enumeration algorithm enclosing the rowset closure checking method and pruning strategy is designed to efficiently mine the complete set of FCCI. The experimental results demonstrate the effectiveness of pruning all the inadmissible features and rows. © 2020 The Authors
dc.identifier.citationJournal of King Saud University - Computer and Information Sciences, 2022, 34, 6, pp. 2798-2808
dc.identifier.issn13191578
dc.identifier.urihttps://doi.org/10.1016/j.jksuci.2020.04.008
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/22573
dc.publisherKing Saud bin Abdulaziz University
dc.subjectBioinformatics
dc.subjectDecision making
dc.subjectHigh dimensionality
dc.subjectMining search space
dc.subjectPruning strategy
dc.subjectRowset closeness checking
dc.titleAn efficient colossal closed itemset mining algorithm for a dataset with high dimensionality

Files

Collections