Distributed load balancing frequent colossal closed itemset mining algorithm for high dimensional dataset

Vanahalli M.K.; Patil N.

Please use this identifier to cite or link to this item: https://idr.nitk.ac.in/jspui/handle/123456789/16134

Full metadata record

DC Field	Value	Language
dc.contributor.author	Vanahalli M.K.
dc.contributor.author	Patil N.
dc.date.accessioned	2021-05-05T10:29:50Z	-
dc.date.available	2021-05-05T10:29:50Z	-
dc.date.issued	2020
dc.identifier.citation	Journal of Parallel and Distributed Computing , Vol. 144 , , p. 136 - 152	en_US
dc.identifier.uri	https://doi.org/10.1016/j.jpdc.2020.05.017
dc.identifier.uri	http://idr.nitk.ac.in/jspui/handle/123456789/16134	-
dc.description.abstract	The focus of extracting colossal closed itemsets from high dimensional biological datasets has been great in recent times. A massive set of short and average sized mined itemsets do not confine complete and valuable information for decision making. But, the traditional itemset mining algorithms expend a gigantic measure of time in mining a massive set of short and average sized itemsets. The greater interest of research in the field of bioinformatics and the abundant data across the variety of domains paved the way for the generation of the high dimensional dataset. These datasets are depicted by an extensive number of features and a smaller number of rows. Colossal closed itemsets are very significant for numerous applications including the field of bioinformatics and are influential during the decision making. Extracting a huge amount of information and knowledge from the high dimensional dataset is a nontrivial task. The existing colossal closed itemsets mining algorithms for the high dimensional dataset are sequential and computationally expensive. Distributed and parallel computing is a good strategy to overcome the inefficiency of the existing sequential algorithm. Balanced Distributed Parallel Frequent Colossal Closed Itemset Mining (BDPFCCIM) algorithm is designed for high dimensional datasets. An efficient closeness checking method to check the closeness of the rowset and an efficient pruning strategy to snip the row enumeration mining search space is enclosed with the proposed BDPFCCIM algorithm. The proposed BDPFCCIM algorithm is the first distributed load balancing algorithm to mine frequent colossal closed itemsets from high dimensional biological datasets. The experimental results demonstrate the efficient performance of the proposed BDPFCCIM algorithm in comparison with the state-of-the-art algorithms. © 2020 Elsevier Inc.	en_US
dc.title	Distributed load balancing frequent colossal closed itemset mining algorithm for high dimensional dataset	en_US
dc.type	Article	en_US
Appears in Collections:	1. Journal Articles

Files in This Item:

There are no files associated with this item.

Show simple item record