Mining closed colossal frequent patterns from high-dimensional dataset: Serial versus parallel framework

dc.contributor.authorSureshan, S.
dc.contributor.authorPenumacha, A.
dc.contributor.authorJain, S.
dc.contributor.authorVanahalli, M.
dc.contributor.authorPatil, N.
dc.date.accessioned2026-02-06T06:38:29Z
dc.date.issued2018
dc.description.abstractMining colossal patterns is one of the budding fields with a lot of applications, especially in the field of bioinformatics and genetics. Gene sequences contain inherent information. Mining colossal patterns in such sequences can further help in their study and improve prediction accuracy. The increase in average transaction length reduces the efficiency and effectiveness of existing closed frequent pattern mining algorithm. The traditional algorithms expend most of the running time in mining huge amount of minute and midsize patterns which do not enclose valuable information. The recent research focused on mining large cardinality patterns called as colossal patterns which possess valuable information. A novel parallel algorithm has been proposed to extract the closed colossal frequent patterns from high-dimensional datasets. The algorithm has been implemented on Hadoop framework to exploit its inherent distributed parallelism using MapReduce programming model. The experiment results highlight that the proposed parallel algorithm on Hadoop framework gives an efficient performance in terms of execution time compared to the existing algorithms. © Springer Nature Singapore Pte Ltd. 2018.
dc.identifier.citationAdvances in Intelligent Systems and Computing, 2018, Vol.518, , p. 317-326
dc.identifier.issn21945357
dc.identifier.urihttps://doi.org/10.1007/978-981-10-3373-5_32
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/31707
dc.publisherSpringer Verlag service@springer.de
dc.subjectClosed colossal frequent patterns
dc.subjectClosed patterns
dc.subjectFrequent patterns
dc.subjectHadoop
dc.subjectHigh-dimensional datasets
dc.subjectMapReduce
dc.subjectMinimum support
dc.titleMining closed colossal frequent patterns from high-dimensional dataset: Serial versus parallel framework

Files