Frequent pattern mining on stream data using Hadoop CanTree-GTree

dc.contributor.author	Kusumakumari, V.
dc.contributor.author	Sherigar, D.
dc.contributor.author	Chandran, R.
dc.contributor.author	Patil, N.
dc.date.accessioned	2026-02-06T06:38:54Z
dc.date.issued	2017
dc.description.abstract	The need for knowledge discovery from real-time stream data is continuously increasing, and processing transactions to mine patterns requires efficient data structures and algorithms. We propose a time-efficient Hadoop CanTree-GTree algorithm, implemented on Apache Hadoop. The algorithm mines the complete set of frequent item sets (patterns) from real-time transactions using the sliding window technique. These item sets are then mined for closed frequent item sets, from which association rules are derived. The algorithm makes use of two data structures: CanTree and GTree. The results show that the Hadoop implementation of the algorithm performs 5 times better than the Java implementation. © 2017 The Author(s).
dc.identifier.citation	Procedia Computer Science, 2017, Vol. 115, p. 266-273
dc.identifier.issn	18770509
dc.identifier.uri	https://doi.org/10.1016/j.procs.2017.09.134
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/31967
dc.publisher	Elsevier B.V.
dc.subject	CanTree
dc.subject	Frequent item sets
dc.subject	GTree
dc.subject	Hadoop
dc.subject	Stream data mining
dc.title	Frequent pattern mining on stream data using Hadoop CanTree-GTree
