EfficientTreeMiner: Mining frequent induced substructures from XML documents without candidate generation

Santhi Thilagam, P.; Ananthanarayana, V.S.

EfficientTreeMiner: Mining frequent induced substructures from XML documents without candidate generation

Files

7929.pdf (2.63 MB)

Date

2006

Authors

Santhi Thilagam, P.

Ananthanarayana, V.S.

Abstract

Tree structures are used extensively in domains such as XML databases, computational biology, pattern recognition, computer networks, web mining, multi-relational data mining and so on. In this paper, we present an EfficientTreeMiner, a computationally efficient algorithm that discovers all frequently occurring induced subtrees in a database of labeled rooted unordered trees. The proposed algorithm mines frequent subtrees without generating any candidate subtrees. Efficiency is achieved by compressing the large database into a condensed data structure, namely prefix string representation, which reduces space complexity and by adopting a Frequent Immediate Descendents method that avoids the costly generation of candidate sets. Experimental results show that our algorithm has less time complexity when compared to existing approaches and is also scalable for mining both long and short frequent subtrees. � 2006 IEEE.

Citation

Proceedings - 2006 14th International Conference on Advanced Computing and Communications, ADCOM 2006, 2006, Vol., , pp.541-546

URI

https://idr.nitk.ac.in/jspui/handle/123456789/7929

Collections

2. Conference Papers

Full item page

EfficientTreeMiner: Mining frequent induced substructures from XML documents without candidate generation

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By