Prefix-Suffix trees: A novel scheme for compact representation of large datasets

Please use this identifier to cite or link to this item: https://idr.nitk.ac.in/jspui/handle/123456789/8838

Title:	Prefix-Suffix trees: A novel scheme for compact representation of large datasets
Authors:	Pai, R.M. Ananthanarayana, V.S.
Issue Date:	2007
Citation:	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, Vol.4815 LNCS, , pp.316-323
Abstract:	An important goal in data mining is to generate an abstraction of the data. Such an abstraction helps in reducing the time and space requirements of the overall decision making process. It is also important that the abstraction be generated from the data in small number of scans. In this paper we propose a novel scheme called Prefix-Suffix trees for compact storage of patterns in data mining, which forms an abstraction of the patterns, and which is generated from the data in a single scan. This abstraction takes less amount of space and hence forms a compact storage of patterns. Further, we propose a clustering algorithm based on this storage and prove experimentally that this type of storage reduces the space and time. This has been established by considering large data sets of handwritten numerals namely the OCR data, the MNIST data and the USPS data. The proposed algorithm is compared with other similar algorithms and the efficacy of our scheme is thus established. � Springer-Verlag Berlin Heidelberg 2007.
URI:	http://idr.nitk.ac.in/jspui/handle/123456789/8838
Appears in Collections:	2. Conference Papers

Files in This Item:

File	Description	Size	Format
8838.pdf		306.53 kB	Adobe PDF	View/Open