Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 10 of 36
  • Item
    Optimizing Network Lifetime and Energy Consumption in Homogeneous Clustered WSNs Using Quantum PSO Algorithm
    (Springer Science and Business Media Deutschland GmbH, 2021) Kanchan, P.; Shetty D, D.S.
    A Wireless Sensor Network (WSN) is a group of sensors which communicate with each other and perform some specific task. Clustering is used to conserve energy in a WSN. In this work, the aim is to minimize the energy consumption and maximize the network lifetime of a homogeneous WSN using PSO (Particle Swarm Optimization) based Clustering algorithm in conjunction with quantum computing. In quantum computing, a bit is known as a qubit and it can exist in the following states: a ‘0’, a ‘1’ or a superposition of ‘0’ and ‘1’. In this chapter, the Quantum Computing based PSO clustering algorithm for Optimizing Energy consumption and Network lifetime (QCPOEN) algorithm for homogeneous wireless sensor networks is proposed. The proposed algorithm is compared with the PSO-ECHS algorithm and the LEACH algorithm. The superiority of the algorithm can be verified from the results. © 2021, Springer Nature Singapore Pte Ltd.
  • Item
    JSON Document Clustering Based on Structural Similarity and Semantic Fusion
    (Springer Science and Business Media Deutschland GmbH, 2023) Uma Priya, D.; Santhi Thilagam, P.S.
    The emerging drift toward real-time applications generates massive amounts of JSON data exponentially over the web. Dealing with the heterogeneous structures of JSON document collections is challenging for efficient data management and knowledge discovery. Clustering JSON documents has become a significant issue in organizing large data collections. Existing research has focused on clustering JSON documents using structural or semantic similarity measures. However, differently annotated JSON structures are also related by the context of the JSON attributes. As a result, existing research work is unable to identify the context hidden in the schemas, emphasizing the importance of leveraging the syntactic, semantic, and contextual properties of heterogeneous JSON schemas. To address the specific research gap, this work proposes JSON Similarity (JSim), a novel approach for clustering JSON documents by combining the structural and semantic similarity scores of JSON schemas. In order to capture more semantics, the semantic fusion method is proposed, which correlates schemas using semantic as well as contextual similarity measures. The JSON documents are clustered based on the weighted similarity matrix. The results and findings show that the proposed approach outperforms the current approaches significantly. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
  • Item
    A novel data structure for efficient representation of large data sets in data mining
    (2006) Pai, R.M.; Ananthanarayana, V.S.
    An important goal in data mining is to generate an abstraction of the data. Such an abstraction helps in reducing the time and space requirements of the overall decision making process. It is also important that the abstraction be generated from the data in small number of scans. In this paper, we propose a novel data structure called Prefix-Postfix structure(PP-structure), which is an abstraction of the data that can be built by scanning the database only once. We prove that this structure is compact, complete and incremental and therefore is suitable to represent dynamic databases. Further, we propose a clustering algorithm using this structure. The proposed algorithm is tested on different real world datasets and is shown that the algorithm is both space efficient and time efficient for large datasets without sacrificing for the accuracy. We compare our algorithm with other algorithms and show the effectiveness of our algorithm. © 2006 IEEE.
  • Item
    Prefix-Suffix trees: A novel scheme for compact representation of large datasets
    (Springer Verlag, 2007) Pai, R.M.; Ananthanarayana, V.S.
    An important goal in data mining is to generate an abstraction of the data. Such an abstraction helps in reducing the time and space requirements of the overall decision making process. It is also important that the abstraction be generated from the data in small number of scans. In this paper we propose a novel scheme called Prefix-Suffix trees for compact storage of patterns in data mining, which forms an abstraction of the patterns, and which is generated from the data in a single scan. This abstraction takes less amount of space and hence forms a compact storage of patterns. Further, we propose a clustering algorithm based on this storage and prove experimentally that this type of storage reduces the space and time. This has been established by considering large data sets of handwritten numerals namely the OCR data, the MNIST data and the USPS data. The proposed algorithm is compared with other similar algorithms and the efficacy of our scheme is thus established. © Springer-Verlag Berlin Heidelberg 2007.
  • Item
    Medical image segmentation using improved mountain clustering technique version-2
    (2010) Verma, N.K.; Roy, A.; Vasikarla, S.
    This paper proposes Improved Mountain Clustering version-2 (IMC-2) based medical image segmentation. The proposed technique is a more powerful approach for medical image based diagnosing diseases like brain tumor, tooth decay, lung cancer, tuberculosis etc. The IMC-2 based medical image segmentation approach has been applied on various categories of images including MRI images, dental X-rays, chest X-rays and compared with some widely used segmentation techniques such as K-means, FCM and EM as well as with IMC-1. The performance of all these segmentation approaches is compared on widely accepted validation measure, Global Silhouette Index. Also, the segments obtained from the above mentioned segmentation approaches have been visually evaluated. © 2010 IEEE.
  • Item
    Alignment based similarity distance measure for better web sessions clustering
    (Elsevier B.V., 2011) Poornalatha, G.; Raghavendra, P.S.
    The evolution of the internet along with the popularity of the web has attracted a great attention among the researchers to web usage mining. Given that, there is an exponential growth in terms of amount of data available in the web that may not give the required information immediately; web usage mining extracts the useful information from the huge amount of data available in the web logs that contain information regarding web pages accessed. Due to this huge amount of data, it is better to handle small group of data at a time, instead of dealing with entire data together. In order to cluster the data, similarity measure is essential to obtain the distance between any two user sessions. The objective of this paper is to propose a technique, to measure the similarity between any two user sessions based on sequence alignment technique that uses the dynamic programming method. © 2011 Published by Elsevier Ltd.
  • Item
    Web page prediction by clustering and integrated distance measure
    (2012) Poornalatha, G.; Raghavendra, S.R.
    The tremendous progress of the internet and the World Wide Web in the recent era has emphasized the requirement for reducing the latency at the client or the user end. In general, caching and prefetching techniques are used to reduce the delay experienced by the user while waiting to get the web page from the remote web server. The present paper attempts to solve the problem of predicting the next page to be accessed by the user based on the mining of web server logs that maintains the information of users who access the web site. The prediction of next page to be visited by the user may be pre fetched by the browser which in turn reduces the latency for user. Thus analyzing user's past behavior to predict the future web pages to be navigated by the user is of great importance. The proposed model yields good prediction accuracy compared to the existing methods like Markov model, association rule, ANN etc. © 2012 IEEE.
  • Item
    Clustering using levy flight cuckoo search
    (Springer Verlag service@springer.de, 2013) Senthilnath, J.; Das, V.; Omkar, S.N.; Mani, V.
    In this paper, a comparative study is carried using three nature-inspired algorithms namely Genetic Algorithm (GA), Particle Swarm Optimization (PSO) and Cuckoo Search (CS) on clustering problem. Cuckoo search is used with levy flight. The heavy-tail property of levy flight is exploited here. These algorithms are used on three standard benchmark datasets and one real-time multi-spectral satellite dataset. The results are tabulated and analysed using various techniques. Finally we conclude that under the given set of parameters, cuckoo search works efficiently for majority of the dataset and levy flight plays an important role. © 2013 Springer.
  • Item
    Automatic generation of web service composition templates using WSDL descriptions
    (Springer Verlag service@springer.de, 2015) Kamath S․, S.; Alse, S.; Prasad, P.; Chennagiri, A.R.
    Due to the extensive use and increase in the number of published web services, clustering and automatic tagging of web services to facilitate efficient discovery of web services is crucial. Discovering composite services has gained importance as there is a need for integrating web services to meet complex service requirements. In this regard, we propose a system for clustering services based on features extracted from their WSDL documents for generating service tags and then the cluster tags. Also, based on the service requirements specified by the requester, our system can identify and generate potential composite service templates. These are basically the subgraphs of the service dependency graph generated by considering only relevant services determined by matching cluster tags and service tags with the request tokens. It was seen that the search domain for service composition was significantly reduced by clustering and tagging and the system obtained meaningful and encouraging results. © Springer India 2015.
  • Item
    Genetic algorithm based wrapper feature selection on hybrid prediction model for analysis of high dimensional data
    (Institute of Electrical and Electronics Engineers Inc., 2015) Rayasam, R.C.; Kannan, R.; Patil, N.
    Data mining concepts have been extensively used for disease prediction in the medical field. Many Hybrid Prediction Models (HPM) have been proposed and implemented in this area, however, there is always a need for increasing accuracy and efficiency. The existing methods take into account all the features to build the classifier model thus reducing the accuracy and increasing the overall processing time. This paper proposes a Genetic Algorithm based Wrapper feature selection Hybrid Prediction Model (GWHPM). This model initially uses k-means clustering technique to remove the outliers from the dataset. Further, an optimal set of features are obtained by using Genetic Algorithm based Wrapper feature selection. Finally, it is used to build the classifier models such as Decision Tree, Naive Bayes, k nearest neighbor and Support Vector Machine. A comparative study of GWHPM is carried out and it is observed that the proposed model performed better than the existing methods. © 2014 IEEE.