Faculty Publications
Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736
Publications by NITK Faculty
Browse
11 results
Search Results
Item Clustering and bootstrapping based framework for news knowledge base completion(Slovak Academy of Sciences, 2021) Srinivasa, K.; Santhi Thilagam, P.S.Extracting the facts, namely entities and relations, from unstructured sources is an essential step in any knowledge base construction. At the same time, it is also necessary to ensure the completeness of the knowledge base by incrementally extracting the new facts from various sources. To date, the knowledge base completion is studied as a problem of knowledge refinement where the missing facts are inferred by reasoning about the information already present in the knowledge base. However, facts missed while extracting the information from multilingual sources are ignored. Hence, this work proposed a generic framework for knowledge base completion to enrich a knowledge base of crime-related facts extracted from online news articles in the English language, with the facts extracted from low resourced Indian language Hindi news articles. Using the framework, information from any low-resourced language news articles can be extracted without using language-specific tools like POS tags and using an appropriate machine translation tool. To achieve this, a clustering algorithm is proposed, which explores the redundancy among the bilingual collection of news articles by representing the clusters with knowledge base facts unlike the existing Bag of Words representation. From each cluster, the facts extracted from English language articles are bootstrapped to extract the facts from comparable Hindi language articles. This way of bootstrapping within the cluster helps to identify the sentences from a low-resourced language that are enriched with new information related to the facts extracted from a high-resourced language like English. The empirical result shows that the proposed clustering algorithm produced more accurate and high-quality clusters for monolingual and cross-lingual facts, respectively. Experiments also proved that the proposed framework achieves a high recall rate in extracting the new facts from Hindi news articles. © 2021 Slovak Academy of Sciences. All rights reserved.Item FLAG: fuzzy logic augmented game theoretic hybrid hierarchical clustering algorithm for wireless sensor networks(Springer, 2022) Naik, C.; Shetty D, P.D.Stability of the wireless sensor network (WSN) is the most critical factor in real-time and data-sensitive applications like military and surveillance systems. Many energy optimization techniques and algorithms have been proposed to extend the stability of a wireless sensor network. Clustering is a well regarded method in the research communities among them. Hence, this paper presents hybrid hierarchical artificial intelligence based clustering techniques, named FLAG and I-FLAG. The first phase of these algorithms use game-theoretic technique to elect suitable cluster heads (CHs) and later phase of the algorithms use fuzzy inference system to select appropriate super cluster heads (SCHs) among CHs. The I-FLAG is an improved version of FLAG where additional parameters like energy and distance are considered to elect CHs. Simulations are performed to check superiority of the proposed algorithms over the existing protocols like LEACH, CHEF, and CROSS. Simulation results show that the average stability period of WSN is better in FLAG and I-FLAG compared to other protocols, and so is the throughput of WSN during the stability period. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.Item Monitoring COVID-19 Cases and Vaccination in Indian States and Union Territories Using Unsupervised Machine Learning Algorithm(Springer Science and Business Media Deutschland GmbH, 2023) Chakraborty, S.The worldwide spread of the novel coronavirus originating from Wuhan, China led to an ongoing pandemic as COVID-19. The disease being a contagion transmitted rapidly in India through the people having travel histories to the affected countries, and their contacts that tested positive. Millions of people across all states and union territories (UT) were affected leading to serious respiratory illness and deaths. In the present study, two unsupervised clustering algorithms namely k-means clustering and hierarchical agglomerative clustering are applied on the COVID-19 dataset in order to group the Indian states/UTs based on the pandemic effect and the vaccination program from the period of March, 2020 to early June, 2021. The aim of the study is to observe the plight of each state and UT of India combating the novel coronavirus infection and to monitor their vaccination status. The research study will be helpful to the government and to the frontline workers coping to restrict the transmission of the virus in India. Also, the results of the study will provide a source of information for future research regarding the COVID-19 pandemic in India. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.Item Canopy centre-based fuzzy-C-means clustering for enhancement of soil fertility prediction(Inderscience Publishers, 2024) Sujatha, M.; Jaidhar, C.D.For plants to develop, fertile soil is necessary. Estimating soil parameters based on time change is crucial for enhancing soil fertility. Sentinel-2’s remote sensing technology produces images that can be used to gauge soil parameters. In this study, values for soil parameters such as electrical conductivity, pH, organic carbon, and nitrogen are derived using Sentinel-2 data. In order to increase the clustering accuracy, this study suggests using Canopy centre-based fuzzy-C-means clustering and comparing it to manual labelling and other clustering techniques such as Canopy, density-based, expectation-maximisation, farthest-first, k-means, and fuzzy-C-means clustering, its usefulness is demonstrated. The proposed clustering achieved the highest clustering accuracy of 78.42%. Machine learning-based classifiers were applied to classify soil fertility, including Naive Bayes, support vector machine, decision trees, and random forest (RF). Dataset labelled with the proposed RF clustering classifier achieves a high classification accuracy of 99.69% with ten-fold cross-validation. © 2024 Inderscience Enterprises Ltd.. All rights reserved.Item Cluster-Based Multi-attribute Routing Protocol for Underwater Acoustic Sensor Networks(Springer, 2024) Nazareth, P.; Chandavarkar, B.R.Underwater Acoustic Sensor Networks play a significant role in various underwater applications. There are several challenges in underwater communications like high bit-error-rate, low bandwidth, high energy consumption, void-node during routing, etc. Handling void-node during routing is a major challenge in underwater routing. There are well-known void-handling protocols like Energy-efficient Void-Aware Geographic Routing protocol, HydroCast, etc. However, these routing protocols require all neighboring nodes must be a part of the cluster which increases the overhead on clustering, or void-node has a part of the routing. This paper proposes an underwater routing protocol referred to as Cluster-based Multi-Attribute Routing (CMAR) to overcome these issues. It is a sender-based, opportunistic underwater routing protocol. CMAR uses the Technique for Order of Preference by Similarity to Ideal Solution to evaluate the suitability of the neighboring nodes and the basis for clustering process initialization. Through MATLAB simulations, the performance of the CMAR is compared with HydroCast in terms of the number of nodes selected in the forwarding set, number of clusters formed, number of times void-node becomes part of routing and transmission reliability. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.Item Priority-Driven Resource Allocation and Power Optimization in D2D Communication(Institute of Electrical and Electronics Engineers Inc., 2024) Raghu, T.V.; Manjappa, M.This research proposes priority-driven application-based channel assignment and power optimization frameworks called Channel State Information-based Resource Allocation (CSIRA) and Binary Search Power Control Mechanism (BSPCM) in D2D-enabled cellular communication. The CSIRA framework is cluster-based and uses a K-means clustering algorithm to group the D2D users into clusters. CSIRA allows the D2D users to share the cellular user's resources without compromising the cellular user's Quality of Service (QoS) in each cluster. Also, CSIRA ensures that public safety communication will get an edge over commercial communication during resource allocation. In order to ensure the QoS for cellular users is maintained while also enhancing the sum rate of D2D communication, the CSIRA employs the BSPCM framework. BSPCM framework utilizes a binary search algorithm to determine the optimal transmission power required for guaranteed D2D transmission within a cluster, thereby mitigating interference effects. A theoretical proof is provided to show that the suggested frameworks converge to a stable matching and end after a finite number of iterations. Simulation results demonstrate that the proposed frameworks effectively prioritizes public safety over commercial applications while preserving optimal system efficiency and quality with minimal complications. © 2017 IEEE.Item Solar irradiation forecast enhancement using clustering based CNN-BiLSTM-attention hybrid architecture with PSO(Taylor and Francis Ltd., 2024) Chiranjeevi, M.; Madyastha, A.; Maurya, A.K.; Moger, T.; Jena, D.Accurate solar irradiation forecasting is essential for optimising solar energy use. This paper presents a novel forecasting approach: the ‘Clustering-based CNN-BiLSTM-Attention Hybrid Architecture with PSO’. It combines clustering, attention mechanisms, Convolutional Neural Networks (CNN), Bidirectional Long-Short Term Memory (BiLSTM) networks, and Particle Swarm Optimisation (PSO) into a unified framework. Clustering categorises days into groups, improving predictive capabilities. The CNN-BiLSTM model captures spatial and temporal features, identifying complex patterns. PSO optimises the hybrid model’s hyperparameters, while an attention mechanism assigns probability weights to relevant information, enhancing performance. By leveraging spatial and temporal patterns in solar data, the proposed model improves forecasting accuracy in univariate and multivariate analyses with multi-step predictions. Extensive tests on real-world datasets from various locations show the model’s effectiveness. For example, with NASA power data, the model achieves a Mean Absolute Error (MAE) of 24.028 W/m2, Root Mean Square Error (RMSE) of 43.025 W/m2, and an R2 score of 0.984 for 1-hour ahead forecasting. The results show significant improvements over conventional methods. © 2024 Informa UK Limited, trading as Taylor & Francis Group.Item An Efficient Application based Many-to-Many Resource Allocation and Sharing with Power Optimization for D2D Communication - A Clustered Approach(Korean Institute of Communications and Information Sciences, 2024) Raghu, R.T.; Manjappa, K.This study aims to give an edge to public safety applications over commercial applications in an underlay cellular-assisted device-to-device (D2D) communication. The proposed framework introduces two frameworks: Cluster-based many-to-many resource allocation and resource sharing framework (CMMRARS) and constant time power control algorithm (CTPCA). The RB assigned to a CUE can share with multiple DUE pairs, and the DUE pairs can also use RB assigned to multiple CUEs under the many-to-many strategy. The CMMRARS framework is responsible for resource allocation and resource sharing and accordingly, it is further divided into three sub-problems. The CTPCA framework is divided into two sub-problems and used to find optimal power for cellular users and D2D transmitters to avoid cross-tier and co-tier interference. The K-means clustering algorithm is employed to form application-specific clusters, and it ensures that more cellular users fall into the public safety clusters so that the D2D users will get more resource-sharing options. Cellular users use a weighted bipartite graph to form a priority list of D2D users for resource sharing. The main objective of the proposed work is to enhance the system’s sum rate by simultaneously reusing the same resource by multiple D2D pairs and safeguarding the Quality of Services provided to all kinds of network users. A theoretical justification is presented to ensure that the proposed frameworks terminate after a certain number of runs and congregate to a consistent matching. Simulation results show that the proposed method influences the overall system’s sum rate and provides a preference for public safety applications over commercial applications. © 2024 KICS.Item IoT energy efficiency routing protocol using FHO-based clustering and improved CSO model-based routing in MANET(John Wiley and Sons Ltd, 2024) Sanshi, S.; Karthik, N.; Vatambeti, R.Many protocols, services, and electrical devices with built-in sensors have been developed in response to the rapid expansion of the Internet of Things. Mobile ad hoc networks (MANETs) consist of a collection of autonomous mobile nodes that can form an ad hoc network in the absence of any pre-existing infrastructure. System performance may suffer due to the changeable topology of MANETs. Since most mobile hosts operate on limited battery power, energy consumption poses the biggest challenge for MANETs. Both network lifetime and throughput improve when energy usage is reduced. However, existing approaches perform poorly in terms of energy efficiency. Scalability becomes a significant issue in large-scale networks as they grow, leading to overhead associated with routing updates and maintenance that can become unmanageable. This article employs a MANET routing protocol combined with an energy conservation strategy. The clustering hierarchy is used in MANETs to maximize the network's lifespan, considering its limited energy resources. In the MANET communication process, the cluster head (CH) is selected using Fire Hawk Optimization (FHO). When choosing nodes to act as a cluster for an extended period, CH election factors in connectivity, mobility, and remaining energy. This process is achieved using an optimized version of the Ad hoc On-Demand Distance Vector (AODV) routing protocol, utilizing Improved Chicken Swarm Optimization (ICSO). In comparison to existing protocols and optimization techniques, the proposed method offers an extended network lifespan ranging from 90 to 160 h and reduced energy consumption of 80 to 110 J, as indicated by the implementation results. © 2024 John Wiley & Sons Ltd.Item JSON document clustering based on schema embeddings(SAGE Publications Ltd, 2024) Uma Priya, D.U.; Santhi Thilagam, P.S.The growing popularity of JSON as the data storage and interchange format increases the availability of massive multi-structured data collections. Clustering JSON documents has become a significant issue in organising large data collections. Existing research uses various structural similarity measures to perform clustering. However, differently annotated JSON structures may also encode semantic relatedness, necessitating the use of both syntactic and semantic properties of heterogeneous JSON schemas. Using the SchemaEmbed model, this paper proposes an embedding-based clustering approach for grouping contextually similar JSON documents. The SchemaEmbed model is designed using the pre-trained Word2Vec model and a deep autoencoder that considers both syntactic and semantic information of JSON schemas for clustering the documents. The Word2Vec model learns the attribute embeddings, and a deep autoencoder is designed to generate context-aware schema embeddings. Finally, the context-based similar JSON documents are grouped using a clustering algorithm. The effectiveness of the proposed work is evaluated using both real and synthetic datasets. The results and findings show that the proposed approach improves clustering quality significantly, with a high NMI score of 75%. In addition, we demonstrate that clustering results obtained by contextual similarity are superior to those obtained by traditional semantic similarity models. © The Author(s) 2022.
