Please use this identifier to cite or link to this item:
Title: Constructing an enriched domain taxonomy for Hindi using word embeddings
Authors: Keshava, V.
Avvara, P.
Sowmya, Kamath S.
Geetha, V.
Issue Date: 2018
Citation: Proceedings of the 2017 International Conference on Asian Language Processing, IALP 2017, 2018, Vol.2018-January, , pp.127-130
Abstract: Domain-specific taxonomies constitute a valuable resource as they offer extensive support in information retrieval related activities like browsing, searching, recommendations and personalization. Such taxonomies can bridge the gap between the lack of domain-specific querying knowledge in potential users and the actual content. In case of multilingual content, taxonomies can play a pivotal role in boosting search performance for content across language barriers. In this paper, a domain-agnostic framework for building an evolving, domain-specific taxonomy for the Hindi, given a set of well-organized data points is proposed. The approach is intended for designing a hierarchical taxonomy enriched with synonyms and other morphological variants using WordNet and Word2vec models respectively. The hierarchical structure acts as a base which binds the taxonomy to a given domain. Such enrichment can improve taxonomy coverage within the given domain. The focus is also on building a taxonomy that can self-evolve over time, with high precision and recall, with minimal manual effort. � 2017 IEEE.
Appears in Collections:2. Conference Papers

Files in This Item:
File Description SizeFormat 
1 Constructing an Enriched Domain.pdf144.57 kBAdobe PDFThumbnail

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.