Categorizing Relations via Semi-supervised Learning Using a Hybrid Tolerance Rough Sets and Genetic Algorithm Approach
Date
2022
Publisher
Springer Science and Business Media Deutschland GmbH
Abstract
In the last few decades, we have seen a tremendous increase in the amount of data available on the web. There have been significant advances in constructing knowledge bases consisting of relations extracted from text data. These relations are words in the text, often represented as pairs (Noun, Context), for example (Disease, Symptom), which can be classified into predefined categories to yield useful information. Categorization of relations using a tolerance rough set-based semi-supervised learning algorithm (TPL) has been successfully demonstrated in several works. However, an unexplored problem is the automatic selection of the hyper-parameters of the TPL algorithm. This paper proposes a genetic algorithm-based approach (TPL-GA) for optimizing the hyper-parameters that are fundamental to the TPL algorithm. The proposed approach was tested on two standard datasets drawn from different domains and representing two different languages: English and Hindi. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Keywords
Co-occurrence matrix, Genetic algorithm, Semi-supervised learning, Tolerance rough sets, Tolerant pattern learner (TPL)
Citation
Studies in Fuzziness and Soft Computing, 2022, Vol. 413, pp. 103-116
