Categorizing Relations via Semi-supervised Learning Using a Hybrid Tolerance Rough Sets and Genetic Algorithm Approach

Date

2022

Publisher

Springer Science and Business Media Deutschland GmbH

Abstract

Over the last few decades, the amount of data available on the web has increased tremendously, and there have been significant advances in constructing knowledge bases of relations extracted from text. These relations are words in the text, often represented as pairs (Noun, Context), for example (Disease, Symptom), that can be classified into predefined categories to yield useful information. Categorization of relations using a tolerance rough set-based semi-supervised learning algorithm, the Tolerant Pattern Learner (TPL), has been successfully demonstrated in several works. However, the automatic selection of the TPL algorithm's hyperparameters remains an unexplored problem. This paper proposes a genetic algorithm-based approach (TPL-GA) for optimizing the hyperparameters that are fundamental to the TPL algorithm. The proposed approach was tested on two standard datasets drawn from different domains and covering two languages, English and Hindi. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
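For readers unfamiliar with genetic-algorithm hyperparameter search, the following is a minimal, generic sketch and not the TPL-GA procedure from the paper. The search space (two real-valued hyperparameters in [0, 1]), the `toy_fitness` function, and all GA settings (population size, tournament selection, uniform crossover, Gaussian mutation, elitism) are assumptions chosen for illustration only; in a real setting the fitness would presumably come from training and evaluating the learner with the candidate hyperparameters.

```python
import random

# Hypothetical search space: two real-valued hyperparameters in [0, 1].
BOUNDS = [(0.0, 1.0), (0.0, 1.0)]


def toy_fitness(params):
    """Placeholder score with its maximum at (0.3, 0.7).

    Stands in for an evaluation of the learner; not taken from the paper.
    """
    a, b = params
    return -((a - 0.3) ** 2 + (b - 0.7) ** 2)


def clip(value, low, high):
    """Keep a gene inside its allowed range."""
    return max(low, min(high, value))


def run_ga(fitness, bounds, pop_size=20, generations=40,
           mutation_rate=0.2, seed=0):
    """Generic GA: elitism, tournament selection, uniform crossover,
    Gaussian mutation. Returns the best individual found."""
    rng = random.Random(seed)
    population = [[rng.uniform(lo, hi) for lo, hi in bounds]
                  for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        next_pop = ranked[:2]  # elitism: carry over the two best unchanged
        while len(next_pop) < pop_size:
            # Tournament selection: best of three random individuals.
            p1 = max(rng.sample(population, 3), key=fitness)
            p2 = max(rng.sample(population, 3), key=fitness)
            # Uniform crossover: each gene comes from either parent.
            child = [x if rng.random() < 0.5 else y for x, y in zip(p1, p2)]
            # Gaussian mutation, clipped back into the bounds.
            child = [
                clip(g + rng.gauss(0, 0.1), lo, hi)
                if rng.random() < mutation_rate else g
                for g, (lo, hi) in zip(child, bounds)
            ]
            next_pop.append(child)
        population = next_pop
    return max(population, key=fitness)


best = run_ga(toy_fitness, BOUNDS)
print(best, toy_fitness(best))
```

Because the two best individuals are copied into each new generation, the best score never decreases from one generation to the next; the fixed seed only makes the sketch reproducible.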

Keywords

Co-occurrence matrix, Genetic algorithm, Semi-supervised learning, Tolerance rough sets, Tolerant pattern learner (TPL)

Citation

Studies in Fuzziness and Soft Computing, 2022, Vol. 413, pp. 103-116