Impact of Vector Embeddings on the Performance of Tolerance Near Sets-based Sentiment Classifier for Text Classification
No Thumbnail Available
Date
2023
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier B.V.
Abstract
In recent years, Natural Language Processing (NLP) has gained significant attention, and sentiment analysis is an essential subfield of NLP that deals with identifying the sentiment or emotion conveyed in the text. Tolerance near sets (TNS) is a mathematical framework that has shown promising results in sentiment analysis tasks. However, the choice of word embeddings can significantly impact the performance of TNS-based classifiers. This paper investigates the impact of using different embeddings on the performance of tolerance near sets-based sentiment classifiers. This paper compares the use of different embeddings, including DistilBERT, MiniLM, and Word Embeddings, and their combinations, to understand their impact on TNS-based sentiment analysis. The TSC 2.0 model proposed in this paper achieves a weighted F1 score of 92.1% in one of the datasets, an improvement due to the sentence embeddings used. Experimental results have led to the observation that tie-breaking and variance-based classification may have led to a noticeable improvement in cases with more than three. © 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0)
Description
Keywords
Embeddings, Near Sets, Sentiment Prediction, Text Classification, Tolerance Classes
Citation
Procedia Computer Science, 2023, Vol.225, , p. 645-654
