Sarcasm Detection in Tamil Code-Mixed Data Using Transformers
No Thumbnail Available
Date
2024
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Science and Business Media Deutschland GmbH
Abstract
Social media analytics has been increasingly gaining popularity due to the extensive amount of customer data it offers, benefiting businesses of all sizes, from local ventures to global brands. Analysing textual contents aids context understanding and also enables content moderation to maintain a positive user experience. Sarcasm detection in social media is essential to maintain constructive and respectful online communication, preventing misunderstandings, minimizing conflicts, and fostering a positive and inclusive digital environment. We propose a Transformer based model for sarcasm detection in Tamil code-mixed text. The model consists of two custom-designed layers: Encoder and Embedding layer. It incorporates multi-head self-attention layer and feed-forward neural networks, followed by normalisation and dropout layers. The proposed model has outperformed compared to other state-of-art models for sarcasm detection by achieving an impressive weighted F<inf>1</inf> score of 0.77. This proposed model effectively addressed the unique challenges posed by the Tamil code-mixed text. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
Description
Keywords
Sarcasm, Tamil code-mix, Text Classification, Transformer-based Models
Citation
Communications in Computer and Information Science, 2024, Vol.2046 CCIS, , p. 430-442
