Classification of Censored Tweets in Chinese Language using XLNet

No Thumbnail Available

Date

2021

Journal Title

Journal ISSN

Volume Title

Publisher

Association for Computational Linguistics (ACL)

Abstract

In the growth of today’s world and advanced technology, social media networks play a significant role in impacting human lives. Censorship is the overthrowing of speech, public transmission, or other details that play a vast role in social media. The content may be considered harmful, sensitive, or inconvenient. Authorities like institutes, governments, and other organizations conduct Censorship. This paper has implemented a model that helps classify censored and uncensored tweets as a binary classification. The paper describes submission to the Censorship shared task of the NLP4IF 2021 workshop. We used various transformer-based pre-trained models, and XLNet outputs a better accuracy among all. We fine-tuned the model for better performance and achieved a reasonable accuracy, and calculated other performance metrics. © 2021 Association for Computational Linguistics.

Description

Keywords

Citation

NLP4IF 2021 - NLP for Internet Freedom: Censorship, Disinformation, and Propaganda, Proceedings of the 4th Workshop, 2021, Vol., , p. 136-139

Endorsement

Review

Supplemented By

Referenced By