Findings of the Second Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

dc.contributor.authorRavikiran, M.
dc.contributor.authorGanesh, A.
dc.contributor.authorAnand Kumar, M.
dc.contributor.authorRajalakshmi, R.
dc.contributor.authorChakravarthi, B.R.
dc.date.accessioned2026-02-06T06:34:36Z
dc.date.issued2023
dc.description.abstractMaintaining effective control over offensive content is essential on social media platforms to foster constructive online discussions. Yet, when it comes to code-mixed Dravidian languages, the current prevalence of offensive content moderation is restricted to categorizing entire comments, failing to identify specific portions that contribute to the offensiveness. Such limitation is primarily due to the lack of annotated data and open source systems for offensive spans. To alleviate this issue, in this shared task, we offer a collection of Tamil-English code-mixed social comments that include offensive comments. This paper provides an overview of the released dataset, the algorithms employed, and the outcomes achieved by the systems submitted for this task. © DravidianLangTech 2023 - 3rd Workshop on Speech and Language Technologies for Dravidian Languages, associated with 14th International Conference on Recent Advances in Natural Language Processing, RANLP 2023 - Proceedings.
dc.identifier.citationDravidianLangTech 2023 - 3rd Workshop on Speech and Language Technologies for Dravidian Languages, associated with 14th International Conference on Recent Advances in Natural Language Processing, RANLP 2023 - Proceedings, 2023, Vol., , p. 52-58
dc.identifier.urihttps://doi.org/10.26615/978-954-452-085-4_007
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/29331
dc.publisherIncoma Ltd
dc.titleFindings of the Second Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

Files