Frame-Level Audio Hate Speech Detection for Kannada Language

Gubbi, A.V.; Pandey, G.; Koolagudi, S.G.

Frame-Level Audio Hate Speech Detection for Kannada Language

dc.contributor.author	Gubbi, A.V.
dc.contributor.author	Pandey, G.
dc.contributor.author	Koolagudi, S.G.
dc.date.accessioned	2026-02-06T06:33:29Z
dc.date.issued	2025
dc.description.abstract	Detecting hate speech in audio has become increasingly challenging due to the increasing use of internet platforms and digital communication. Through this study, we develop an audio-based speech classifier to facilitate the detection of hate speech in the Kannada language. We present an approach to classifying hate speech at the frame level by extracting audio features such as Mel-Frequency Cepstral Coefficients (MFCCs), spectral bandwidth, spectral contrast, and chroma features. Furthermore, we present a custom Kannada hate speech dataset to address the scarcity of resources for hate speech studies in the Kannada language. We collected over 40 minutes of audio samples from YouTube and X (formerly Twitter). Our experiments show that an optimized XGBoost model achieved an accuracy of 73% on the custom dataset for frame-level classification. We also propose a cascading classifiers approach with two classifiers to exploit the locality of hate speech that improves the accuracy to 77%. Finally, we benchmark the proposed model against Logistic Regression, SVM, and XGBoost models. Â© 2025 IEEE.
dc.identifier.citation	2025 International Conference on Artificial Intelligence and Data Engineering, AIDE 2025 - Proceedings, 2025, Vol., , p. 731-736
dc.identifier.uri	https://doi.org/10.1109/AIDE64228.2025.10987338
dc.identifier.uri	https://idr.nitk.ac.in/handle/123456789/28685
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.subject	Audio Classification
dc.subject	Cascading Classifiers
dc.subject	Hate Speech Detection
dc.subject	Kannada Dataset
dc.subject	Machine Learning
dc.title	Frame-Level Audio Hate Speech Detection for Kannada Language

Collections

Conference Papers

Frame-Level Audio Hate Speech Detection for Kannada Language

Files

Collections