Frame-Level Audio Hate Speech Detection for Kannada Language

dc.contributor.authorGubbi, A.V.
dc.contributor.authorPandey, G.
dc.contributor.authorKoolagudi, S.G.
dc.date.accessioned2026-02-06T06:33:29Z
dc.date.issued2025
dc.description.abstractDetecting hate speech in audio has become increasingly challenging due to the increasing use of internet platforms and digital communication. Through this study, we develop an audio-based speech classifier to facilitate the detection of hate speech in the Kannada language. We present an approach to classifying hate speech at the frame level by extracting audio features such as Mel-Frequency Cepstral Coefficients (MFCCs), spectral bandwidth, spectral contrast, and chroma features. Furthermore, we present a custom Kannada hate speech dataset to address the scarcity of resources for hate speech studies in the Kannada language. We collected over 40 minutes of audio samples from YouTube and X (formerly Twitter). Our experiments show that an optimized XGBoost model achieved an accuracy of 73% on the custom dataset for frame-level classification. We also propose a cascading classifiers approach with two classifiers to exploit the locality of hate speech that improves the accuracy to 77%. Finally, we benchmark the proposed model against Logistic Regression, SVM, and XGBoost models. © 2025 IEEE.
dc.identifier.citation2025 International Conference on Artificial Intelligence and Data Engineering, AIDE 2025 - Proceedings, 2025, Vol., , p. 731-736
dc.identifier.urihttps://doi.org/10.1109/AIDE64228.2025.10987338
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/28685
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.subjectAudio Classification
dc.subjectCascading Classifiers
dc.subjectHate Speech Detection
dc.subjectKannada Dataset
dc.subjectMachine Learning
dc.titleFrame-Level Audio Hate Speech Detection for Kannada Language

Files