Conference Papers
Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/28506
Browse
4 results
Search Results
Item Covid-19 Fake News Detector using Hybrid Convolutional and Bi-LSTM Model(Institute of Electrical and Electronics Engineers Inc., 2021) Surendran, P.; Balamuralidhar, B.; Kambham, H.; Anand Kumar, M.Fake news is essentially incorrect and deceiving information presented to the public as news with the motive of tarnishing the reputations of individuals and organizations. In today's world, where we are so closely connected due to the internet, we see a boom in the development of social networking platforms and, thus, the amount of news circulated over the internet. We must keep in mind that fake news circulated on social media and other platforms can cause problems and false alarms in society. In some cases, false information can cause panic and have a dangerous effect on society and the people who believe it to be true. Along with the virus, the Covid-19 pandemic has also brought on distribution and spreading of misinformation. Claims of fake cures, wrong interpretations of government policies, false statistics, etc., bring about a need for a fact-checking system that keeps the circulating news in control. This work examines multiple models and builds an Artificial Intelligence system to detect Covid-19 fake news using a deep neural network. © 2021 IEEE.Item Long Short Term Memory Networks for Lexical Normalization of Tweets(Institute of Electrical and Electronics Engineers Inc., 2021) Nayak, P.; Praueeth, G.; Kulkarni, R.; Anand Kumar, M.Lexical normalization is converting a non-standard text into a standard text that is more readable and universal. Data obtained from social media sites and tweets often contain much noise and use non-canonical sentence structures such as non-standard abbrevlatlons, skipping of words, spelling errors, etc. Hence such data needs to be appropriately processed before it can be used. The processing can be done by lexical normalization, which reduces randomness and converts the sentence structure to a predefined standard. Hence. lexical normalization can help in improving the performance of systems that use user-generated text as inputs. There are several ways to perform lexical normalization, such as dictionary lookups, most frequent replacements, etc. However, VVe aim to explore the domain of deep learning to find approaches that can be used to normalize texts lexically. © 2021 IEEE.Item LSTM-Attention Architecture for Online Bilingual Sexism Detection(CEUR-WS, 2023) Ravi, S.; Kelkar, S.; Anand Kumar, M.The paper describes the results submitted by ‘Team-SMS’ at EXIST 2023. A dataset of 6920 tweets for training, 1038 for validation, and 2076 tweets for testing was provided by the task organizers to train and test our models. Our models include LSTM models coupled with attention layers and without attention. For calculation of soft scores according to the task we tried to mimic human performance by taking an average of different machine learning model predictions using Multinomial Naive Bayes, Linear Support Vector Classifier, Multi Layer Perceptron, XGBoost, LSTM using GloVe embeddings, and LSTM using fastText embeddings. We discuss our approach to remove the ambiguity in the labeling process and detailed description of our work. © 2023 Copyright for this paper by its authors.Item Hate Speech Detection Using Audio in Portuguese Language(Springer Science and Business Media Deutschland GmbH, 2024) Tembe, L.A.; Anand Kumar, M.This study focuses on hate speech in Portuguese language using audio and introduces a novel methodology that integrates audio-to-text and self-image technologies to effectively tackle this problem. We utilize Machine Learning and Deep Learning models to differentiate between hate speech and normal speech. The research utilized a total of 200 datasets, which were categorized into hate speech and normal speech. These datasets were collected by me personally for this project. Four distinct models are presented in the analysis: LSTM, SVM, CNN, and Random Forest. The findings highlight the superior performance of the CNN model when applied to spectrogram data, achieving an accuracy rate of 90%. Conversely, the Random Forest model outperforms others when dealing with text data, achieving an impressive accuracy rate of 73.1%. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
