Conference Papers

Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/28506

Browse

Search Results

Now showing 1 - 2 of 2
  • Item
    IIMH: Intention Identification in Multimodal Human Utterances
    (Association for Computing Machinery, 2023) Keerthan Kumar, T.G.; Dhakate, H.; Koolagudi, S.G.
    Intention identification is a challenging problem in the field of natural language processing, speech processing, and computer vision. People often use contradictory or ambiguous words in different contexts, which can sometimes be very confusing to identify the intention behind an utterance. Intention identification has many practical applications in the fields of natural language processing, sentiment analysis, social media analysis, robotics, and human-computer interaction, where valuable insights into user behavior can be achieved by identifying intention. In this work, we propose a model to determine whether an utterance made by a person is intentional or not intentional. To achieve this, we collected a multimodal dataset containing text, video, and speech from various TV shows, movies, and YouTube videos and labeled them with their corresponding intention. Feature extraction is done at both utterance and word levels to get useful information from all three modalities. We trained the baseline model using SVM to set a benchmark performance. We designed an architecture to detect the contradiction between positive spoken words with negative facial expressions or speech to identify an utterance as non-intentional. Along with the architecture, we used different approaches for classification and got the best results with the Support vector machine (SVM) classifier using RBF kernel, with an accuracy of 78.83% and proven to be better compared to the baseline approach. © 2023 ACM.
  • Item
    SanS: Classification of Sanskrit Mantras using Speech Processing
    (Association for Computing Machinery, 2024) Keerthan Kumar, T.G.; Udaya, S.; Koolagudi, S.G.
    Chandas is a classification metric used in Sanskrit Mantras. They are essentially meant to maintain a form of rhythm for each and every mantra, which gives the hymns their distinctive chanting pattern. Sanskrit hymns (mantras) have many different classifications. Often, before chanting, they are invoked as such in the order of Rishi (author of the hymn), Chanda (rhythm), Devata (god to which invoked), and Viniyoga (use of such hymn). One such example is the Gayatri hymn, which is from Gayatri Chanda. Other examples are verses of the Bhagavad Gita, which are entirely in Anushtup Chanda. Chandas is an essential component of Sanskrit Mantras and forms an integral part of it. Knowledge of Chandas is essential for the study of Sanskrit Poetry, and without knowing what a Chanda is, one cannot analyze Sanskrit hymns. Essentially, Chandas are the meters used to keep track of the mantras. Chandas formulate the rhythm of the mantra and the way that it should be chanted. Based on the number of syllables, there are seven different Chandas, and mantras are usually classified into one of these seven. Identification of Chandas is usually specified in the beginning before chanting; however, in cases where the Chandas are not specified, one may have to do it manually, which may be cumbersome, especially for Chandas with many syllables. In this work, we propose a novel approach called Classification of Sanskrit Mantras using Speech Processing Technology (SanS) to determine the Chanda of a mantra, given the audio file of a Sanskrit mantra and using a Wavenet architecture. The proposed SanS predicts the type of Chanda by counting the number of syllables, giving 81.57% accuracy compared to other works. © 2024 Copyright held by the owner/author(s).