Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 10 of 34

Role of intensity of emotions for effective personalized video recommendation: A reinforcement learning approach
(Springer Verlag service@springer.de, 2018) Tripathi, A.; Manasa, D.G.; Rakshitha, K.; Ashwin, T.S.; Reddy, G.
Development of artificially intelligent agents in video recommendation systems over past decade has been an active research area. In this paper, we have presented a novel hybrid approach (combining collaborative as well as content-based filtering) to create an agent which targets the intensity of emotional content present in a video for recommendation. Since cognitive preferences of a user in real world are always in a dynamic state, tracking user behavior in real time as well as the general cognitive preferences of the users toward different emotions is a key parameter for recommendation. The proposed system monitors the user interactions with the recommended video from its user interface and web camera to learn the criterion of decision-making in real time through reinforcement learning. To evaluate the proposed system, we have created our own UI, collected videos from YouTube, and applied Q-learning to train our system to effectively adapt user preferences. © Springer Nature Singapore Pte Ltd. 2018
A novel real-time face detection system using modified affine transformation and Haar cascades
(Springer Verlag service@springer.de, 2019) Sharma, R.; Ashwin, T.S.; Guddeti, R.M.R.
Human Face Detection is an important problem in the area of Computer Vision. Several approaches are used to detect the face for a given frame of an image but most of them fail to detect the faces which are tilted, occluded, or with different illuminations. In this paper, we propose a novel real-time face detection system which detects the faces that are tilted, occluded, or with different illuminations, any difficult pose. The proposed system is a desktop application with a user interface that not only collects the images from web camera but also detects the faces in the image using a Haar-cascaded classifier consisting of Modified Census Transform features. The problem with cascaded classifier is that it does not detect the tilted or occluded faces with different illuminations. Hence to overcome this problem, we proposed a system using Modified Affine Transformation with Viola Jones. Experimental results demonstrate that proposed face detection system outperforms Viola–Jones method by 6% (99.7% accuracy for the proposed system when compare to 93.5% for Voila Jones) with respect to three different datasets namely FDDB, YALE and “Google top 25 ‘tilted face’” image datasets. © Springer Nature Singapore Pte Ltd. 2019
Multimodal group activity state detection for classroom response system using convolutional neural networks
(Springer Verlag service@springer.de, 2019) Sebastian, A.G.; Singh, S.; Manikanta, P.B.T.; Ashwin, T.S.; Guddeti, R.M.R.
Human–Computer Interaction is a crucial and emerging field in computer science. This is because computers are replacing humans in many jobs to provide services. This has resulted in the computer being needed to interact with the human in the same way as the human does with another. When humans talk to each other, they gain feedback based on how the other person responds non-verbally. Since computers are now interacting with humans, they need to be able to detect these facial cues and accordingly adjust their services based on this feedback. Our proposed method aims at building a Multimodal Group Activity State Detection for Classroom Response System which tries to recognize the learning behavior of a classroom for providing effective feedback and inputs to the teacher. The key challenges dealt here are to detect and analyze as many students as possible for a non-biased evaluation of the mood of the students and classify them into three activity states defined: Active, passive, and inactive. © Springer Nature Singapore Pte Ltd. 2019
Ember: A smartphone web browser interface for the blind
(Association for Computing Machinery, 2014) Jassi, I.S.; Ruchika, S.; Pulakhandam, S.; Mukherjee, S.; Ashwin, T.S.; Guddeti, G.R.M.
Ember is a smartphone web browser interface designed exclusively for the blind user. The Ember keypad enables blind users to type using their knowledge of Braille. The interface is intuitive to the blind user because the layout consists of a very few large targets and remains consistent throughout the application. The verbal command option provides another dimension for user-interface interaction. Twelve out of thirteen users found that Ember verbal command navigation was easier than using a traditional web browser. Ten out of thirteen users found it faster to use the Ember tactile method of navigation compared to a traditional web browser. The learning rate for both the tactile and verbal command methods was faster compared to the learning rate associated with a traditional web browser layout. Finally it was seen that five out of five users found it significantly faster to use the Ember keypad compared to the QWERTY keypad. Â© 2014 ACM.
Vision based laser controlled keyboard system for the disabled
(Association for Computing Machinery, 2014) Ahsan, H.; Prabhu, A.; Deeksha, S.D.; Domanal, S.G.; Ashwin, T.S.; Guddeti, G.R.M.
In this paper, we have proposed a novel design for a vision based unistroke keyboard system for the disabled. The keyboard layout considers the commonly used character patterns, which makes it convenient for the user to type. In addition to this, Shift functionality is provided to accommodate a larger set of characters. A webcam is positioned so as to monitor the keyboard and the characters are identified based on the laser pointer which the user can control by minor head movements. Experimental results demonstrate that the design achieves very promising results, thus establishing a baseline for such models in this domain. Â© 2014 ACM.
An android GPS-based navigation application for blind
(Association for Computing Machinery, 2014) Nisha, K.K.; Pruthvi, H.R.; Hadimani, S.N.; Guddeti, G.R.M.; Ashwin, T.S.; Domanal, S.G.
Visual Impairment makes the person depend on another person for all his works and daily chores. Through the application proposed in this paper, we aim to eliminate this dependency of a visually impaired person when travelling from one place to another. The main goal is to provide information regarding the current location, how much distance and time is required to reach the destination as well as provide the user with the directions and turns to be taken while travelling by providing continuous audio feedback in his understandable language. Â© is held by the author/owner(s).
A novel bio-inspired load balancing of virtualmachines in cloud environment
(Institute of Electrical and Electronics Engineers Inc., 2015) Ashwin, T.S.; Domanal, S.G.; Guddeti, G.R.M.
Load Balancing plays an important role in managing the software and the hardware components of cloud. In this present scenario the load balancing algorithm should be efficient in allocating the requested resource and also in the usage of the resources so that the over/underutilization of the resources will not occur in the cloud environment. In the present work, the allocation of all the available Virtual Machines is done in an efficient manner by Particle Swarm Optimization load balancing algorithm. Further, we have used cloudsim simulator to compare and analyze the performance of our algorithm. Simulation results demonstrate that the proposed algorithm distributes the load on all the available virtual machines uniformly i.e, without any under/over utilization and also the average response time is better compared to all existing scheduling algorithms. Â© 2014 IEEE.
Semantic sentiment analysis using context specific grammar
(Institute of Electrical and Electronics Engineers Inc., 2015) Bhuvan, B.M.; Rao, V.D.; Jain, S.; Ashwin, T.S.; Guddeti, G.
The increasing number of e-commerce and social networking sites are producing large amount of data pertaining to reviews of a product, restaurant etc. A keen observation reveals that the text data gathered from any social review site are specific to a context and are subjective in nature promoting varied perceptions of sentiments. The novel idea is to define context specific grammar as semantics for a particular domain. Our research aims to develop a scalable model where features obtained from matching semantic patterns are used to predict the sentiment polarity of movie reviews and also provide a sentiment score for each review. The proposed model is intended to be flexible so that it could be applied to any domain by redefining the semantics specific to that domain. There are many other models which give accuracies greater than 80% using various methods. A study suggests that 70% accurate program is as good as humans as they have varied perceptions of sentiment about a movie review as it is a subjective summary of a movie. Our model might give lesser accuracy but it uses a cognitive approach trying to catch these varied perceptions by learning from a combination of positive and negative grammars. Analyzing results from various experiments we find that Logistic Regression with SGD on Apache Spark performs better with accuracy of 64.12% while being highly scalable. High dependency on the grammars is a limitation of the model. Improvements can be done by defining different quality and quantity of grammars. Â© 2015 IEEE.
A Novel Method for Disease Recognition and Cure Time Prediction Based on Symptoms
(Institute of Electrical and Electronics Engineers Inc., 2015) Shankar, M.; Pahadia, M.; Srivastava, D.; Ashwin, T.S.; Guddeti, G.
Healthcare is a sector where decisions usually have very high-risk and high-cost associated with them. One bad choice can cost a person's life. With diseases like Swine Flu on the rise, which have symptoms quite similar to common cold, it's very difficult for people to differentiate between medical conditions. We propose a novel method for recognition of diseases and prediction of their cure time based on the symptoms. We do this by assigning different coefficients to each symptom of a disease, and filtering the dataset with the severity score assigned to each symptom by the user. The diseases are identified based on a numerical value calculated in the fashion mentioned above. For predicting the cure time of a disease, we use reinforcement learning. Our algorithm takes into account the similarity between the condition of the current user and other users who have suffered from the same disease, and uses the similarity scores as weights in prediction of cure time. We also predict the current medical condition of user relative to people who have suffered from same disease. Â© 2015 IEEE.
Virtual slate: Microsoft kinect based text input tool to improve handwriting of people
(Asia-Pacific Society for Computers in Education No. 300, Jhongda Road, Jhongli City, Taoyuan County 32001, 2016) Ashwin, T.S.; Sreenivasan, K.; Rameez, M.A.; Varma, A.; Mohandoss, V.; Guddeti, G.
Text input is a mundane activity that is very closely associated with Human Computer Interaction. In this paper, using the object tracking facility of the Microsoft Kinect sensor and Tesseract for Optical Character Recognition (OCR), we made it possible to write the text by moving our finger in the air as though we were writing on a virtual slate. One of the main purposes of this proposed work is to help the children so that they can improve their handwriting without somebody to check and monitor their writing activity continuously. Â© 2016 Asia-Pacific Society for Computers in Education. All rights reserved.

Faculty Publications

Browse

Filters

Settings

Sort By

Results per page

Search Results