A reinforcement learning and recurrent neural network based dynamic user modeling system

Tripathi, A.; Ashwin, T.S.; Guddeti, R.M.

Date

2018

Authors

Tripathi, A.

Ashwin, T.S.

Guddeti, R.M.

Publisher

Institute of Electrical and Electronics Engineers Inc.

Abstract

With the exponential growth in areas of machine intelligence, the world has witnessed promising solutions to personalized content recommendation. The ability of interactive learning agents to take optimal decisions in dynamic environments has been well conceptualized and proven by Reinforcement Learning (RL). The learning characteristics of Deep Bidirectional Recurrent Neural Networks (DBRNN) in both positive and negative time directions have shown exceptional performance as generative models for sequential data in supervised learning tasks. In this paper, we harness the potential of these two techniques to create personalized video recommendation through emotional intelligence. We present a novel context-aware collaborative filtering approach in which the intensity of users' spontaneous non-verbal emotional response towards a recommended video is captured through system interactions and facial expression analysis, for decision-making and video corpus evolution with real-time data streams. We take into account a user's dynamic nature in the formulation of optimal policies by framing an RL scenario with an off-policy (Q-Learning) algorithm for temporal-difference learning, which is used to train the DBRNN to learn contextual patterns and generate new video sequences for recommendation. Evaluation of our system with real users for a month shows that our approach outperforms state-of-the-art methods and models a user's emotional preferences very well with stable convergence. © 2018 IEEE.

Keywords

Affectiva, Deep Bidirectional Recurrent Neural Network, Intensities of Emotions, Multi-Armed Bandit, Q-Learning, Reinforcement Learning, Video Recommendation

Citation

Proceedings - IEEE 18th International Conference on Advanced Learning Technologies, ICALT 2018, 2018, p. 411-415

URI

https://doi.org/10.1109/ICALT.2018.00103
https://idr.nitk.ac.in/handle/123456789/31345

Collections

Conference Papers
