Describing Image with Attention based GRU
No Thumbnail Available
Date
2021
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Abstract
Generating descriptions for images are popular research topic in current world. Based on encoder-decoder model, CNN works as an encoder to encode the images and then passes it to decoder RNN as input to generate the image description in natural language sentences. LSTM is widely used as RNN decoder. Attention mechanism has also played an important role in this field by enhancing the object detection. Inspired by this recent advancement in this field of computer vision, we used GRU in place of LSTM as a decoder for our image captioning model. We incorporated attention mechanism with GRU decoder to enhance the precision of generated captions. GRU have lesser tensor operations in comparison to LSTM, hence it will be faster in training. © 2021 IEEE.
Description
Keywords
Attention, Bahdanau attention, Convolutional Neural Network [CNN], Gated Recurrent Unit [GRU], InceptionV3, Long Short Term Memory [LSTM]
Citation
2021 6th International Conference for Convergence in Technology, I2CT 2021, 2021, Vol., , p. -
