Describing Image with Attention based GRU

dc.contributor.authorMallick, V.R.
dc.contributor.authorNaik, D.
dc.date.accessioned2026-02-06T06:35:56Z
dc.date.issued2021
dc.description.abstractGenerating descriptions for images is a popular research topic today. In the encoder-decoder framework, a CNN works as the encoder, encoding the image and passing the result to an RNN decoder as input, which generates the image description as natural-language sentences. LSTM is widely used as the RNN decoder. Attention mechanisms have also played an important role in this field by enhancing object detection. Inspired by these recent advances in computer vision, we used a GRU in place of an LSTM as the decoder of our image captioning model. We incorporated an attention mechanism with the GRU decoder to enhance the precision of the generated captions. A GRU has fewer tensor operations than an LSTM, and hence trains faster. © 2021 IEEE.
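The pipeline the abstract describes (CNN image features scored by Bahdanau-style additive attention, with the resulting context vector fed into a GRU decoder) can be sketched for a single decoding step. This is a minimal NumPy illustration under stated assumptions, not the authors' implementation: all dimensions, the random weight initialization, and the choice to concatenate the context vector with the previous word embedding are assumptions made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions (assumed): InceptionV3-style features reduced to a
# grid of 64 region vectors of size 256; GRU hidden size 512.
num_regions, feat_dim = 64, 256   # encoder output: image regions x features
hidden_dim = 512                  # GRU decoder hidden size
attn_dim = 256                    # attention scoring layer size

features = rng.standard_normal((num_regions, feat_dim))  # CNN encoder output
h_prev = rng.standard_normal(hidden_dim)                 # previous GRU state
x_t = rng.standard_normal(feat_dim)                      # embedded previous word

def softmax(z):
    z = z - z.max()           # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

# --- Bahdanau (additive) attention: score_i = v^T tanh(W1 f_i + W2 h_prev) ---
W1 = rng.standard_normal((feat_dim, attn_dim)) * 0.01
W2 = rng.standard_normal((hidden_dim, attn_dim)) * 0.01
v = rng.standard_normal(attn_dim) * 0.01

scores = np.tanh(features @ W1 + h_prev @ W2) @ v
alpha = softmax(scores)       # attention weights over image regions, sum to 1
context = alpha @ features    # context vector: attention-weighted feature sum

# --- One GRU step on [word embedding ; context] (concatenation assumed) ---
inp = np.concatenate([x_t, context])
in_dim = inp.size

def init(shape):
    return rng.standard_normal(shape) * 0.01

Wz, Uz = init((in_dim, hidden_dim)), init((hidden_dim, hidden_dim))
Wr, Ur = init((in_dim, hidden_dim)), init((hidden_dim, hidden_dim))
Wh, Uh = init((in_dim, hidden_dim)), init((hidden_dim, hidden_dim))

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

z = sigmoid(inp @ Wz + h_prev @ Uz)              # update gate
r = sigmoid(inp @ Wr + h_prev @ Ur)              # reset gate
h_tilde = np.tanh(inp @ Wh + (r * h_prev) @ Uh)  # candidate state
h_t = (1 - z) * h_prev + z * h_tilde             # new hidden state

print(alpha.shape, h_t.shape)
```

Note the gate count: the GRU uses two gates (update, reset) against the LSTM's three gates plus a separate cell state, which is the source of the abstract's claim that the GRU needs fewer tensor operations per step. A full captioning model would project `h_t` through a vocabulary-sized softmax to predict the next word and loop until an end token.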
dc.identifier.citation2021 6th International Conference for Convergence in Technology, I2CT 2021, 2021
dc.identifier.urihttps://doi.org/10.1109/I2CT51068.2021.9418171
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/30154
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.subjectAttention
dc.subjectBahdanau attention
dc.subjectConvolutional Neural Network [CNN]
dc.subjectGated Recurrent Unit [GRU]
dc.subjectInceptionV3
dc.subjectLong Short Term Memory [LSTM]
dc.titleDescribing Image with Attention based GRU