Yv, S.S.; Choubey, Y.; Naik, D.
2026-02-06
2021
Proceedings - 5th International Conference on Computing Methodologies and Communication, ICCMC 2021, 2021, pp. 1051-1055
https://doi.org/10.1109/ICCMC51019.2021.9418347
https://idr.nitk.ac.in/handle/123456789/30149

Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing (NLP). This work presents a generative model, based on a deep recurrent architecture, that combines recent developments in machine learning and computer vision to describe an image in natural language phrases. The model is trained to maximize the likelihood of the target description sentence given the training image. Experiments on several datasets demonstrate the model's efficiency and accuracy, and show that the language it learns depends only on the image descriptions. © 2021 IEEE.

Attention; Caption; Component; Decoder; Dense; Encoder; Formatting; Gated Recurrent Unit (GRU)

Image Captioning with Attention Based Model
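
The abstract states that training maximizes the likelihood of the target description given the image. The sketch below is one common way to compute that quantity, not the authors' implementation: it assumes the decoder emits one vector of unnormalized scores (logits) over the vocabulary per caption position, and sums the log-probabilities assigned to the reference words. The function names, the three-word vocabulary, and the numbers are all hypothetical.

```python
import math


def log_softmax(logits):
    """Turn unnormalized scores into log-probabilities (numerically stable)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def caption_log_likelihood(step_logits, target_ids):
    """Sum of log p(word_t | image, earlier words) over one caption.

    step_logits: one list of vocabulary scores per caption position
                 (assumed to come from a decoder conditioned on the image).
    target_ids:  index of the reference word at each position.
    """
    if len(step_logits) != len(target_ids):
        raise ValueError("need one target word per decoding step")
    total = 0.0
    for logits, word_id in zip(step_logits, target_ids):
        total += log_softmax(logits)[word_id]
    return total


# Toy example: a two-word caption over a three-word vocabulary (made-up scores).
step_logits = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]
target_ids = [0, 1]

log_lik = caption_log_likelihood(step_logits, target_ids)
loss = -log_lik  # minimizing this loss maximizes the caption likelihood
```

Under these assumptions, a training loop would minimize `loss` averaged over image and caption pairs; how the logits are produced (encoder, attention, GRU decoder) is outside the scope of this sketch.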