Key points are not available for this paper at this time.
This model enables an individual to input an image and output a description for the same. The research paper makes use of the functionalities of Deep Learning and NLP (Natural Language Processing). Image Caption Generation is an important task as it allows us automate the task of generating captions for any image. This functionality enables us to easily organize files without paying heed to the task of captioning. It is also important for making dynamic web pages. This paper is for people who are visually impaired or suffer from short sightedness. So, rather than looking at an image with trouble they can easily read the caption generated by this model in a larger format. It can also be used to give description of a video in real time on later implementation for a video.
Sehgal et al. (Mon,) studied this question.
Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context: