Key points are not available for this paper at this time.
We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception- ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer
Building similarity graph...
Analyzing shared references across papers
Loading...
Piyush Sharma
Sharda University
Nan Ding
Beihang University
Sebastian Goodman
Google (United States)
Google (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Sharma et al. (Mon,) studied this question.
synapsesocial.com/papers/69dacb5034ded318bb68488f — DOI: https://doi.org/10.18653/v1/p18-1238