Key points are not available for this paper at this time.
We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception- ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer
Building similarity graph...
Analyzing shared references across papers
Loading...
Sharma et al. (Mon,) studied this question.
synapsesocial.com/papers/69dacb5034ded318bb68488f — DOI: https://doi.org/10.18653/v1/p18-1238
Piyush Sharma
Nan Ding
Sebastian Goodman
Google (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...