Common Objects in Context (COCO)

Automatic Image Captioning | English

Common Objects in Context (COCO) is an automatic image captioning dataset in English from Lin et al., with about 330K images in JPG format and annotations in JSON.

About Common Objects in Context (COCO)

COCO is a large-scale object detection, segmentation, and captioning dataset. It contains 330K images (more than 200K of them labeled), 1.5 million object instances, 80 object categories, 91 stuff categories, and 5 captions per image.
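Because each image carries several captions, a common first step is to group the captions by image. The sketch below assumes the usual layout of a COCO caption annotation file: a top-level `images` list (each entry with `id` and `file_name`) and an `annotations` list (each entry with `image_id` and `caption`). The helper names `load_annotations` and `group_captions`, the sample file names, and the sample captions are made up for illustration and are not part of the dataset.

```python
import json
from collections import defaultdict


def load_annotations(path):
    """Read a caption annotation file (JSON) into a Python dict."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def group_captions(annotation_data):
    """Map each image file name to the list of its captions."""
    # Look up file names by image id so captions can be keyed by file.
    file_names = {img["id"]: img["file_name"] for img in annotation_data["images"]}
    captions = defaultdict(list)
    for ann in annotation_data["annotations"]:
        # Strip stray whitespace around the caption text.
        captions[file_names[ann["image_id"]]].append(ann["caption"].strip())
    return dict(captions)


# Tiny made-up sample in the assumed shape (not real COCO records).
sample = {
    "images": [
        {"id": 1, "file_name": "example_0001.jpg"},
        {"id": 2, "file_name": "example_0002.jpg"},
    ],
    "annotations": [
        {"id": 10, "image_id": 1, "caption": "A dog runs across a field."},
        {"id": 11, "image_id": 2, "caption": "Two people ride bicycles. "},
        {"id": 12, "image_id": 1, "caption": "A brown dog playing outside."},
    ],
}

grouped = group_captions(sample)
for file_name, caps in grouped.items():
    print(file_name, len(caps), "caption(s)")
```

With a downloaded annotation file, `group_captions(load_annotations(path))` would be the equivalent call, where `path` points to wherever the caption JSON was saved.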

Details

Task: Automatic Image Captioning
Language: English
Format: JSON, JPG
Rows / instances: 330K images
Creator: Lin et al.
Year: 2014