Skip to content

allenai/pixmo-cap

Image To TextEnglishodc-by

The allenai/pixmo-cap dataset is a English image to text resource from allenai at 2024 comprising 717,042 examples. With 680 downloads and 42 likes, it is actively used by the community. It is released under the odc-by license and is a 100K<n<1M-scale dataset.

About allenai/pixmo-cap

PixMo-Cap PixMo-Cap is a dataset of very long (roughly 200 words on average), detailed captions. It can be used to pre-train and fine-tune vision-language models. PixMo-Cap was created by recording annotators speaking about an image for 60-90 ...

Details

Task
Image To Text
Language
English
Format
Parquet
Rows / instances
717042
Size
100K<n<1M
Creator
allenai
Year
2024
License
odc-by
Downloads
680
Likes
42
Download Homepage

Related Image To Text datasets

FAQ