allenai/pixmo-cap
Image To TextEnglishodc-by
The allenai/pixmo-cap dataset is a English image to text resource from allenai at 2024 comprising 717,042 examples. With 680 downloads and 42 likes, it is actively used by the community. It is released under the odc-by license and is a 100K<n<1M-scale dataset.
About allenai/pixmo-cap
PixMo-Cap
PixMo-Cap is a dataset of very long (roughly 200 words on average), detailed captions.
It can be used to pre-train and fine-tune vision-language models.
PixMo-Cap was created by recording annotators speaking about an image for 60-90 ...
Details
- Task
- Image To Text
- Language
- English
- Format
- Parquet
- Rows / instances
- 717042
- Size
- 100K<n<1M
- Creator
- allenai
- Year
- 2024
- License
- odc-by
- Downloads
- 680
- Likes
- 42