Skip to content

kdexd/red_caps

Image To TextEN

Kdexd/red_caps is a image to text-focused dataset in EN distributed in Parquet format.

About kdexd/red_caps

RedCaps is a large-scale dataset of 12M image-text pairs collected from Reddit. Images and captions from Reddit depict and describe a wide variety of objects and scenes. The data is collected from a manually curated set of subreddits (350 total), ...

Details

Task
Image To Text
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
kdexd
Year
2022
Download

Related Image To Text datasets

FAQ