allenai/CoSyn-400K
Visual Question AnsweringEnglishodc-by
Created by allenai at 2025, the allenai/CoSyn-400K is a visual question answering dataset in English containing 408,227 records in Parquet format. With 3K downloads and 50 likes, it is actively used by the community. It is released under the odc-by license and is a 100K<n<1M-scale dataset.
About allenai/CoSyn-400K
CoSyn-400k
CoSyn-400k is a collection of synthetic question-answer pairs about very diverse range of computer-generated images.
The data was created by using the Claude large language model to generate code that can be executed to render an im...
Details
- Task
- Visual Question Answering
- Language
- English
- Format
- Parquet
- Rows / instances
- 408227
- Size
- 100K<n<1M
- Creator
- allenai
- Year
- 2025
- License
- odc-by
- Downloads
- 2997
- Likes
- 50