Xkev/LLaVA-CoT-100k

Visual Question Answering | Image Text To Text | EN

Created by Xkev in 2024, Xkev/LLaVA-CoT-100k is an English-language visual question answering dataset distributed in Parquet format.

About Xkev/LLaVA-CoT-100k

Dataset Card for LLaVA-CoT

The LLaVA-CoT-100k dataset is introduced in the paper "LLaVA-CoT: Let Vision Language Models Reason Step-by-Step". This dataset is designed to enable Vision-Language Models (VLMs) to perform autonomous multistage reason...

Details

Task: Visual Question Answering, Image Text To Text
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: Xkev
Year: 2024