HuggingFaceM4/Docmatix

Visual Question Answering | EN | mit

HuggingFaceM4/Docmatix is an English visual question answering dataset created by HuggingFaceM4 in 2024. It contains 2,548,360 records in Parquet format, which places it in the 1M<n<10M size category, and it is released under the MIT license. It has about 13K downloads and 305 likes.

About HuggingFaceM4/Docmatix

Dataset Card for Docmatix

Dataset description

Docmatix is part of the Idefics3 release (stay tuned). It is a massive dataset for Document Visual Question Answering that was used for the fine-tuning of the vision-language model Idefi...

Details

Task: Visual Question Answering
Language: EN
Format: Parquet
Rows / instances: 2,548,360
Size: 1M<n<10M
Creator: HuggingFaceM4
Year: 2024
License: mit
Downloads: 12,980
Likes: 305
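
The Size value is a bucketed tag rather than an exact count. As a minimal sketch of how the row count above maps to that tag, the Python below looks up the bucket for a given number of rows. The helper name `size_category`, the full bucket list, and the treatment of exact boundaries (a count equal to a bound goes into the larger bucket) are assumptions made for illustration; only the `1M<n<10M` label and the row count come from this page.

```python
# Size buckets as (exclusive upper bound, label). Labels follow the
# "1M<n<10M" tag style shown in Details; the full list is an assumption.
SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
    (100_000_000, "10M<n<100M"),
    (1_000_000_000, "100M<n<1B"),
]


def size_category(num_rows: int) -> str:
    """Return the bucket label for a row count (hypothetical helper)."""
    if num_rows < 0:
        raise ValueError("num_rows must be non-negative")
    for upper_bound, label in SIZE_BUCKETS:
        if num_rows < upper_bound:
            return label
    # Anything at or above one billion rows falls through to a catch-all label.
    return "n>1B"


# Row count listed for HuggingFaceM4/Docmatix on this page.
print(size_category(2_548_360))
```

With the listed 2,548,360 rows, the lookup lands in the same `1M<n<10M` bucket that the page reports.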