HuggingFaceM4/Docmatix
Visual Question Answering | EN | mit
HuggingFaceM4/Docmatix is an English-language visual question answering dataset published by HuggingFaceM4 in 2024. It contains 2,548,360 records in Parquet format, which places it in the 1M<n<10M size category, and it is released under the MIT license. The listing reports 12,980 downloads (about 13K) and 305 likes.
About HuggingFaceM4/Docmatix
Dataset Card for Docmatix
Dataset description
Docmatix is part of the Idefics3 release (stay tuned).
It is a massive dataset for Document Visual Question Answering that was used for the fine-tuning of the vision-language model Idefi...
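Because the dataset holds millions of records, it can help to look at a few of them before downloading everything. The sketch below streams the first records with the Hugging Face `datasets` library. The helper name `stream_docmatix` is hypothetical, and both the subset name passed as `config_name` and the `train` split are assumptions, since this page lists neither; check the dataset card for the actual values.

```python
from itertools import islice

DATASET_ID = "HuggingFaceM4/Docmatix"  # repository id as listed on this page


def stream_docmatix(config_name, limit=3, split="train"):
    """Return the first `limit` records without downloading the full dataset.

    `config_name` and `split` are assumptions: this page does not list the
    available subsets or splits, so confirm them on the dataset card.
    """
    # Imported lazily so this module loads even if `datasets` is not installed.
    from datasets import load_dataset

    # streaming=True yields records lazily instead of fetching every Parquet file.
    stream = load_dataset(DATASET_ID, config_name, split=split, streaming=True)
    return list(islice(stream, limit))
```

A call such as `stream_docmatix("images")` would return a short list of record dictionaries, provided a subset with that name exists; printing `records[0].keys()` is a quick way to see which fields each record carries.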
Details
- Task: Visual Question Answering
- Language: EN
- Format: Parquet
- Rows / instances: 2,548,360
- Size: 1M<n<10M
- Creator: HuggingFaceM4
- Year: 2024
- License: mit
- Downloads: 12,980
- Likes: 305
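The size label and the rounded download figure both follow from the raw counts listed above. As a minimal sketch, the hypothetical helpers below map a row count to a size bucket and abbreviate a count to thousands. The bucket list is simplified (it stops at 1B), and treating a count that sits exactly on a boundary as part of the higher bucket is an assumption.

```python
# Simplified bucket boundaries; the list stops at 1B for brevity (assumption).
BOUNDS = [
    (1_000, "1K"),
    (10_000, "10K"),
    (100_000, "100K"),
    (1_000_000, "1M"),
    (10_000_000, "10M"),
    (100_000_000, "100M"),
    (1_000_000_000, "1B"),
]


def size_category(n_rows):
    """Map a row count to a size bucket label such as '1M<n<10M'."""
    if n_rows < BOUNDS[0][0]:
        return "n<1K"
    # Walk adjacent boundary pairs and return the first bucket that contains n_rows.
    for (low, low_label), (high, high_label) in zip(BOUNDS, BOUNDS[1:]):
        if low <= n_rows < high:
            return f"{low_label}<n<{high_label}"
    return "n>1B"


def abbreviate_thousands(count):
    """Round a count to the nearest thousand and format it as e.g. '13K'."""
    return f"{round(count / 1000)}K"


print(size_category(2_548_360))      # 1M<n<10M
print(abbreviate_thousands(12_980))  # 13K
```

This confirms that the 2,548,360 rows reported here fall in the 1M<n<10M bucket and that 12,980 downloads round to 13K.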