Skip to content

alibaba-multimodal-industrial-ai/IndustryBench-MIPU

Image To TextVisual Question AnsweringZHmit

Created by alibaba-multimodal-industrial-ai at 2026, the alibaba-multimodal-industrial-ai/IndustryBench-MIPU is a image to text dataset in ZH in Parquet format. With 20.3K downloads and 7 likes, it is actively used by the community. It is released under the mit license and is a 10K<n<100K-scale dataset.

About alibaba-multimodal-industrial-ai/IndustryBench-MIPU

IndustryBench-MIPU: Benchmarking Multi-Image Attribute Value Extraction for Industrial Products Multi-Image Industrial Product Understanding Benchmark — evaluating MLLMs on structured attribute extraction from real-world industrial product imag...

Details

Task
Image To Text, Visual Question Answering
Language
ZH
Format
Parquet
Rows / instances
N/A
Size
10K<n<100K
Creator
alibaba-multimodal-industrial-ai
Year
2026
License
mit
Downloads
20300
Likes
7
Download Homepage

Related Image To Text, Visual Question Answering datasets

FAQ