Question 1

What is the tomg-group-umd/pixelprose dataset?

Accepted Answer

From Pixels to Prose: A Large Dataset of Dense Image Captions

PixelProse is a comprehensive dataset of over 16 million synthetically generated captions, leveraging cutting-edge vision-language models (Gemi...

Question 2

Is tomg-group-umd/pixelprose a benchmark?

Accepted Answer

tomg-group-umd/pixelprose is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download tomg-group-umd/pixelprose?

Accepted Answer

tomg-group-umd/pixelprose is available at its source: https://huggingface.co/datasets/tomg-group-umd/pixelprose.
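
As a rough illustration of the answer above, the sketch below builds the dataset page URL from the repository id and shows one possible way to peek at a record. It assumes the third-party Hugging Face `datasets` package is installed and that the repository can be streamed; the helper names `dataset_url` and `peek_first_record` are hypothetical and are not part of any library.

```python
HUB_DATASETS_BASE = "https://huggingface.co/datasets"
REPO_ID = "tomg-group-umd/pixelprose"


def dataset_url(repo_id: str) -> str:
    """Return the Hugging Face dataset page URL for a repository id."""
    return f"{HUB_DATASETS_BASE}/{repo_id.strip('/')}"


def peek_first_record(repo_id: str = REPO_ID) -> dict:
    """Stream a single record without downloading the full dataset.

    Assumption: the `datasets` package is installed and the repository
    supports streaming. Split names are not assumed; the first split the
    Hub reports is used.
    """
    # Imported lazily so that dataset_url() works without the package.
    from datasets import load_dataset

    splits = load_dataset(repo_id, streaming=True)  # mapping: split name -> iterable dataset
    first_split = next(iter(splits.values()))
    return next(iter(first_split))


# Example (requires network access):
#   print(dataset_url(REPO_ID))
#   print(sorted(peek_first_record().keys()))
```

Streaming is used here because the dataset holds over 16 million captions, so inspecting a single record is cheaper than fetching everything first.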
