Skip to content

CSU-JPG/TextAtlas5M

Text To ImageENmit

CSU-JPG/TextAtlas5M is a text to image-focused dataset in EN that provides 5,398,826 labeled examples distributed in Parquet format. It is distributed under the mit license and falls in the 1M<n<10M size category, and has been downloaded 3.8K times.

About CSU-JPG/TextAtlas5M

TextAtlas5M This dataset is a training set for TextAtlas. Paper: https://huggingface.co/papers/2502.07870 (All the data in this repo is uploaded :>) Dataset subsets Subsets in this dataset are CleanTextSynth, PPT2Details, PPT2Structu...

Details

Task
Text To Image
Language
EN
Format
Parquet
Rows / instances
5398826
Size
1M<n<10M
Creator
CSU-JPG
Year
2025
License
mit
Downloads
3806
Likes
39
Download Homepage

Related Text To Image datasets

FAQ