CSU-JPG/TextAtlas5M
Text To ImageENmit
CSU-JPG/TextAtlas5M is a text to image-focused dataset in EN that provides 5,398,826 labeled examples distributed in Parquet format. It is distributed under the mit license and falls in the 1M<n<10M size category, and has been downloaded 3.8K times.
About CSU-JPG/TextAtlas5M
TextAtlas5M
This dataset is a training set for TextAtlas.
Paper: https://huggingface.co/papers/2502.07870
(All the data in this repo is uploaded :>)
Dataset subsets
Subsets in this dataset are CleanTextSynth, PPT2Details, PPT2Structu...
Details
- Task
- Text To Image
- Language
- EN
- Format
- Parquet
- Rows / instances
- 5398826
- Size
- 1M<n<10M
- Creator
- CSU-JPG
- Year
- 2025
- License
- mit
- Downloads
- 3806
- Likes
- 39