openbmb/UltraData-Math
Text GenerationEN, ZH
The openbmb/UltraData-Math dataset is a EN, ZH text generation resource from openbmb at 2026.
About openbmb/UltraData-Math
UltraData-Math
π€ Dataset | π» Source Code | π¨π³ δΈζ README
UltraData-Math is a large-scale, high-quality mathematical pre-training dataset totaling 290B+ tokens across three progressive tiersβL1 (170.5B tokens web corpus), L2 (33.7B token...
Details
- Task
- Text Generation
- Language
- EN, ZH
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- openbmb
- Year
- 2026