nvidia/Nemotron-Pretraining-Specialized-v1
Text Generation · English
nvidia/Nemotron-Pretraining-Specialized-v1 is an English text generation dataset published by nvidia in Parquet format.
About nvidia/Nemotron-Pretraining-Specialized-v1
Nemotron-Pre-Training-Dataset-v2.1
Dataset Description
The Nemotron-Pre-Training-Dataset-v2.1 extends the previously released Nemotron pretraining datasets with refreshed, higher-quality, and more diverse data across...
Details
- Task: Text Generation
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: nvidia
- Year: 2025