nvidia/Llama-Nemotron-Post-Training-Dataset
General NLP, English
Created by nvidia in 2025, nvidia/Llama-Nemotron-Post-Training-Dataset is an English-language General NLP dataset distributed in Parquet format.
About nvidia/Llama-Nemotron-Post-Training-Dataset
Llama-Nemotron-Post-Training-Dataset-v1.1 Release
Update [4/8/2025]:
v1.1: We are releasing an additional 2.2M Math and 500K Code Reasoning Data in support of our release of Llama-3.1-Nemotron-Ultra-253B-v1. 🎉
Data Overview
This da...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: nvidia
- Year: 2025