allenai/reward-bench-results
General NLP, English
allenai/reward-bench-results is a General NLP dataset in English from allenai with 2,093 records in Parquet format. It has been downloaded 63.7K times.
About allenai/reward-bench-results
Results for the Holistic Evaluation of Reward Models (HERM) Benchmark
Here, you'll find the raw scores for the HERM project.
The repository is structured as follows.
├── best-of-n/ <- Nested directory for different compl...
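Because the directory listing above is truncated, one way to see the full layout is to ask the Hugging Face Hub for the file list and group it by top-level directory. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and the repository is publicly readable. The helper names `count_by_top_level` and `list_result_files` are illustrative and not part of the HERM project.

```python
from collections import Counter


def count_by_top_level(paths):
    """Count files per top-level directory; root-level files are grouped under '.'."""
    counts = Counter()
    for path in paths:
        head, sep, _ = path.partition("/")
        # A path without "/" lives at the repository root.
        counts[head if sep else "."] += 1
    return dict(counts)


def list_result_files(repo_id="allenai/reward-bench-results"):
    """Fetch the repository's file listing from the Hub (requires network access)."""
    # Imported lazily so count_by_top_level works without the dependency.
    from huggingface_hub import list_repo_files

    return list_repo_files(repo_id, repo_type="dataset")
```

Calling `count_by_top_level(list_result_files())` would return a dictionary mapping each top-level directory (such as `best-of-n`) to the number of files it contains.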
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- 2093
- Creator
- allenai
- Year
- 2023
- Downloads
- 63734
- Likes
- 3