nebius/SWE-bench-extra
General NLP, English, Benchmark
The nebius/SWE-bench-extra dataset is an English-language, general-NLP benchmark distributed in Parquet format.
📊 This dataset is used as an LLM benchmark.
About nebius/SWE-bench-extra
Note: This dataset has an improved and significantly larger successor: SWE-rebench.
Dataset Summary
SWE-bench Extra is a dataset that can be used to train or evaluate agentic systems specializing in resolving GitHub issues. It is based ...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: nebius
- Year: 2024