princeton-nlp/SWE-bench_Lite
General NLPEnglishBenchmark
Created by princeton-nlp at 2024, the princeton-nlp/SWE-bench_Lite is a General NLP benchmark dataset in English in Parquet format.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About princeton-nlp/SWE-bench_Lite
Dataset Summary
SWE-bench Lite is subset of SWE-bench, a dataset that tests systems’ ability to solve GitHub issues automatically. The dataset collects 300 test Issue-Pull Request pairs from 11 popular Python. Evaluation is performed by unit te...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- princeton-nlp
- Year
- 2024