
amphora/ResearchMath-14k

Text Generation, Question Answering, EN, Benchmark, MIT

The amphora/ResearchMath-14k dataset is an English (EN) text generation and question answering resource published by amphora in 2026. It has 2,644 downloads and 54 likes, is released under the MIT license, and falls in the 10K<n<100K size category.

This dataset is used as an LLM benchmark.

About amphora/ResearchMath-14k

ResearchMath-14k is a collection of 14,056 research-level mathematical problem records extracted from papers, open-problem lists, workshop sheets, and related academic sources. Each record contains the original extracted questi...

Details

Task: Text Generation, Question Answering
Language: EN
Format: Parquet
Rows / instances: 14,056 records (per the dataset description)
Size: 10K<n<100K
Creator: amphora
Year: 2026
License: MIT
Downloads: 2,644
Likes: 54
