zai-org/LongCite-45k
Text GenerationQuestion AnsweringEN, ZH
Zai-org/LongCite-45k is a text generation-focused dataset in EN, ZH distributed in Parquet format.
About zai-org/LongCite-45k
LongCite-45k
🤗 [LongCite Dataset] • 💻 [Github Repo] • 📃 [LongCite Paper]
LongCite-45k dataset contains 44,600 long-context QA instances paired with sentence-level citations (both English and Chinese, up to 128,000 words). The data can su...
Details
- Task
- Text Generation, Question Answering
- Language
- EN, ZH
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- zai-org
- Year
- 2024