euirim/goodwiki

Text Generation, Summarization, EN, mit

Created by euirim in 2023, euirim/goodwiki is an English (EN) text generation dataset distributed in Parquet format. It has 306 downloads and 54 likes. It is released under the MIT license and falls in the 10K<n<100K size category.

About euirim/goodwiki

GoodWiki Dataset

GoodWiki is a 179 million token dataset of English Wikipedia articles collected on September 4, 2023, that have been marked as Good or Featured by Wikipedia editors. The dataset provides these articles in GitHub-flavored Markdo...

Details

Task: Text Generation, Summarization
Language: EN
Format: Parquet
Rows / instances: N/A
Size: 10K<n<100K
Creator: euirim
Year: 2023
License: mit
Downloads: 306
Likes: 54