wikimedia/wikisource

Text Generation, Fill Mask, AR, AS, AZ

The wikimedia/wikisource dataset is a text generation and fill-mask dataset published by wikimedia in Parquet format. The languages listed for it here are AR, AS, and AZ.

About wikimedia/wikisource

Dataset Card for Wikimedia Wikisource

Dataset Summary

Wikisource dataset containing cleaned articles of all languages. The dataset is built from the Wikisource dumps (https://dumps.wikimedia.org/) with one subset per language, each c...
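Since the summary describes one subset per language, a minimal sketch of selecting a single language subset might look like the following. It assumes the Hugging Face `datasets` library is installed. The `<dump_date>.<lang>` subset naming, the `train` split, and the helper names `subset_name` and `load_wikisource` are assumptions made for illustration and are not confirmed by this page; check the dataset card for the actual subset names.

```python
def subset_name(lang: str, dump_date: str) -> str:
    """Build a subset name of the form '<dump_date>.<lang>'.

    ASSUMPTION: this naming scheme is hypothetical; verify it against
    the dataset card before relying on it.
    """
    return f"{dump_date}.{lang.lower()}"


def load_wikisource(lang: str, dump_date: str):
    """Stream one language subset of wikimedia/wikisource.

    ASSUMPTION: the subset exposes a 'train' split.
    """
    # Imported lazily so subset_name() works without the library installed.
    from datasets import load_dataset

    return load_dataset(
        "wikimedia/wikisource",
        subset_name(lang, dump_date),
        split="train",
        streaming=True,  # iterate without downloading the whole subset first
    )
```

Calling `load_wikisource("ar", dump_date)` with a dump date taken from the dataset card would return an iterable of records. Streaming is chosen here only to avoid a full download when inspecting a few rows.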

Details

Task: Text Generation, Fill Mask
Language: AR, AS, AZ
Format: Parquet
Rows / instances: N/A
Creator: wikimedia
Year: 2022