Skip to content

Fill Mask Datasets

There are 9 fill mask datasets in our directory. Each links to its source, paper, and download — browse the full list below or filter by language.

Fill Mask is the task of predicting masked-out words in a sentence, the pre-training objective behind models like BERT. We catalog 9 datasets for it.

Updated June 2026

What languages do fill mask datasets cover?

Explore other dataset tasks

Frequently asked questions