Skip to content

German Datasets

We catalog 12 German datasets for NLP and machine learning, including 1 benchmarks. Browse the list below or narrow down by task.

This page covers German, a high-resource European language widely used in NLP benchmarks. Our directory includes 12 datasets in German.

Updated June 2026

What tasks do German datasets cover?

Datasets in other languages

Frequently asked questions