Classification Datasets

There are 45 classification datasets in our directory. Each entry links to its source, paper, and download. Browse the full list below or filter by language.

Classification is the task of sorting inputs into predefined categories, and it is one of the most common supervised-learning tasks.

Updated June 2026

What languages do classification datasets cover?

Explore other dataset tasks

Frequently asked questions