Japanese Datasets

We catalog 11 Japanese datasets for NLP and machine learning, including 1 benchmark. Browse the list below or narrow it down by task.

Japanese is a high-resource East Asian language with dedicated NLP tooling.

Updated June 2026

What tasks do Japanese datasets cover?

Datasets in other languages

Frequently asked questions