Skip to content

ClueWeb Corpora

ClassificationEnglish

ClueWeb Corpora is a classification-focused dataset in English that provides 340,451,982 labeled examples distributed in Text format.

Details

Task
Classification
Language
English
Format
Text
Rows / instances
340,451,982
Creator
Gabrilovich et al.
Year
2013
Download Paper

Related Classification datasets

FAQ