Skip to content

The BrWaC (Brazilian Portuguese Web as Corpus)

Text CorporaText ClassificationPortuguese

Created by Pedro et al. at 2020, the The BrWaC (Brazilian Portuguese Web as Corpus) is a text corpora dataset in Portuguese in Text format.

About The BrWaC (Brazilian Portuguese Web as Corpus)

This dataset is a large corpus constructed in our lab following the Wacky framework, which was made public for research purposes.

Details

Task
Text Corpora, Text Classification
Language
Portuguese
Format
Text
Rows / instances
n/a
Creator
Pedro et al.
Year
2020
Download Paper

Related Text Corpora, Text Classification datasets

FAQ