The BrWaC (Brazilian Portuguese Web as Corpus)
Text CorporaText ClassificationPortuguese
Created by Pedro et al. at 2020, the The BrWaC (Brazilian Portuguese Web as Corpus) is a text corpora dataset in Portuguese in Text format.
About The BrWaC (Brazilian Portuguese Web as Corpus)
This dataset is a large corpus constructed in our lab following the Wacky framework, which was made public for research purposes.
Details
- Task
- Text Corpora, Text Classification
- Language
- Portuguese
- Format
- Text
- Rows / instances
- n/a
- Creator
- Pedro et al.
- Year
- 2020