Skip to content

bastao/VeraCruz_PT-BR

Text GenerationText ClassificationPT

The bastao/VeraCruz_PT-BR dataset is a PT text generation resource from bastao at 2024. With 32.3K downloads and 17 likes, it is actively used by the community and is a 100M<n<1B-scale dataset.

About bastao/VeraCruz_PT-BR

Dataset Summary The VeraCruz Dataset is a comprehensive collection of Portuguese language content, showcasing the linguistic and cultural diversity of of Portuguese-speaking regions. It includes around 190 million samples, organized by regional...

Details

Task
Text Generation, Text Classification
Language
PT
Format
Parquet
Rows / instances
N/A
Size
100M<n<1B
Creator
bastao
Year
2024
Downloads
32253
Likes
17
Download Homepage

Related Text Generation, Text Classification datasets

FAQ