Watan-2004 Corpus

Text Corpora, Arabic

Created by Abbas et al. in 2004, the Watan-2004 Corpus is an Arabic text corpus of about 20,000 articles in HTML format.

About Watan-2004 Corpus

The dataset contains about 20,000 articles covering six topics: culture, religion, economy, local news, international news, and sports.

Details

Task: Text Corpora
Language: Arabic
Format: HTML
Rows / instances: about 20,000 articles
Creator: Abbas et al.
Year: 2004