Skip to content

PerKey

Keyphrase ExtractionInformation ExtractionPersian

PerKey is a keyphrase extraction-focused dataset in Persian that provides 553,111 labeled examples distributed in JSON format.

About PerKey

Dataset contains 553K news articles from six Persian news websites and agencies with author extracted keyphrases, which is then filtered and cleaned to achieve higher quality keyphrases.

Details

Task
Keyphrase Extraction, Information Extraction
Language
Persian
Format
JSON
Rows / instances
553,111
Creator
Doostmohammadi et al.
Year
2020
Download Paper

FAQ