BIOSSES

Semantic Textual Similarity | English

BIOSSES is a semantic textual similarity dataset in English from Sogancıoglu et al. with 100 records in Word Doc format.

About BIOSSES

The dataset comprises 100 sentence pairs. Each sentence was selected from the TAC (Text Analysis Conference) Biomedical Summarization Track Training Dataset, which contains articles from the biomedical domain. The TAC dataset consists of 20 reference articles, each with between 12 and 20 citing articles.

Details

Task: Semantic Textual Similarity
Language: English
Format: Word Doc
Rows / instances: 100
Creator: Sogancıoglu et al.
Year: 2017
