Skip to content
Metatext
LLMs
Best LLMs
Benchmarks
Datasets
Models
Tools
Jobs
Sign In
Sign Up
Datasets
/
arXiv Bulk Data
arXiv Bulk Data
Text Corpora
English
The arXiv Bulk Data dataset is a English text corpora resource released in 2011.
Details
Task
Text Corpora
Language
English
Format
Tar
Rows / instances
n/a
Creator
n/a
Year
2011
Download
Paper
Related Text Corpora datasets
CC100
ITD - Dataset de Acordãos do STF de 2010 a 2018
FI News Corpus
Lex2Kids
CC100-Afrikaans
CC100-Amharic
FAQ
What is the arXiv Bulk Data dataset?
Is arXiv Bulk Data a benchmark?
Where can I download arXiv Bulk Data?