Question 1

What is the neuralwork/arxiver dataset?

Accepted Answer

Arxiver Dataset

Arxiver consists of 63,357 arXiv papers converted to multi-markdown (.mmd) format. Our dataset includes original arXiv article IDs, titles, abstracts, authors, publication dates, URLs and corresponding markdown files published b...

Question 2

Is neuralwork/arxiver a benchmark?

Accepted Answer

neuralwork/arxiver is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download neuralwork/arxiver?

Accepted Answer

neuralwork/arxiver is available at its source: https://huggingface.co/datasets/neuralwork/arxiver.

neuralwork/arxiver

About neuralwork/arxiver

Details

Related General NLP datasets

FAQ