Question 1

What is the AudioSet dataset?

Accepted Answer

Dataset consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos.

Question 2

Is AudioSet a benchmark?

Accepted Answer

AudioSet is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download AudioSet?

Accepted Answer

AudioSet is available at its source: https://research.google.com/audioset/download.html.

AudioSet

About AudioSet

Details

Related Speech Recognition, Visual datasets

FAQ