Skip to content

facebook/omnilingual-asr-corpus

Automatic Speech RecognitionAudio ClassificationAAE, AAL, AAOcc-by-4.0

Facebook/omnilingual-asr-corpus is a automatic speech recognition dataset in AAE, AAL, AAO from facebook in Parquet format. It is distributed under the cc-by-4.0 license and falls in the 100K<n<1M size category, and has been downloaded 3.8K times.

About facebook/omnilingual-asr-corpus

Meta Omnilingual ASR Corpus The Omnilingual ASR Corpus is a collection of spontaneous speech recordings and their transcriptions for 348 under-served languages. The corpus was collected as part of Meta FAIR’s Omnilingual ASR project (blog, mode...

Details

Task
Automatic Speech Recognition, Audio Classification
Language
AAE, AAL, AAO
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
facebook
Year
2025
License
cc-by-4.0
Downloads
3774
Likes
206
Download Homepage

Related Automatic Speech Recognition, Audio Classification datasets

FAQ