Implementations
Support for Reading and Writing by Format
| Format |
Download |
Read |
Write |
| Acoustic Event Dataset |
x |
x |
|
| AudioMNIST |
x |
x |
|
| Broadcast |
|
x |
|
| Common Voice |
x |
x |
|
| Default |
|
x |
x |
| ESC-50 |
x |
x |
|
| Free-Spoken-Digit-Dataset |
x |
x |
|
| Folder |
|
x |
|
| Fluent Speech Commands Dataset |
|
x |
|
| Google Speech Commands |
|
x |
|
| GTZAN |
x |
x |
|
| Kaldi |
|
x |
x |
| LibriSpeech |
x |
x |
|
| Mozilla DeepSpeech |
|
|
x |
| MUSAN |
x |
x |
|
| M-AILABS Speech Dataset |
x |
x |
|
| LITIS Rouen Audio scene dataset |
x |
x |
|
| Spoken Wikipedia Corpora |
x |
x |
|
| Tatoeba |
x |
x |
|
| TIMIT |
|
x |
|
| TUDA German Distant Speech |
x |
x |
|
| Urbansound8k |
|
x |
|
| VoxForge |
x |
x |
|
| Wav2Letter |
|
|
x |
Acoustic Event Dataset
AudioMNIST
Broadcast
Common-Voice
Default
ESC-50
Folder
Free-Spoken-Digit-Dataset
Fluent Speech Commands Dataset
Google Speech Commands
GTZAN
Kaldi
LibriSpeech
Mozilla DeepSpeech
MUSAN
M-AILABS Speech Dataset
NVIDIA Jasper
LITIS Rouen Audio scene dataset
SWC - Spoken Wikipedia Corpora
Tatoeba
TIMIT DARPA Acoustic-Phonetic Continuous Speech Corpus
TUDA German Distant Speech
Urbansound8k
VoxForge
Wav2Letter