Researchers from NVIDIA, Factored.ai, Talon Voice, and others open-source a properly licensed dataset of 1,780 hours of speech in 77 different languages, plus transcriptions.
The post Speech Wikimedia Drops a 200GB Audio Dataset to Train ASR and Speech Translation appeared first on Slator .
For more information, please visit
https://slator.com/speech-wikimedia-drop[...]-train-asr-speech-translation/