Monday, May 27, 2024

Events | 2023.09.15

Speech Wikimedia Drops a 200GB Audio Dataset to Train ASR and Speech Translation

Researchers from NVIDIA, Factored.ai, Talon Voice, and others open-source a properly licensed dataset of 1,780 hours of speech in 77 different languages, plus transcriptions.

The post Speech Wikimedia Drops a 200GB Audio Dataset to Train ASR and Speech Translation appeared first on Slator .

 

For more information, please visit
https://slator.com/speech-wikimedia-drop[...]-train-asr-speech-translation/

You need to login to post comments.

Feed last updated 2024/05/31 @5:05 AM

0 COMMENTS:

Follow us on Follow Us on Facebook Follow Us on Twitter
©2006 Translations News