Libri-light 㯠LibriVox ããçæãããã³ã¼ãã¹1. ãªã®ã§ LibriSpeech ã®è¦ªæ2.
- Unlabelled Speech Training Set
- unlab-60k
- unlab-6k
- unlab-600
- Dev and Test Set (totally same as LibriSpeech3)
- dev-clean: 5.4 hours
- dev-other: 5.3 hours
- test-clean: 5.4 hours
- test-other: 5.1 hours
Trainã«ã¯dev/testã®è©±è ãå«ã¾ããªãããé¤å¤æ¸ã¿4.
ç¸å½ã«ãã«ãã³ã¼ãã¹ã§ããã¼ã¿ã¯æ¯è¼ç綺éºã ãã©ç¡é³åºéã¨ãã¯æ®éã«æ®ã£ã¦ã.
Riviere-2020-Toward è«æã§ãã¤ãºã¨ASRã®é¢ä¿ãæ¢ããã¦ãã¦ããã®éã«ã³ã¼ãã¹ã®ã¯ãªã¼ãã³ã°ãè¡ããã¦ãã.
unlab-60k ããé¸å¥ããã¦6kã¨600ãä½ãç´ãã¦ãããLL6k-e-loCTC / LL600-e-loCTC ã¨åä»ãããã¦ãã (unlab-60k ã¯å®è³ªçãªçºå£°é·ãã4.7kãããã½ã).
-
“This dataset was obtained by extracting audio files for English speech from the LibriVox repository”↩
-
“LibriSpeech is … derived from read audiobooks from the LibriVox project”↩
-
“The dev and test sets are the same as that of LibriSpeech”↩
-
“We then removed … speakers appearing in LibriSpeech dev and test sets.”↩