PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Updated Feb 27, 2022 - Python
Transformer-based Korean speech recognition
Deep learning-based subtitle generation model that processes audio datasets to generate accurate text transcriptions. Includes audio feature extraction, encoder-decoder architecture, training pipelines, and evaluation metrics for subtitle alignment.