🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Updated Aug 16, 2024 - Python
SoftVC VITS Singing Voice Conversion
Easily train a good VC model with 10 minutes or less of voice data!
so-vits-svc fork with realtime support, improved interface and more features.
End-to-End Speech Processing Toolkit
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
A simple, high-quality voice conversion tool focused on ease of use and performance
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
A fork of RVC for easy local voice conversion of audio files
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
A locally deployable AI voice toolbox | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
The code for the bark-voicecloning model. Training and inference.
Unsupervised Speech Decomposition Via Triple Information Bottleneck
Singing voice conversion based on Whisper, with LoRA for singing voice cloning
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Voice Conversion Tool Kit