What's Changed
- Update README.md by @eltociear in #41
- Add Demo Badges for SVC, TTA, and TTS by @RMSnow in #42
- Avoid Unbound Case of the download_root by @Adorable-Qin in #48
- Provide DEMO guide and express thanks by @Adorable-Qin in #55
- DiffWave Vocoder Added by @VocodexElysium in #56
- fix cosine_schedule_with_warmup for VALLE training by @HeCheng0625 in #52
- Added HifiTTS data preprocessor by @zyingt in #53
- fix a bug for vocoder inference by @VocodexElysium in #65
- Fix issues with while loop and trailing slash due to using sh instead of bash by @YasienDwieb in #60
- Fix Compatibility Issue with 'accelerate' Package by Reverting to Version 0.24.1 by @HarryHe11 in #73
- Add Resemblyzer for Speaker Similarity Evaluation & Bug fixes by @Merakist in #75
- Custom dataset & resume training recipe for SVC task by @viewfinder-annn in #72
- Fix bug for issue 76 (Import VariableSampler error) by @HeCheng0625 in #82
- Metrify RawNet3/Resemblyzer as Keywords & Update READMEs by @Merakist in #85
- remove redundant codes and update the function for fs2 feature by @ChenX17 in #86
- Adding Contribution Guideline for Amphion by @HarryHe11 in #92
- Check & Update PR Template by @HarryHe11 in #96
- Add issue templates by @yuantuo666 in #98
- Add WavLM speaker similarity for evaluation by @HeCheng0625 in #97
- Add AudioCaps dataset link for TTA by @HeCheng0625 in #100
- Delete utils/whisper.py by @HarryHe11 in #102
- Accelerate the calculation for CER metrics by @wsywsywsywsywsy979 in #104
- Fix bug for VITS resuming training by @lmxue in #108
- Add VALL-E pre-trained model trained on 6k-hour Librilight by @lmxue in #101
- Add preprocessing scripts for the librilight datasets by @HarryHe11 in #107
- Implement VitsSVC resume training / finetune feature by @viewfinder-annn in #95
- MFA Restructure & Environment Bug Fixes by @Merakist in #121
- Update VALL-E prompt examples by @lmxue in #126
- Update DiffComoSVC by @Lokshaw-Chau in #135
- Refine the multilingual front-end processing module by @lmxue in #137
- fix: G2P module fails to initialize #138 by @yuantuo666 in #139
- feat: support Docker installation by @yuantuo666 in #140
- Add support of visualization by @lmxue in #141
- Multi-speaker VITS & Hi-Fi TTS dataset structure by @zyingt in #131
New Contributors
- @eltociear made their first contribution in #41
- @zyingt made their first contribution in #53
- @YasienDwieb made their first contribution in #60
- @HarryHe11 made their first contribution in #73
- @Merakist made their first contribution in #75
- @yuantuo666 made their first contribution in #98
- @wsywsywsywsywsy979 made their first contribution in #104
- @Lokshaw-Chau made their first contribution in #135
Full Changelog: v0.1.0...v0.1.1-alpha