Tags: facebookresearch/AudioDec
Tags
Update pretrained model (issue #9, #11) 1. According to issue #9, we implement the codec version (activate_audiodec) with more activations like HiFiGAN and release the pre-trained model “symAAD_vctk_48000_hop300”. 2. We fix the MSTFT 2D conv padding issues mentioned in issue #9 and release the updated “symADuniv_vctk_48000_hop300” and “AudioDec_v3_symADuniv_vctk_48000_hop300_clean”. 3. We implement the more flexible CausalConvTranspose1d padding for arbitrary kernel_size and stride according to issue #11. 4. We release a 24kbps model, “symAD_c16_vctk_48000_hop320”, which achieves better speech quality and robustness to unseen data.