You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Has anyone been able to finetune the model on their dataset? I tried on proprietary dataset (very good quality audio at 44.1-48kHz, expressive) but could not improve the final result; indeed some artefacts are introduced [I run stage 1 loading enhancer from pre-trained release model and then stage 2].
I was wondering if anyone managed to fine-tune the model on their own dataset. Because I'm getting the doubt that it can't be done without having the discriminator weights.
The text was updated successfully, but these errors were encountered:
Has anyone been able to finetune the model on their dataset? I tried on proprietary dataset (very good quality audio at 44.1-48kHz, expressive) but could not improve the final result; indeed some artefacts are introduced [I run stage 1 loading enhancer from pre-trained release model and then stage 2].
I was wondering if anyone managed to fine-tune the model on their own dataset. Because I'm getting the doubt that it can't be done without having the discriminator weights.
The text was updated successfully, but these errors were encountered: