There is currently no mature automatic dataset-making workflow for English.
For Chinese and Japanese, we use the Montreal Forced Aligner (MFA) to get phoneme durations from lyrics. This requires models pretrained on a large singing corpus (~50h), and we haven't done that for English yet.
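To make "phoneme durations" concrete: a forced aligner produces time-stamped phoneme intervals, and each duration is simply the interval's end time minus its start time. The snippet below is only a minimal sketch of that last step. The `phoneme_durations` helper and the example intervals are hypothetical, made up for illustration; they are not real aligner output and not part of any existing workflow.

```python
from typing import List, Tuple

# Hypothetical representation of one aligned phoneme:
# (start time in seconds, end time in seconds, phoneme label).
Interval = Tuple[float, float, str]


def phoneme_durations(intervals: List[Interval]) -> List[Tuple[str, float]]:
    """Convert time-stamped phoneme intervals into (phoneme, duration) pairs."""
    durations = []
    for start, end, phone in intervals:
        if end < start:
            raise ValueError(f"interval for {phone!r} ends before it starts")
        # Round to milliseconds to hide floating-point noise.
        durations.append((phone, round(end - start, 3)))
    return durations


# Made-up intervals for illustration only.
example = [(0.00, 0.25, "SP"), (0.25, 0.40, "h"), (0.40, 1.10, "ah")]
print(phoneme_durations(example))
```

In practice the intervals would be read from the aligner's output files rather than written by hand.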
@yqzhishen Hey, how are you? I have about 40-50 hours of private English singing data. Can you tell me or guide me on how to train an MFA model on it? What are the requirements? For example, do these 40-50 hours need to be transcribed, or meet some other condition? Thanks, can't wait to hear from you!
I have seen many manual dataset creation steps in the workflow. Is there any way to do them automatically?
Also, could we connect by email? I have several more questions.