Training results for Russian 22050Hz Natasha #32
Replies: 7 comments 13 replies
-
cool. It seems it needs more steps to fix sounding. looks like vocoder is not trained to that point |
Beta Was this translation helpful? Give feedback.
-
This time I trained it with SDP for 138K epochs. Now it sounds far better. https://drive.google.com/drive/folders/1y8cDIp0MmSP2LS6V8jZ7fjKLKfw33kas?usp=sharing |
Beta Was this translation helpful? Give feedback.
-
Also, is it still using hifigan vocoder or you used work from your branch?
…On 8/30/23, p0p ***@***.***> wrote:
Also with dp-discriminator?
--
Reply to this email directly or view it on GitHub:
#32 (reply in thread)
You are receiving this because you commented.
Message ID:
***@***.***>
--
with best regards Beqa Gozalishvili
Tell: +995593454005
Email: ***@***.***
Web: https://gozaltech.org
Skype: beqabeqa473
Telegram: https://t.me/gozaltech
facebook: https://facebook.com/gozaltech
twitter: https://twitter.com/beqabeqa473
Instagram: https://instagram.com/beqa.gozalishvili
|
Beta Was this translation helpful? Give feedback.
-
@shigabeev Would you mind sharing what were your steps to organize the dataset and prepare it for training? I am trying to train a model on my native language but I am facing issues, as seen here: |
Beta Was this translation helpful? Give feedback.
-
It would be cool to see samples.
…On 9/1/23, p0p ***@***.***> wrote:
Oops, I'll look into that today. Thanks for letting me know.
--
Reply to this email directly or view it on GitHub:
#32 (reply in thread)
You are receiving this because you commented.
Message ID:
***@***.***>
--
with best regards Beqa Gozalishvili
Tell: +995593454005
Email: ***@***.***
Web: https://gozaltech.org
Skype: beqabeqa473
Telegram: https://t.me/gozaltech
facebook: https://facebook.com/gozaltech
twitter: https://twitter.com/beqabeqa473
Instagram: https://instagram.com/beqa.gozalishvili
|
Beta Was this translation helpful? Give feedback.
-
@shigabeev hi! |
Beta Was this translation helpful? Give feedback.
-
Natasha is already normalized and with accents. So I just added trained NN on it as it is, on graphemes.
…________________________________
From: Xmiler ***@***.***>
Sent: Friday, December 29, 2023 12:26:56 PM
To: p0p4k/vits2_pytorch ***@***.***>
Cc: Ilya Shigabeev ***@***.***>; Mention ***@***.***>
Subject: Re: [p0p4k/vits2_pytorch] Training results for Russian 22050Hz Natasha (Discussion #32)
@shigabeev<https://github.com/shigabeev> hi!
Thanks for sharing your result. Could you tell us the way you pnonemized your dataset please 🙏
—
Reply to this email directly, view it on GitHub<#32 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACFZXD5FAARNTKB5MHZH7MTYL2EGBAVCNFSM6AAAAAA4DR2WYWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TSNZQHEYTE>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
The model has trained for only 58,000 steps with no sdp and no duration discriminator. The number of text encoder layers was increased from 6 to 10.
The model, a sound sample and symbols.py for russian can be found on Google drive.
https://drive.google.com/drive/folders/1v-jGF8k_gfIUHHFafA1qGOtzDm3YlVbs?usp=sharing
Beta Was this translation helpful? Give feedback.
All reactions