[RU] Decode pronunciations #1376
BoboTiG added a commit that referenced this issue on Sep 2, 2022
I completely reviewed how pronunciations are retrieved in order to tackle that task, including where they are looked for: previously we used the wikicode of the whole word; now we only use the `head_sections` wikicode, which is faster and more efficient.

On top of having locale-specific formatting (like `\...\` for French, `[...]` for German, etc.), it has the following nice side effects:

- `EN`, `FR`: pronunciations spanning multiple sections are now supported (for instance, on English data, ones from the Middle English section combined with ones from the Old English section)
- `FR`: pronunciations outside the French section are no longer taken into account (values from Gaulois or other locale sections were incorrectly used before)
- `PT`: a lot more pronunciations are now correctly found
- `SV`: multiple pronunciations are now supported
- `RU`: it will be way easier to tackle #1376
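To illustrate the locale-specific formatting mentioned above, here is a minimal sketch. It is not the project's actual code: `WRAPPERS`, `DEFAULT_WRAPPER` and `format_pronunciations` are hypothetical names, and the `/.../` fallback for other locales is an assumption. Only the French `\...\` and German `[...]` delimiters come from the commit message.

```python
# Hypothetical sketch: names and the default delimiters are assumptions,
# not the project's actual implementation.
WRAPPERS: dict[str, tuple[str, str]] = {
    "fr": ("\\", "\\"),  # French: \...\
    "de": ("[", "]"),    # German: [...]
}
DEFAULT_WRAPPER = ("/", "/")  # assumed fallback for locales not listed above


def format_pronunciations(locale: str, pronunciations: list[str]) -> list[str]:
    """Wrap each raw pronunciation in the delimiters used for the locale."""
    opening, closing = WRAPPERS.get(locale, DEFAULT_WRAPPER)
    # Deduplicate while preserving order, since several sections may
    # repeat the same pronunciation.
    unique = list(dict.fromkeys(pronunciations))
    return [f"{opening}{p}{closing}" for p in unique]
```

Calling `format_pronunciations("de", ["a", "a", "b"])` would give `["[a]", "[b]"]`. Keeping the delimiters in a lookup table means a new locale only needs one extra entry.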
BoboTiG added a commit that referenced this issue on Sep 3, 2022
Russian pronunciations are generated by the `transcriptions-ru` Lua script. We should port it to Python in order to display proper pronunciations.
Note that it will be easier to tackle after #1174.