-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Word pages look unformatted #129
Comments
"Intentional" is the wrong word. |
I suppose Wikimedia should have the parser for this markup. Maybe you can import them? |
I have this problem as well in my Python tool: ilius/pyglossary#48 I think using |
There actually is an easy way to extract the formatted data using https://github.com/tatuylonen/wiktextract |
That tool simply downloads the rendered HTML from Wiktionary website one entry at a time. |
You use it to extract the information which you can then convert to the same format this dictionary is using, making it human readable. I'm using it in my app, there's no readme yet but you can compile and see for yourself how its much cleaner and readerable |
The pages for each particular work look unformatted with lots of metadata tags output as raw text.
Is this intentional? I've just installed the app and testing.
An example (EN.quickdic) rendered in QuickDic compared to the same Wiktionary page in Firefox Android:
App version: 5.5.6
The text was updated successfully, but these errors were encountered: