Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Malformed characters in contents #1

Open
darkgeek opened this issue Sep 18, 2015 · 0 comments
Open

Malformed characters in contents #1

darkgeek opened this issue Sep 18, 2015 · 0 comments
Assignees

Comments

@darkgeek
Copy link
Owner

There are many malformed characters in lookup results, especially <pr></pr> blocks, just like this:
2015-09-19-113324_202x149_scrot
In fact, its entity name is aitalic_, but unfortunately, it's mapped to &#xFFFD, which is not a valid Unicode character.
A possible solution is to digging into the official gcide repo (http://git.savannah.gnu.org/r/gcide.git), find the respective word definition in CIDE.* files, get the symbol name (in the previous example, aitalic_ has the symbol name <ait/), and then find its real character (in this case, 𝘢) in symbols.png.

@darkgeek darkgeek added the bug label Sep 18, 2015
@darkgeek darkgeek self-assigned this Sep 18, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant