Exception with special UTF-8 chars #11

Open
samuelvogel opened this issue Apr 21, 2017 · 0 comments
Labels: bug, minor

Comments

@samuelvogel
Member

The following street, which contains fullwidth parentheses, can't be split:

guangdong zhu hai shi xiang zhou qu wan zai jie dao shi jiao lu eBuy wu liu yuan zhi yong heng ku 66682678（86-13697750078）

See here http://www.utf8-chartable.de/unicode-utf8-table.pl?start=65280&utf8=string-literal:

U+FF08	（	\xef\xbc\x88	FULLWIDTH LEFT PARENTHESIS
U+FF09	）	\xef\xbc\x89	FULLWIDTH RIGHT PARENTHESIS
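
As an illustration only, here is a minimal Python sketch of one possible workaround: mapping the two fullwidth parentheses listed above to their ASCII counterparts before the address is passed to a splitter. The helper name `normalize_parentheses` and the mapping table are hypothetical and not part of the library this issue was filed against; the sketch also does not claim to describe how the exception arises.

```python
# Hypothetical pre-processing step; not part of the library discussed in this issue.
FULLWIDTH_TO_ASCII = str.maketrans({"\uff08": "(", "\uff09": ")"})


def normalize_parentheses(address: str) -> str:
    """Replace fullwidth parentheses (U+FF08, U+FF09) with ASCII ones."""
    return address.translate(FULLWIDTH_TO_ASCII)


# The UTF-8 encodings match the byte sequences in the table above.
assert "\uff08".encode("utf-8") == b"\xef\xbc\x88"
assert "\uff09".encode("utf-8") == b"\xef\xbc\x89"

# Tail of the reported address, written with explicit escapes.
sample = "yong heng ku 66682678\uff0886-13697750078\uff09"
print(normalize_parentheses(sample))
```

Whether such a normalization belongs in the caller or in the splitter itself is a design decision this sketch leaves open.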
@samuelvogel added the bug label Apr 21, 2017
@samuelvogel changed the title from "Exception when splitting" to "Exception with special UTF-8 chars" Apr 21, 2017
@svenmuennich added the minor label Mar 1, 2018