Word segmentation improvement: add the uploader's (UP主) name to the dictionary as a single word #62
Comments
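Unfortunately, word segmentation itself is outside our control. It is a browser feature, so there is no way to supply it with a dictionary.
Then, before the text goes into the segmenter, what if the uploader's name in the description and title were wrapped in quotation marks? Would that help the segmenter recognize the name?
I ran a quick check: neither Chinese nor English quotation marks, nor spaces, change the segmenter's result -_-||
Solution (tip): the uploader's name can be handled by cutting it out of the text before segmentation and adding it directly, according to its weight, to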
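The idea of cutting the uploader's name out before segmentation can be sketched roughly as follows. This is only an illustration under assumptions, not the project's actual code: the function name `countWordsWithUploader`, the `Segmenter` type, the `nameWeight` parameter, and the sample name are all hypothetical, and the segmenter is passed in as a plain function so the sketch does not depend on any particular browser API.

```typescript
// Hypothetical sketch: none of these names come from the project itself.
// A segmenter is any function that turns text into a list of words.
type Segmenter = (text: string) => string[];

function countWordsWithUploader(
  text: string,
  uploaderName: string,
  segment: Segmenter,
  nameWeight = 1, // assumed weight applied to each occurrence of the name
): Map<string, number> {
  const counts = new Map<string, number>();

  // Cut the name out first so the segmenter never sees it.
  // An empty name is skipped, since split("") would break the text into characters.
  const pieces = uploaderName === "" ? [text] : text.split(uploaderName);
  const occurrences = pieces.length - 1;
  if (occurrences > 0) {
    counts.set(uploaderName, occurrences * nameWeight);
  }

  // Segment only the remaining pieces and count their words normally.
  for (const piece of pieces) {
    for (const word of segment(piece)) {
      if (word.trim() === "") continue; // ignore whitespace-only segments
      counts.set(word, (counts.get(word) ?? 0) + 1);
    }
  }
  return counts;
}

// Usage with a toy per-character segmenter that would otherwise split the name apart.
const counts = countWordsWithUploader(
  "小明同学的新视频",
  "小明同学", // made-up uploader name for illustration
  (t) => t.split(""),
  3,
);
```

In a browser, the `segment` argument could wrap whatever segmentation feature is in use (possibly `Intl.Segmenter`, though the thread does not name it). Because the name never reaches the segmenter, it stays whole regardless of how the segmenter would have split it.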
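Many uploaders' names are easily broken up into several words. I suggest adding the current uploader's name to the dictionary during segmentation so that it is not split apart.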