Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter out uncommon nouns #5

Open
tragram opened this issue Nov 21, 2020 · 0 comments
Open

Filter out uncommon nouns #5

tragram opened this issue Nov 21, 2020 · 0 comments
Labels
bug Something isn't working

Comments

@tragram
Copy link
Owner

tragram commented Nov 21, 2020

The noun list is generated using a word frequency list. There are some words, that fit both the noun and a different category. This leads to not common nouns being included, just because they are the same as a "different" common word. An example of this would be i.e. "mäkinen" - doubles as "hilly" and a very common surname (capital M, obviously), but which also technically means a small hill, which isn't a terribly common word, according to the Finns I asked. It therefore probably shouldn't be included in the TOP 1000 list of nouns.

@tragram tragram added the bug Something isn't working label Nov 21, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant