You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
However, we noticed that when a user searched for the term Dior化粧品, it did not produce a match (using same analyzer settings). The reason is that the search term is tokenized as such:
Since the word cosmetics is the Japanese term 化粧品, it seems like the search term got analyzed correctly but the piece of text produced an unexpected bigram sequence of 化粧 and 品等
Not sure if this is a valid issue due to the mix of English/Japanese in the text or my Japanese fundamentals are off here
The text was updated successfully, but these errors were encountered:
Text =>
Dior化粧品等の輸入総代理店で
, which indexed with the default Kuromoji analyzer produces the following tokens:However, we noticed that when a user searched for the term
Dior化粧品
, it did not produce a match (using same analyzer settings). The reason is that the search term is tokenized as such:Since the word
cosmetics
is the Japanese term 化粧品, it seems like the search term got analyzed correctly but the piece of text produced an unexpected bigram sequence of 化粧 and 品等Not sure if this is a valid issue due to the mix of English/Japanese in the text or my Japanese fundamentals are off here
The text was updated successfully, but these errors were encountered: