Skip to content

German Entity Recognition incorrect star_char/end_char #12548

Discussion options

You must be logged in to vote

@DuyguA maybe this ticket interests you?

I had a quick look, it seems to me that the issue is greetings doesn't exist in NER datasets usually. As a result model is clueless in such lines. Also, having rest of the paragraph would help, this expressions are not full sentences anyway just a greeting and a name.

I see 2 options here:

  • Make a small corpus of such cases with greeting + person names, update the NER component so make a custom model.
  • Second option is to play some tricks. In the second example, I put a comma after Hallo and a period at the sentence like this: Hallo, Herr Mueller. , it worked. In the first sentence, I cut the greeting totally and the expressions is only Nadia, the…

Replies: 5 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by svlandeg
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@42elenz
Comment options

Comment options

You must be logged in to vote
1 reply
@adrianeboyd
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lang / de German language data and models feat / ner Feature: Named Entity Recognizer perf / accuracy Performance: accuracy
6 participants
Converted from issue

This discussion was converted from issue #12488 on April 19, 2023 13:27.