SpanRuler to indicate spans of arbitrary length #13373
Gitclop
started this conversation in
Help: Best practices
Replies: 1 comment
You probably want to just have a custom component with some rule-based logic to get you the final outputs. The
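A minimal sketch of what such a rule-based custom component could look like, assuming spaCy 3.3+ (for `span_ruler`). The component name `error_message_marker`, the spans key `error_rules`, the label `ERROR_MESSAGE`, and the stopping rule (the message ends at the next line break or at the end of the text) are all made up for illustration; the stopping rule in particular would need to be adapted to how the reports are actually laid out.

```python
import spacy
from spacy.language import Language
from spacy.tokens import Span


@Language.component("error_message_marker")  # hypothetical component name
def error_message_marker(doc):
    """Mark the text following each trigger phrase as an error message."""
    spans = []
    # "ruler" is the default spans key that SpanRuler writes its matches to.
    for trigger in doc.spans.get("ruler", []):
        start = trigger.end
        end = start
        # Assumed stopping rule: extend until a token containing a line break
        # (or the end of the doc). Replace this with whatever delimits the
        # error messages in your data.
        while end < len(doc) and "\n" not in doc[end].text:
            end += 1
        if end > start:
            spans.append(Span(doc, start, end, label="ERROR_MESSAGE"))
    # Stored under a separate (hypothetical) key so spancat output is untouched.
    doc.spans["error_rules"] = spans
    return doc


nlp = spacy.blank("en")

# SpanRuler only finds the fixed indicator phrase, not the message itself.
ruler = nlp.add_pipe("span_ruler")
ruler.add_patterns(
    [
        {
            "label": "TRIGGER",
            "pattern": [{"LOWER": "validation"}, {"LOWER": "error"}, {"ORTH": ":"}],
        }
    ]
)
nlp.add_pipe("error_message_marker")

text = (
    "Upload failed with validation error: field 'id' is missing. "
    "Retry did not help.\nNext step was a reboot."
)
doc = nlp(text)
messages = [span.text for span in doc.spans["error_rules"]]
print(messages)
```

The idea is that SpanRuler handles the part that is actually fixed (the indicator phrase), and the variable-length part is handled by plain Python after it. The rule-based spans could then be merged with the spancat predictions, or used on their own for the cases the model keeps missing.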
I accidentally posted this question in the Prodigy forum, but I think it belongs in the spaCy forum:
I am training Spancat-Singlelabel to extract error messages from human-written reports.
I've annotated around 1200 examples and so far the results are OK (F1 score of 0.75).
Many error messages are almost always introduced by the same phrases, for example "... validation error: <error_message>. ..."
<error_message> can vary in length (often multiple sentences) and in its tokens. My training examples include a lot of these cases (only the error_message is annotated, not the indicating phrase before it), but spaCy is still missing a lot of error messages.
So my idea was to use SpanRuler to mark the indicating phrases in front of the error messages, but I am not sure how to use it to mark a span of arbitrary length after such a phrase.
Or is it not the right tool for the job?