-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Detected PII word's "start" and "end" are returning the wrong positions #96
Comments
Thanks @flaviabeo . This should have been fixed with: #47 |
What version of orchestrator is being used here? edit: was able to reproduce on latest (b3231a5) The results are correct if a sentence chunker is not used, which points to an issue potentially still with the offsets... |
Root cause was that the detectors used are not returning an output per input i.e. each text in In the meantime it was noticed that with #76 we are doing unnecessary codepoint slicing again to determine |
With the merge of #102 , there will be a clear error message and |
Fixed by the PR #102! |
Describe the bug
The start and end fields returned are different from expected. For example, the e-mail is not in the mask position returned by the request response from the detected PII.
Platform
Please provide details about the environment you are using, including the following:
GR Version 2.0 NLP Client, TLS
Sample Code
POST call to
/api/v1/task/classification-with-text-generation
with the payload:Expected behavior
Observed behavior
The text was updated successfully, but these errors were encountered: