Fix Datasets Currently Mapped as Tasks.TEXTUAL_ENTAILMENT #682
Comments
@jason-fries The issue with medical_data is addressed in the conversation in #613
@shamikbose thanks for the comment. If a simple classification schema isn't suited, then this should be a text pairs task, provided it involves reasoning over two units of text. The most important issue is that this is not an entailment task.
I can tackle medical_data tomorrow.
@jason-fries medical_data is updated to `bigbio_pairs` in #684.
* Update medical_data.py
  Updated to `bigbio_pairs` schema
  Passes all tests
* Update medical_data.py
* refactor: Refactor SAMD dataset implementation to hub-based schema
* fix: Change task for SAMD dataset to TEXT_PAIRS_CLASSIFICATION
* Fixed license

---------

Co-authored-by: Mario Sänger <[email protected]>
Co-authored-by: Florian Borchert <[email protected]>
Both datasets mentioned here now have other tasks assigned, so I will close the issue.
Several datasets are currently incorrectly tagged as supporting `Tasks.TEXTUAL_ENTAILMENT`. Perhaps I'm misunderstanding the tasks, but these largely seem like text classification/labeling problems, not entailment. The fix is to migrate these to the correct schema.
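To make the proposed migration concrete, here is a minimal sketch of what retagging could look like. It is not the project's code: the `Tasks` enum below is a hypothetical stand-in that models only the two task names mentioned in this thread, and the registry, dataset names, and `retag` helper are all invented for illustration.

```python
from enum import Enum


class Tasks(Enum):
    # Hypothetical stand-in for the project's task enum; only the two
    # task names that appear in this thread are modelled here.
    TEXTUAL_ENTAILMENT = "TEXTUAL_ENTAILMENT"
    TEXT_PAIRS_CLASSIFICATION = "TEXT_PAIRS_CLASSIFICATION"


# Hypothetical registry: dataset name -> declared tasks (illustrative only).
DECLARED_TASKS = {
    "dataset_a": [Tasks.TEXTUAL_ENTAILMENT],
    "dataset_b": [Tasks.TEXTUAL_ENTAILMENT],
}

# Datasets a reviewer has judged to be pair classification, not entailment.
NOT_ENTAILMENT = {"dataset_a"}


def retag(declared, not_entailment):
    """Return a new registry with reviewed datasets moved off TEXTUAL_ENTAILMENT."""
    updated = {}
    for name, tasks in declared.items():
        if name in not_entailment:
            tasks = [
                Tasks.TEXT_PAIRS_CLASSIFICATION
                if task is Tasks.TEXTUAL_ENTAILMENT
                else task
                for task in tasks
            ]
        # Copy the list so the input registry is never mutated.
        updated[name] = list(tasks)
    return updated


if __name__ == "__main__":
    for name, tasks in retag(DECLARED_TASKS, NOT_ENTAILMENT).items():
        print(name, [task.name for task in tasks])
```

In the actual repository, the change would presumably be made in each dataset's loader script together with its schema, as was done for medical_data in #684.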