Fix Datasets Currently Mapped as Tasks.TEXTUAL_ENTAILMENT #682

jason-fries · 2022-06-04T22:11:11Z

Several datasets are currently tagged as supporting Tasks.TEXTUAL_ENTAILMENT incorrectly. Perhaps I'm misunderstanding the tasks, but these largely seem like text classification/labeling problems not entailment.

medical_data is a sentiment analysis task not textual entailment. Better suited to pairs or text classification
evidence_inference this is labeling a relationship between text snippets i.e., "intervention of interest either significantly increased, significantly decreased or had significant effect on the outcome, relative to the comparator"

Fix is migrating these to the correct schema

The text was updated successfully, but these errors were encountered:

shamikbose · 2022-06-05T00:25:53Z

@jason-fries The issue with medical_data is addressed in the conversation in #613

So every text comes with a drug mention and what the text thinks of that specific drug. Putting this in the Classification format loses some of that information

jason-fries · 2022-06-05T01:15:54Z

@shamikbose thanks for the comment. If a simple classification schema isn’t suited then this should be a text pairs task if it involves reasoning over 2 units of text. The most important issue is that this is not an entailment task.

shamikbose · 2022-06-05T01:18:04Z

I can tackle the medical_data tomorrow

shamikbose · 2022-06-05T02:41:41Z

@jason-fries medical_data is updated to bigbio_pairs in #684

* Update medical_data.py Updated to `bigbio_pairs` schema Passes all tests * Update medical_data.py * refactor: Refactor SAMD dataset implementation to hub-based schema * fix: Change task for SAMD dataset to TEXT_PAIRS_CLASSIFICATION * Fixed license --------- Co-authored-by: Mario Sänger <[email protected]> Co-authored-by: Florian Borchert <[email protected]>

phlobo · 2024-10-26T07:38:12Z

Both datasets mentioned here now have other tasks assigned, therefore I will close the issue.

jason-fries added the bug Something isn't working label Jun 4, 2022

phlobo mentioned this issue Oct 25, 2024

Closes part of #682 #684

Merged

8 tasks

phlobo closed this as completed Oct 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Datasets Currently Mapped as Tasks.TEXTUAL_ENTAILMENT #682

Fix Datasets Currently Mapped as Tasks.TEXTUAL_ENTAILMENT #682

jason-fries commented Jun 4, 2022

shamikbose commented Jun 5, 2022

jason-fries commented Jun 5, 2022

shamikbose commented Jun 5, 2022

shamikbose commented Jun 5, 2022

phlobo commented Oct 26, 2024

Fix Datasets Currently Mapped as Tasks.TEXTUAL_ENTAILMENT #682

Fix Datasets Currently Mapped as Tasks.TEXTUAL_ENTAILMENT #682

Comments

jason-fries commented Jun 4, 2022

shamikbose commented Jun 5, 2022

jason-fries commented Jun 5, 2022

shamikbose commented Jun 5, 2022

shamikbose commented Jun 5, 2022

phlobo commented Oct 26, 2024