-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closes #637 #735
Closes #637 #735
Conversation
…ontent tweaks (bigscience-workshop#730) Small changes only
* edit * unchange * stats * added init for ptm * added proc meta script * add single * add vis code * added vis changes * remove proc file * add vis code * add paper script * edit scripts * edit scripts * add readme * remove wip code * add ngram back in * black and isort vis code * move * added pdfs * added pdfs that are not local and not broken * added agg pdf
update streamlit instructions
@galtay @hakunanatasha Are the main blockers to merging this updating this to be consistent with the other hf-hub dataloaders? And is CONTRIBUTING.md updated so with instructions of what needs to be done to put it in the correct format? If so, I can update and resubmit. |
@phlobo I refactored the implementation to the new hub style integration and fixed some minor issues. Please have a look at it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Super interesting dataset, lgtm!
Add the plant-phenotype dataset. Closes #637
New location for dataset (that corrects errors of nested/missing entities): https://github.com/davidkartchner/PPRcorpus
Checkbox
biodatasets/my_dataset/my_dataset.py
(please use only lowercase and underscore for dataset naming)._CITATION
,_DATASETNAME
,_DESCRIPTION
,_HOMEPAGE
,_LICENSE
,_URLs
,_SUPPORTED_TASKS
,_SOURCE_VERSION
, and_BIGBIO_VERSION
variables._info()
,_split_generators()
and_generate_examples()
in dataloader script.BUILDER_CONFIGS
class attribute is a list with at least oneBigBioConfig
for the source schema and one for a bigbio schema.datasets.load_dataset
function.python -m tests.test_bigbio biodatasets/my_dataset/my_dataset.py
.