Skip to content

Pull requests: NVIDIA/NeMo-Curator

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

ci: Bump release workflow
#373 opened Nov 15, 2024 by ko3n1g Loading…
3 tasks
Synthetic data generation for Retriever Evaluation
#370 opened Nov 14, 2024 by vinay-raman Loading…
3 tasks done
ci: Add copyright-check workflow
#369 opened Nov 14, 2024 by ko3n1g Loading…
3 tasks
Update to latest Crossfit gpuci Run GPU CI/CD on PR
#365 opened Nov 14, 2024 by VibhuJawa Draft
Task-Complexity Classifier
#364 opened Nov 13, 2024 by sarahyurick Draft
Type of Speech Classifier
#361 opened Nov 13, 2024 by sarahyurick Draft
Synthetic Data Generation for Retriever Evaluation
#338 opened Oct 30, 2024 by vinay-raman Loading…
3 tasks done
Add codepath for computing buckets without int conversion
#326 opened Oct 25, 2024 by ayushdg Loading…
3 tasks done
Add support for finetune guard classifier
#325 opened Oct 25, 2024 by VibhuJawa Loading…
Dapt data curation tutorial fuzzy and semantic dedupe gpuci Run GPU CI/CD on PR
#322 opened Oct 24, 2024 by ruchaa-apte Loading…
MinHash improvement using minhash_permuted enhancement New feature or request gpuci Run GPU CI/CD on PR
#313 opened Oct 18, 2024 by praateekmahajan Loading…
3 tasks
Added example notebook for translation with ct2 model. documentation Improvements or additions to documentation
#262 opened Sep 25, 2024 by uahmed93 Draft
3 tasks
Add support for parallel data curation
#193 opened Aug 8, 2024 by shuoyangd Loading…
3 tasks done
Fixed bug: changed to correct model name
#186 opened Aug 6, 2024 by ByteWrite Loading…
1 of 3 tasks
Add Multiple Model Classification example documentation Improvements or additions to documentation
#173 opened Jul 30, 2024 by sarahyurick Loading…
ProTip! Follow long discussions with comments:>50.