-
Notifications
You must be signed in to change notification settings - Fork 708
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expand GTF filtering to remove rows with empty transcript ID when required, fix STAR GTF usage #1107
Conversation
This PR is against the
|
|
Python linting (
|
@drpatelh and I had a long discussion about this PR yesterday, and we think there should be a parameter to disable the GTF filtering entirely, mostly to avoid an unnecessary process where we're confident about the GTF. I think we could address your point here by having the process die with a message when the file ends up empty. If the user is sure they want to proceed they can then disable the filter, and will get the relevant error messages from any active consuming processes that have a problem with that. Edit: problem with that is when scaffold complement is good, but transcript ids are not. Will propose something soon. |
I believe I have addressed both your feedback:
|
@nf-core-bot fix linting |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Smart! I like both: The approach and your implementation of it! Great work!
@nf-core-bot fix linting |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Agree. Nice work! Hopefully, a step in the right direction to solving these painful GTF formatting issues we have seen numerous times.
Thank you for everyone that worked on this problem ! |
In this PR:
filter_gtf_for_genes_in_genome.py
(Python only task!) #1082 while I'm at itCloses #1074 #1102 #1082
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).