This repository has been archived by the owner on Nov 5, 2024. It is now read-only.
Change input of Peptonizer to (peptide, confidence_score) maps #5
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Historically, the peptonizer pipeline required a PSMS-file as input (generated by tools such as Percolator or ms2rescore). However, the only information that the peptonizer really needs are peptide sequences and their associated confidence scores (produced by a search engine). We've decided that the responsibility of parsing PSMS-files is not part of the Peptonizer and that it's input will start from peptide -> confidence score maps.
This PR removes the specific parsing options for pout-files and ms2rescore files from the Peptonizer package and updates the package itself to start from a dictionary that contains the peptide -> confidence score map instead.
The snakemake workflow has remained unchanged, and can still start with PSMS-files as input. The parsing of these files has been moved to the workflow itself, since we don't want to contaminate the Peptonizer package itself with this parsing code.