Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ensembl b38 HLA error. #22

Open
sridhar0605 opened this issue May 19, 2023 · 3 comments
Open

Ensembl b38 HLA error. #22

sridhar0605 opened this issue May 19, 2023 · 3 comments
Labels
bug Something isn't working

Comments

@sridhar0605
Copy link

Hi @EricKutschera ,

Using Ensembl gtf and fa with HLA contigs. I see the below error with ESPRESSO_S.pl step

[Thu May 18 21:40:23 2023] Summarizing annotated splice junctions for each read group
HLA-DRB1*03:01:01:02 ne HLA-DRB1*03: HLA-DRB1*03:01:01:02:12301:13089 at /bin/espresso/src/ESPRESSO_S.pl line 462.

Any thoughts? FWIW test data in the repo works fine.

Thank you.
Sid

@sridhar0605
Copy link
Author

For any that would run in to this issue, can confirm removing HLA contigs solved the issue.

grep -v 'HLA-' input.sam > input_filtered.sam

@EricKutschera
Copy link
Contributor

Here's the line for that error: https://github.com/Xinglab/espresso/blob/v1.3.2/src/ESPRESSO_S.pl#L462

ESPRESSO tries to keep some information in a string with : as a separator. Specifically it gives an ID to splice junctions like {chr}:{start}:{end}. Later it tries to parse that ID string, but that fails if the contig has : in the name

In this case HLA-DRB1*03:01:01:02:12301:13089 is the splice junction ID and the parts are HLA-DRB1*03:01:01:02, 12301, and 13089. ESPRESSO ends up thinking the part up to the first : (HLA-DRB1*03) is the contig name

Ideally ESPRESSO should be able to handle any contig name. I'll see if I can change this behavior

@sridhar0605
Copy link
Author

sridhar0605 commented May 22, 2023

Thanks, but my inclination was may be something to do with string/regex expansion. I tried hacking the script following perl regex but failed.

Thanks for looking in to this.

Sid

@EricKutschera EricKutschera added the bug Something isn't working label Aug 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants