Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

YAHS output is identical to the input assembly #82

Open
sarjopp opened this issue Feb 12, 2024 · 3 comments
Open

YAHS output is identical to the input assembly #82

sarjopp opened this issue Feb 12, 2024 · 3 comments

Comments

@sarjopp
Copy link

sarjopp commented Feb 12, 2024

For three different assemblies, the output from YAHS is exactly the same as the input assembly. I used the Arima-HiC Mapping Pipeline to map my HiC reads to my assemblies. Assemblies are either HiFi-only (assembled with hifiasm) or HiFi+BioNano. The bam mapping statistics look great.

I tried starting with higher resolution (-r 1000) but got the same result. The log file reports
assembly N50 (17086220) too small. Scaffolding anyway

Are my assemblies simply too fragmented for YAHS to succeed? For my organism, this is an exceptionally good N50! The total genome size is 400M and there are 31 chromosomes.

@richarddurbin
Copy link
Collaborator

richarddurbin commented Feb 12, 2024 via email

@sarjopp
Copy link
Author

sarjopp commented Feb 13, 2024

Hi Richard, thanks for your reply! In the post-bionano assembly there are 39 scaffolds, ranging in size from 0,1M to 19.3M. The total length (403.1Mb) matches well with flow cytometry estimates of genome size (~400Mb), so some of the "extra" scaffolds (in cf to the known chromosome number of 31) are presumably fragments of the same chromosome. Which is one of the things I was hoping to resolve with HiC + bionano data.

You asked for sizes, so here you go!
Super-Scaffold_37 19.3M
Super-Scaffold_4 18.4M
Super-Scaffold_9 15.6M
Super-Scaffold_25 15.0M
Super-Scaffold_1 14.6M
Super-Scaffold_12 14.5M
Super-Scaffold_97 14.5M
Super-Scaffold_31 14.3M
Super-Scaffold_33 14.3M
Super-Scaffold_26 13.9M
Super-Scaffold_19 13.7M
Super-Scaffold_83 13.7M
Super-Scaffold_2 13.3M
Super-Scaffold_18 12.9M
Super-Scaffold_28 12.8M
Super-Scaffold_110 12.7M
Super-Scaffold_5 12.7M
Super-Scaffold_85 12.6M
Super-Scaffold_23 12.4M
Super-Scaffold_100016 12.3M
Super-Scaffold_14 11.6M
Super-Scaffold_78 11.5M
Super-Scaffold_32 10.9M
Super-Scaffold_16 10.6M
Super-Scaffold_7 10.4M
Super-Scaffold_100023 9.4M
Super-Scaffold_20 9.4M
Super-Scaffold_30 9.4M
Super-Scaffold_100026 8.1M
Super-Scaffold_13 7.7M
Super-Scaffold_24 7.0M
Super-Scaffold_100031 5.1M
Super-Scaffold_100037 3.8M
Super-Scaffold_34 3.8M
Super-Scaffold_77 0.3M
Super-Scaffold_100141 0.2M
Super-Scaffold_100277 0.2M
Super-Scaffold_100147 0.1M
Super-Scaffold_100148 0.1M

@richarddurbin
Copy link
Collaborator

richarddurbin commented Feb 13, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants