PacBio with 10X barcodes, and partial alignments within it (ideally aiming for all_contig.fasta
-like output)
#1694
-
First off, thank you for the package. There are protocols which mix 10X barcodes with PacBio long read sequencing. The paper uses MiXCR for processing, but more recent versions of the package outperform their proposed workflow in terms of tidiness. I identify appropriate adapters at the start/end of the read, doing a reverse complement if needed, pull out the cell barcode and UMI from the start of the read into mixcr analyze generic-pacbio-with-umi \
--species hsa \
--tag-pattern "^(UMI:N{28})\^(R2:*)" \
R1.fastq \
R2.fastq \
mixcr/mixcr
mixcr exportClones \
--dont-split-files \
-isProductive CDR3 \
-uniqueTagCount Molecule \
-tagCounts \
mixcr/mixcr.clns \
mixcr/mixcr.tsv This results in a file which in principle holds all required information for importing the data into Scirpy via the
Please copy the following information along with the stacktrace:
Version: 4.6.0; built=Sat Dec 09 19:48:42 UTC 2023; rev=c9fafa41fe; lib=repseqio.v4.0
OS: Linux
Java: 17.0.6
Cmd args: refineTagsAndSort --report pls/mixcr.refine.report.txt --json-report pls/mixcr.refine.report.json pls/mixcr.vdjca pls/mixcr.refined.vdjca
picocli.CommandLine$ExecutionException: Error while running command refineTagsAndSort java.lang.IllegalArgumentException: Unexpected filter tag request. Requested UMI but next level is CELL.
at com.milaboratory.mixcr.cli.Main.registerExceptionHandlers$lambda-12(SourceFile:395)
at picocli.CommandLine.execute(CommandLine.java:2088)
at com.milaboratory.mixcr.cli.CommandAnalyze$Cmd$PlanBuilder.executeSteps(SourceFile:463)
at com.milaboratory.mixcr.cli.CommandAnalyze$Cmd.run0(SourceFile:421)
at com.milaboratory.mixcr.cli.MiXCRCommand.run(SourceFile:37)
at picocli.CommandLine.executeUserObject(CommandLine.java:1939)
at picocli.CommandLine.access$1300(CommandLine.java:145)
at picocli.CommandLine$RunLast.executeUserObjectOfLastSubcommandWithSameParent(CommandLine.java:2358)
at picocli.CommandLine$RunLast.handle(CommandLine.java:2352)
at picocli.CommandLine$RunLast.handle(CommandLine.java:2314)
at picocli.CommandLine$AbstractParseResultHandler.execute(CommandLine.java:2179)
at picocli.CommandLine$RunLast.execute(CommandLine.java:2316)
at com.milaboratory.mixcr.cli.Main.registerLogger$lambda-27(SourceFile:514)
at picocli.CommandLine.execute(CommandLine.java:2078)
at com.milaboratory.mixcr.cli.Main.main(SourceFile:101)
Caused by: java.lang.IllegalArgumentException: Unexpected filter tag request. Requested UMI but next level is CELL.
at com.milaboratory.o.gf$b.getKeyExtractor(SourceFile:957)
at com.milaboratory.mitool.refinement.gfilter.KeyedFilterContext.getKeyExtractor(SourceFile:97)
at com.milaboratory.mitool.refinement.gfilter.GroupFilter.filter(SourceFile:297)
at com.milaboratory.o.gf.c(SourceFile:660)
at com.milaboratory.mixcr.cli.CommandRefineTagsAndSort$Cmd.run1(SourceFile:353)
at com.milaboratory.mixcr.cli.MiXCRCommandWithOutputs.run0(SourceFile:69)
at com.milaboratory.mixcr.cli.MiXCRCommand.run(SourceFile:37)
at picocli.CommandLine.executeUserObject(CommandLine.java:1939)
at picocli.CommandLine.access$1300(CommandLine.java:145)
at picocli.CommandLine$RunLast.executeUserObjectOfLastSubcommandWithSameParent(CommandLine.java:2358)
at picocli.CommandLine$RunLast.handle(CommandLine.java:2352)
at picocli.CommandLine$RunLast.handle(CommandLine.java:2314)
at picocli.CommandLine$AbstractParseResultHandler.execute(CommandLine.java:2179)
at picocli.CommandLine$RunLast.execute(CommandLine.java:2316)
at com.milaboratory.mixcr.cli.Main.registerLogger$lambda-27(SourceFile:514)
at picocli.CommandLine.execute(CommandLine.java:2078)
... 13 more
Thanks a lot and sorry for the trouble! |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 16 replies
-
Hi, Can you share the initial library structure? You mentioned identifying the reads by adapters, so I assume you know the sequence that comes before the CELL barcode in the original reads. I can make a dedicated preset that will not require any preprocessing. Additionally, MiXCR uses partial alignments (those that do not cover the CDR3 region) to build complete clones. The minimal requirement for a clone is that it covers at least the CDR3 region, so the final clonotype table will not include anything that lacks CDR3. It is possible to align such reads and export them as alignments in the early stages of the analysis, but they will not pass clonotype assembly. Also, I am not sure I understand what you mean by "call clonotypes jointly across multiple samples." Could you elaborate on this? If you write to us at [email protected], we can arrange a call to discuss your dataset in detail. Let me know if you are interested. |
Beta Was this translation helpful? Give feedback.
-
Thank you for your response. Sorry for asking about so much stuff at once.
|
Beta Was this translation helpful? Give feedback.
-
Hi,
In this file you will have an alignment for every read with a CELL and UMI barcode sequence. The alignments are not aggregated clones yet, so you can't see the UMI count here. The custom preset by default will produce the clonotype table with a CELL barcode and a UMI count for every assembled clone.
|
Beta Was this translation helpful? Give feedback.
-
Sorry to bump this, but we've gotten some fresh data and I'm running the preset you so nicely shared. I left things chugging, came back a day later expecting to find the entire lot of samples completed, but instead found the second sample stuck printing "Initializing: 100%" over and over again in the barcode correction step rather than moving on to anything. I tried skipping the second sample, and then the third one got stuck in a similar fashion, though I didn't wait a full day this time. We've got more reads this time around, could this somehow be related? Ran like so for transparency:
|
Beta Was this translation helpful? Give feedback.
Hi,
~/.mixcr/presets/
folder.Then run:
In this file you …