For specific questions/problems please email David Redmond at: [email protected]
This project is an implementation of a pipeline for detecting TCR Variable reads from RNAseq data in python
The pipeline needs for the following programs to be installed and the paths :
http://mirrors.vbi.vt.edu/mirrors/ftp.ncbi.nih.gov/blast/executables/release/2.2.15/
The accompanying paths need to be changed in the script cmd_line_tcrvseq.py:
blastallDir="/path/to/blastall/"
lengthScript="/path/to/calc.median.read.length.pl"
Also the user can select their chosen TCR alpha and beta V and C reference databases (we recommend downloading from imgt.org) and enter their locations:
location for FASTA BLAST reference sequences downloadable from imgt.org - (needs to be changed manually)
humanTRAVblast="/path/to/TRAV.human.fa"
humanTRBVblast="/path/to/TRBV.human.fa"
humanTRACblast="/path/to/TRAC.human.fa"
humanTRBCblast="/path/to/TRBC.human.fa"
mouseTRAVblast="/path/to/TRAV.mouse.fa"
mouseTRBVblast="/path/to/TRBV.mouse.fa"
mouseTRACblast="/path/to/TRAC.mouse.fa"
mouseTRBCblast="/path/to/TRBC.mouse.fa"
We recommend running the pipeline on paired end RNA seq data.
The usage is as follows:
python cmd_line_tcrvseq.py --fastq1 FASTQ1 --fastq2 FASTQ2 --species human/mouse --outdir OUTPUT DIRECTORY --label OUTPUT LABEL
Also running:
Will give command line options