Skip to content

Commit

Permalink
Update 00_intro_to_variant_calling.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Gammerdinger authored Dec 13, 2023
1 parent e9d761a commit 1f72983
Showing 1 changed file with 8 additions and 0 deletions.
8 changes: 8 additions & 0 deletions lessons/00_intro_to_variant_calling.md
Original file line number Diff line number Diff line change
Expand Up @@ -91,13 +91,21 @@ The workflow for variant analysis that many researchers follow looks like:
</p>

**Sequence Samples** - This is the work of your sequencing facility and they should provide you with FASTQ files.

**Read Quality Control** - It is important to assess the quality of your reads before you further analysis them. In this step we are looking to ensure that there aren't major errors that occurred in the sequencing and that the quality of the reads is high.

**Align Reads** - We need to match our raw FASTQ reads to the place in the genome where they most likely originiated from.

**Processing Alignment Files** - While our reads have now been asssigned to a place in the genome, the files holding this information are not in the format that we need in order to call variants. We will be reformatting the alignment files in order to be useable for variant calling.

**Alignment Quality Control** - This is a second opportunity to evaluate the quality of our data. Here we are ensuring that the data aligns well with our genome.

**Call Variants** - Now we are able to call the raw variants from our data.

**Filter Variants** - Our raw variants may have some artifacts in them and need to be processed to remove them.

**Annotate Variants** - We would be interested to know where our variants fall within the context of current gene models. Here we will be using existing gene models to predict the impact of a variant. Here we will be asking questions such as, is this a variant that creates a premature stop codon or is it in a intergenic region?

**Prioritize Variants** - Assess the annotated variants based upon their predicted functional impacts and narrow the search for important genes that may be causing a disease or trait.

[Next Lesson >>]()
Expand Down

0 comments on commit 1f72983

Please sign in to comment.