GitHub - nik-shvetsov/pachy-dnameth: DNA methilation pre-processing pipeline implementation for Pachyderm, inspired by Christian Page

This document describes DNA-methilation data preprocessing pipeline used in UiT. It is inspired by DNAm scripts, written by Christian Page. Link for the whole pipeline and docker image repo: https://github.com/nsh23/pachy-dnameth

Whole pipeline is in R and consists of 7 steps:

Load dataset - load RGSet and samplesheet
Clean data - remove ghost and cross-hybrid probes
BMIQ normalization, background correction and cell counts estimation
CNV calculation based on algorithm implementation in CopyNumber450k package
SVA - factor and variable estimation
Quality control of clean data
Gene annotation to CpG sites

The following graph describes the processing flow in a pipeline and step dependencies:

Requirements: R, minfi, CopyNumber450k, IlluminaHumanMethylationEPICanno.ilm10b2.hg19, IlluminaHumanMethylationEPICmanifest, wateRmelon, RPMM, parallel, ExperimentHub, FlowSorted.Blood.EPIC, FlowSorted.Blood.450k, sva, DNAcopy, meffil
Additional notes: The pipeline was exported to pachyderm framework and tested in HUNT cloud. It took ~4 hours for a Torino_2017 NOWAC dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
img		img
scripts		scripts
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
dnameth.py		dnameth.py
install.R		install.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

License

nik-shvetsov/pachy-dnameth

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages