preprocessing_ChIPseq.smk¶
Snakemake workflow for preprocessing paired-end ChIP-seq/ATAC-seq data.
Note
Please make sure that you have Singularity and Snakemake installed on your system and cloned the SnakeNgs repository.
Workflow¶
The rulegraph was created by snakevision.
- Quality control using fastp with the default parameters.
- Alignment using Bowtie2 with the parameter specified in the
config.yaml. - Convert the SAM file to BAM file and sort using samtools.
- Remove duplicates using Picard
MarkDuplicateswith the parameter--REMOVE_DUPLICATES true. - Collect metrics using Picard
CollectInsertSizeMetrics. - Make fingerprint plots using deepTools
plotFingerprintwith the parameter--minMappingQuality 30 --skipZeros --numberOfSamples 50000 --binSize 10000. - Make bigWig files using deepTools
bamCoveragewith the parameter--binSize 1 --normalizeUsing CPM. - Make summary statistics using MultiQC.
Usage¶
1 2 3 4 5 | |
config.yaml should contain the following information:
1 2 3 4 | |
path/to/outputshould containfastqdirectory with the following structure:
1 2 3 4 5 6 7 8 | |
-
path/to/bowtie2_indexis the directory containing the Bowtie2 index. -
bowtie2_argsis the optional argument for Bowtie2 (e.g.,--very-sensitive).
Please refer to the tutorial for more information.