Skip to content

preprocessing_HITSCLIP.smk

Snakemake workflow for preprocessing HITSCLIP data.

Note

Please make sure that you have Singularity and Snakemake installed on your system and cloned the SnakeNgs repository.

Workflow

preprocessing_HITSCLIP.smk rulegraph

The rulegraph was created by snakevision.

  1. Quality control using fastp with the parameter -l 20 -3 --trim_front1 5.
  2. Alignment using STAR with the parameter --outFilterMultimapNmax 100.
  3. Convert the SAM file to BAM file and sort using samtools.
  4. Remove duplicates using Picard MarkDuplicates with the parameter --REMOVE_DUPLICATES true.
  5. Make bigWig files using deepTools bamCoverage with the parameter --binSize 1.
  6. Make summary statistics using MultiQC.

Usage

1
2
3
4
5
snakemake -s /path/to/SnakeNgs/snakefile/preprocessing_HITSCLIP.smk \
--configfile /path/to/config.yaml \
--cores <int> \
--use-singularity \
--rerun-incomplete

config.yaml should contain the following information:

1
2
3
workdir: /path/to/output
samples: ["SRRXXXXXX", "SRRYYYYYY", "SRRZZZZZZ"]
star_index: /path/to/star_index
  • path/to/output should contain fastq directory with the following structure:
1
2
3
4
5
output/
└── fastq
    ├── SRRXXXXXX.fastq.gz
    ├── SRRYYYYYY.fastq.gz
    └── SRRZZZZZZ.fastq.gz
  • path/to/star_index is the directory containing the STAR index.

Docker image used in the workflow