preprocessing_RNAseq_single.smk¶
Snakemake workflow for preprocessing single-end bulk RNA-seq data.
Note
Please make sure that you have Singularity and Snakemake installed on your system and cloned the SnakeNgs repository.
Workflow¶
The rulegraph was created by snakevision.
- Quality control using fastp with the default parameters.
- Alignment using STAR with the parameter
--outFilterMultimapNmax 1
. - Convert the SAM file to BAM file and sort using samtools.
- Collect metrics using Picard
CollectRnaSeqMetrics
. - Make bigWig files using deepTools
bamCoverage
. - Make summary statistics using MultiQC.
Usage¶
1 2 3 4 5 |
|
config.yaml
should contain the following information:
1 2 3 4 |
|
path/to/output
should containfastq
directory with the following structure:
1 2 3 4 5 |
|
-
path/to/star_index
is the directory containing the STAR index. -
/path/to/reference_transcriptome.gtf
is the reference transcriptome in GTF format (e.g.Homo_sapiens.GRCh38.106.gtf
for human transcriptome).
Please refer to the tutorial for more information.