nextflow_pharmacogenomics

Pipeline for Mitochondrial short variant discovery based on GATK best practices and nf-core pipeline rarediseases.

Overview

This document describes the usage of a Nextflow workflow designed for running mitochondrial analysis. This workflow is configured to be run with Docker,Singularity containers, ensuring reproducibility across different computing environments.

Prerequisites

Nextflow >=23.04.0'
Singularity

The pipeline uses Igv-reports to generate reports. Container image is not provided!

Installation

Clone this repository:

git clone this repository URL
cd   nextflow_mitochondrial

Running the Workflow

To successfully run the workflow, different input parameters are required:

nextflow \
run main.nf \
-profile singularity \
--input assets/samplesheet.csv \
--dict /path/to/reference/dictionary/hsapiens.GRCh38.hl.dict \
--fasta /path/to/reference/fasta/hsapiens.GRCh38.hl.fasta \
--fasta_fai /path/to/reference/index/hsapiens.GRCh38.hl.fasta.fai \
--bwamem2 /path/to/normal/bwamem2/ \
--bwamem2mt /path/to/mitochondrial/bwamem2mt/ \
--shift_fasta /path/to/shift/reference/scratch_shift.fasta \
--shift_dict /path/to/shift/dictionary/scratch_shift.dict \
--shift_fai /path/to/shift/interval/scratch_shift.fasta.fai \
--shift_backchain /path/to/backchain/scratch_shift.back_chain \
--snpsift_gnomad /path/to/gnomad_db/chrM_gnomad.genomes.r3.1.silentFixed.vcf.gz \
--snpsift_gnomad_tbi /path/to/gnomad_db/chrM_gnomad.genomes.r3.1.silentFixed.vcf.gz.tbi \
--snpsift_mitomap_disease /path/to/mitomap/chrM_MitoMap_disease.vcf.gz \
--snpsift_mitomap_disease_tbi /path/to/mitomap/chrM_MitoMap_disease.vcf.gz.tbi \
--snpsift_mitomap_polymorphism /path/to/mitomap/chrM_MitoMap_polymorphisms.vcf.gz \
--snpsift_mitomap_polymorphism_tbi /path/to/mitomap/chrM_MitoMap_polymorphisms.vcf.gz.tbi \
--snpsift_mitomap_tip /path/to/mitomap/chrM_MitoTIP.vcf.gz \
--snpsift_mitomap_tip_tbi /path/to/mitomap/chrM_MitoTIP.vcf.gz.tbi \
--igvreport_ideogram /path/to/mitomap/IGV-REPORTS/GRCh38_ideogram.bed \
--blacklist /path/to/blacklist/blacklist_sites.hg38.chrM.bed \
--blacklist_idx /path/to/blacklist/blacklist_sites.hg38.chrM.bed.idx
--outdir results

Input Files

The format of the input file (samplesheet.csv) should contain the following header:

sample: name of the experiment (patient1,patient2...).
lane: squencing lane (1,2,3..).
cram: path to aligned cram file.
crai: path to aligned cram index file.
sex: 1=male; 2=female; other=unknown.
phenotype: Patient status, 0 = missing; 1=unaffected; 2=affected.
paternal_id: Sample id of the father.
maternal_id: Sample id of the mother.
case_id: Only when when generating a family VCF.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
bin		bin
conf		conf
modules		modules
subworkflows		subworkflows
workflows		workflows
.gitignore		.gitignore
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nextflow_pharmacogenomics

Overview

Prerequisites

Installation

Running the Workflow

Input Files

About

Releases

Packages

Languages

LlaneroHiboreo/nextflow_mitochondrial

Folders and files

Latest commit

History

Repository files navigation

nextflow_pharmacogenomics

Overview

Prerequisites

Installation

Running the Workflow

Input Files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages