Stage 1 pipeline

svAmplicon data analysis first entails execution of a pipeline with primary actions:

align = align input fastq files to the reference genome
extract = scan aligned reads to identify amplicon boundaries and nominate unique reads with structural variants (SVs)

Pipeline inputs

The primary inputs to the svAmplicon pipeline are:

genome files as described under Installation
two FASTQ format read files per sample, obtained from a paired-end, short-read sequencing platform.

The svAmplicon pipeline produces alignment bam files and an app data package for interactive visualization, *.mdi.package.zip.

Options for the different pipeline actions can be listed as follows:

mdi svAmplicon <action> --help

The best way to configure pipeline execution is using YAML job files. The source publication provides comlete examples for the reported samples.