Link Search Menu Expand Document

Demo data and code

We provide a complete working example of a job configuration file and associated data set for testing and demonstrating your svCapture installation.

Obtain the demo data, scripts, and support files

Working from whatever folder you’d like on a Linux computer, download and unpack the demo archive (file size = 270 MB):

wget https://mdi-demo.wilsonte-umich.io/files/svx-mdi-tools/svCapture-demo2.tar.gz
tar -xzvf svCapture-demo2.tar.gz
rm svCapture-demo2.tar.gz
cd svCapture-demo2

The entire demo will take place in the ‘svCapture-demo2’ directory so you can easily delete it later.

Reads in the FASTQ files were obtained from cell line GM12878 from tagmentation svCapture libraries in which the central 400 kb of the WWOX gene on chr16 was subjected to probe capture. Reads were filtered to include only chr16 and downsampled to 1M read pairs per sample to keep the demo small and fast. One sample (sv_high) was induced to have a higher SV burden than the other sample (sv_low).

Install everything else

Follow the installation instructions to create:

  • a multi-suite MDI installation
  • an alias to the MDI utility called mdi

If you choose a different type of installation or don’t make an alias, please adjust all commands below as needed.

Next, build the required conda runtime environments by following the installation instructions. All required support software is installed in these environments with appropriate versions.

Finally, download the hg38 reference genome into the demo directory by following the installation instructions. If desired, the demo includes a job file, download-hg38.yml, which will properly install the required genome into the demo folder so you may subsequently easily delete it.

mdi download download-hg38.yml --dry-run # check the configuration but don't do anything
mdi download download-hg38.yml

It will take many minutes to complete the download and extraction.

If you install the genome into a different directory, you will need to edit the svCapture job file.

Explore the svCapture job configuration file

cat svCapture-demo2.yml
mdi inspect svCapture-demo2.yml # check syntax, directories, and report all options

Pipeline options are specified in an extended YAML format that supports variables and option declarations common to multiple pipeline actions. See the file comments for details.

The demo job file is configured to work entirely from your working demo directory - change paths when doing real work, or if you installed the hg38 genome into a different location.

Run the demo pipeline

To execute the demo in the command shell, use:

mdi svCapture svCapture-demo2.yml --dry-run # check the configuration but don't do anything
mdi svCapture svCapture-demo2.yml

To submit the demo to your server cluster job scheduler, use:

mdi submit --dry-run svCapture-demo2.yml
mdi submit svCapture-demo2.yml # add options such as --account if needed on your server

It took us about 10 min total wall time to run the demo.

Depending on your HPC server, you might need to specify additional options, which you can edit into the job file or add at the command line. For example, the following would specify a user account that takes precedence over any value found in the job file.

mdi submit svCapture-demo2.yml --account <userAcount>

Examine the pipeline log

If you ran svCapture as a command above, the log stream will have printed to stdout.

If you submitted the demo to your job scheduler, you can view job status and a log report using commands:

mdi status svCapture-demo2.yml
mdi report -j all svCapture-demo2.yml

This repository file shows a complete log of the demo job sequence as it executed on our server.

Access and visualize the results

The output of the demo pipeline will be in folder ./output.

ls -l output

You may examine the vcf files in the sample folders, etc.

Files samples/samples.svCapture.find.mdi.package.zip and assembled_samples/assembled_samples.svCapture.assemble.mdi.package.zip are data packages ready to be uploaded into the svCapture R Shiny app.

To quickly see the basics, you can view the results of the svCapture demo in our publicly accessible demo server. Login using the passphrase mdi-demo and use the Load from Server button to find the svCapture demo packages and bookmarks.

The best way to run the app server yourself is by installing the MDI Desktop.

Please be aware that the app will be functional but not very informative when looking at the much reduced demo data set.