AnnotateVariants

Phillip Richmond ([email protected])

This is a pipeline for variant annotation in the diagnosis of rare genetic disorders. It relies on open source data and has instructions for software installs.

Current Version

Overview

Pipeline Summary & Diagram
Set-up
- Prepare Datasets and databases
- Install Necessary Tools
Run Test
Run Sample

Pipeline Summary And Diagram

This pipeline was designed by Phillip Richmond in order to analyze & prioritize variants in rare genetic disease cases. Currently, the pipeline uses the following list of software in order to accomplish this task, much to the thanks of tools produced and maintained by the lab of Aaron Quinlan:

GEMINI
VCFAnno
VCF2DB

Furthermore, this pipeline utilizes open source datasets within it's annotation framework, including:

CADD
gnomAD
OMIM*
ClinVar
UCSC RefGene
Entrez Gene Summary
HPO Term Mapping
MeSHOPs
pLI
RVIS
FATHMM-XF
Eigen
FunSeq2
Platinum Genomes ConfidentRegions
UCSC Segmental Duplications

* OMIM requires a license for use of the API/downloadable databases, which must be applied for through their website.

Currently, the pipeline is hard coded for a specific cluster that uses the Torque-Moab scheduler. However, I will expand upon this to include other schedulers such as SLURM. Also, generalizing for software install locations, or developing a single install-script via bioconda will also be performed later in 2018.

Set-up

THERE IS A LOT OF SETTING UP TO DO!

NOTE - (2018-09-10) Updates to this are coming. We will have a conda install environment, and a unified acquisition script for the databases.

But, once you get set up, then things run nice and smooth.

Install Necessary software, details in InstallTools.sh

bgzip and tabix
vt
vcftools/bcftools
snpEff
vcfanno
vcf2db
gemini
MToolBox
In-house Scripts

Prepare Third-party Datasets/databases, there are a few scripts to help do this within here:

All the gemini databases
Polyphen2 https://github.com/quinlan-lab/pathoscore/blob/master/score-sets/GRCh37/polyphen2/make.sh
CADD http://krishna.gs.washington.edu/download/CADD/v1.3/whole_genome_SNVs.tsv.gz
ReMM http://remm.visze.de/files/ReMM.v0.3.1.tsv.gz
gnomAD http://gnomad.broadinstitute.org/downloads

Run Test

You can run a test analysis by following the instructions within the Test directory.

Run Sample

Known Issues

There is no inheritance model for de-novo + compound het (GEMINI Limitation). E.g. where a de novo is the second variant of a compound het pair
There is no guarantee/unit tests for deletion + snv (old pipeline hemizygous?)
Currently hard-coded, and not system agnostic. As the pipeline stands it needs a new version with configuration files before it can be ported to a new system (evident with my hard-coded paths).

Improvements

GEMINI ROH - Needs to wait on Brent Pedersen to fix
Better management of duo and singleton analyses. For now, they just can be mined from the General Damaging tab
Better utilization of duo-affected pairs

The Future

TIDEX-Tool (jacques & alice) for better analysis

Additional Contributors

Bhavi Modi, Robin van der Lee, and Solenne Correard are contributors to this project development within the Wasserman lab at BCCHR/CMMT in Vancouver, BC, Canada.

Name		Name	Last commit message	Last commit date
Latest commit History 194 Commits
Cloud		Cloud
Figures		Figures
GeminiQueryScripts		GeminiQueryScripts
GetThirdPartyDatabases		GetThirdPartyDatabases
Install		Install
MToolBox_config_files		MToolBox_config_files
Manuscripts		Manuscripts
PipelineScripts		PipelineScripts
PipelineTemplates		PipelineTemplates
RNAseq		RNAseq
SVMergingAndFiltering		SVMergingAndFiltering
TableAnnotators		TableAnnotators
TableTemplates		TableTemplates
TemplateScripts		TemplateScripts
Test		Test
VCFAnno		VCFAnno
.gitignore		.gitignore
AnnotateVariants_CondaEnv.yml		AnnotateVariants_CondaEnv.yml
CreateAltPositionGnomAD.py		CreateAltPositionGnomAD.py
CreateNewAnnotations.sh		CreateNewAnnotations.sh
GeminiTable2CVL.py		GeminiTable2CVL.py
LICENSE		LICENSE
PrepareRefGenomes.sh		PrepareRefGenomes.sh
README.md		README.md
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AnnotateVariants

Phillip Richmond ([email protected])

Current Version

Overview

Pipeline Summary And Diagram

Set-up

Run Test

Run Sample

Known Issues

Improvements

The Future

Additional Contributors

About

Releases 2

Packages

Contributors 2

Languages

License

Phillip-a-richmond/AnnotateVariants

Folders and files

Latest commit

History

Repository files navigation

AnnotateVariants

Phillip Richmond ([email protected])

Current Version

Overview

Pipeline Summary And Diagram

Set-up

Run Test

Run Sample

Known Issues

Improvements

The Future

Additional Contributors

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages