Skip to content

Analyses supporting the Medical Genome Reference Bank manuscript

Notifications You must be signed in to change notification settings

mpinese/mgrb-manuscript

Repository files navigation

Code supporting the MGRB manuscript

This repo contains partial code supporting the paper "The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly" by Pinese et al (https://doi.org/10.1038/s41467-019-14079-0).

The code is heavily customised to its execution platform, and uses restricted data files as input. As such, it won't run out of the box, and this repo serves more as a reference for the analytical steps taken for this paper, as well as a space to store additional result and intermediate files which might be of use to the genomics community.

Structure

Directory Contents
processing/ low-level pipeline & processing code
manuscript/ analyses specific to the manuscript. This could do with some cleaning up; currently there is work included which was not included in the final form of the paper.
extra_results/ additional intermediate files which may be of use / interest.
extra_results/genome_tiers/ contains three BED files defining genome confidence tiers on GRCh37. These tiers were useful in the MGRB work to separate loci which were well-genotyped on the Illumina WGS platform used vs those with more dubious data quality. These intermediate files are released here in the hope they will be useful to other researchers working with Illumina WGS data (especially TruSeq Nano).

About

Analyses supporting the Medical Genome Reference Bank manuscript

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published