Skip to content
Jakub Vasicek edited this page Jul 8, 2024 · 2 revisions

Welcome to the ProHap wiki! Here, I will demonstrate how to use ProHap to generate databases of protein sequences, and how to annotate and interpret the peptide identifications after using this database with a search engine.

This repository contains a Snakemake pipeline integrating two highly similar tools:

  • ProHap: The main tool aimed at creating the databases of human protein haplotype sequences (using observed combinations of alleles in each transcript) from phased genotype data sets
  • ProVar: Considers each allele independently, useful for adding rare variation into the resulting sequence database
Clone this wiki locally