pypgatk is a Python library part of the ProteoGenomics Analysis Toolkit. It provides different bioinformatics tools for proteogenomics data analysis.
This is a fork made for the purposes of a proteogenomics project at University of Bergen
- pip: you will require Python3 and pip3 installed.
- Source code: to use and install from the source code directly, you will need to have git, Python3 and pip.
Alternatively, for the latest version, clone this repo and go into its directory, then execute pip3 install .
:
git clone https://github.com/ProGenNo/py-pgatk-UiB-fork.git
cd py-pgatk-UiB-fork
# you might want to create a virtualenv for pypgatk before installing
python3 -m pip install .
The pypgatk design combines multiple modules and tools into one framework. All the possible commands are accessible using the commandline tool pypgatk_cli.py
.
$: pypgatk_cli.py -h
Usage: pypgatk_cli.py [OPTIONS] COMMAND [ARGS]...
This is the main tool that give access to all commands and options
provided by the pypgatk
Options:
-h, --help Show this message and exit.
Commands:
cbioportal-downloader Command to download the the cbioportal studies
cbioportal-to-proteindb Command to translate cbioportal mutation data into
proteindb
cosmic-downloader Command to download the cosmic mutation database
cosmic-to-proteindb Command to translate Cosmic mutation data into
proteindb
dnaseq-to-proteindb Generate peptides based on DNA sequences
ensembl-downloader Command to download the ensembl information
generate-decoy Create decoy protein sequences. Each protein is
reversed and the cleavage sites switched with
preceding amino acid. Peptides are checked for
existence in target sequences if foundthe tool will
attempt to shuffle them. [email protected]
2015
threeframe-translation Command to perform 3frame translation
vcf-to-proteindb Generate peptides based on DNA variants from
ENSEMBL VEP VCF files
The library provides multiple commands to download, translate and generate protein sequence databases from reference and mutation genome databases.
https://pgatk.readthedocs.io/en/latest/pypgatk.html
Yasset Perez-Riverol, & Husen M. Umer. (2021, February 3). bigbio/pypgatk: Pre-release v0.0.9 (Version v0.0.9). Zenodo.