Skip to content

ProGenNo/py-pgatk-UiB-fork

 
 

Repository files navigation

Python tools for ProteoGenomics Analysis Toolkit

Python application install with bioconda Codacy Badge PyPI version PyPI - Downloads

pypgatk is a Python library part of the ProteoGenomics Analysis Toolkit. It provides different bioinformatics tools for proteogenomics data analysis.

This is a fork made for the purposes of a proteogenomics project at University of Bergen

Requirements:

  • pip: you will require Python3 and pip3 installed.
  • Source code: to use and install from the source code directly, you will need to have git, Python3 and pip.

Installation

Use latest source code

Alternatively, for the latest version, clone this repo and go into its directory, then execute pip3 install . :

git clone https://github.com/ProGenNo/py-pgatk-UiB-fork.git
cd py-pgatk-UiB-fork
# you might want to create a virtualenv for pypgatk before installing
python3 -m pip install .

Usage

The pypgatk design combines multiple modules and tools into one framework. All the possible commands are accessible using the commandline tool pypgatk_cli.py.

$: pypgatk_cli.py -h
Usage: pypgatk_cli.py [OPTIONS] COMMAND [ARGS]...

  This is the main tool that give access to all commands and options
  provided by the pypgatk

Options:
  -h, --help  Show this message and exit.

Commands:
  cbioportal-downloader     Command to download the the cbioportal studies
  cbioportal-to-proteindb  Command to translate cbioportal mutation data into
                           proteindb
  cosmic-downloader        Command to download the cosmic mutation database
  cosmic-to-proteindb      Command to translate Cosmic mutation data into
                           proteindb
  dnaseq-to-proteindb      Generate peptides based on DNA sequences
  ensembl-downloader       Command to download the ensembl information
  generate-decoy           Create decoy protein sequences. Each protein is
                           reversed and the cleavage sites switched with
                           preceding amino acid. Peptides are checked for
                           existence in target sequences if foundthe tool will
                           attempt to shuffle them. [email protected]
                           2015
  threeframe-translation   Command to perform 3frame translation
  vcf-to-proteindb         Generate peptides based on DNA variants from
                           ENSEMBL VEP VCF files

The library provides multiple commands to download, translate and generate protein sequence databases from reference and mutation genome databases.

Full Documentation

https://pgatk.readthedocs.io/en/latest/pypgatk.html

Cite as

Yasset Perez-Riverol, & Husen M. Umer. (2021, February 3). bigbio/pypgatk: Pre-release v0.0.9 (Version v0.0.9). Zenodo. DOI

About

Python tools for proteogenomics analysis toolkit

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 99.6%
  • Dockerfile 0.4%