-
Notifications
You must be signed in to change notification settings - Fork 16
Workflows: panoply_clumps_ptm_workflow
Workflow adaptation of the Clumps-PTM tool, which is a top-down spatial-proteomics analysis tool that identifies proteins with nearby-clusters of differentially regulated PTM-sites. The workflow first generates differential-expression results for every feature and maps PTM variable-sites to atomic coordinates, after which it performs Clumps-PTM analysis to identify proteins with clusters of differentially-regulated sites. After analysis is completed the panoply_clumps_ptm_postprocess generates summary figures and PyMol protein-structures, and panoply_clumps_ptm_report generates an interactive report.
This workflow is time-intensive and can become prohibitively expensive if too many annotations are chosen; it is recommended to select relatively few annotations when providing a groupsFile. Additionally, once panoply_mapping has been run once for a given dataset, it can be skipped in future runs by directly providing the output file in mapping_file (assuming the features' accession-numbers and variable-sites IDs have not changed).
| module | description |
|---|---|
panoply_clumps_ptm_diffexp |
performs differential-expression analysis on provided PTM datasets for chosen annotations |
panoply_clumps_ptm_mapping |
maps PTM sites to atomic coordinates |
panoply_clumps_ptm |
runs ClumpsPTM algorithm |
panoply_clumps_ptm_postprocess |
generates figures for ClumpsPTM analysis results |
panoply_clumps_ptm_report |
creates an interactive R Markdown report of the clumps_ptm results |
-
pSTY_gct: (.gctfile) phosphoproteome data matrix -
acK_gct: (.gctfile) acetylome data matrix -
ubK_gct: (.gctfile) ubiquitylome data matrix -
groupsFile: (.csvfile) annotation file, subsetted to annotations of interest for this analysis -
output_prefix: (String, default="results") prefix used to name the output tar file -
yaml_file: (.yamlfile) master-parameters.yaml
-
PDB_ref_bucket: (String, default="gs://fc-385e9b4e-43ff-44b3-8cf7-036a2a96d102/pdbs_2025_tars/") Google-Cloud Bucket containing a tarred copy of the PDB structural archive (i.e.https://files.wwpdb.org/pub/pdb/data/structures/divided/pdb/) -
UNIPROT_SWISSPROT: (File, default="gs://fc-385e9b4e-43ff-44b3-8cf7-036a2a96d102/reference_files/uniprot_sprot.fasta") Reference FASTA file with all relevant UNIPROT sequences, to which your sequences will be BLASTed. -
SIFTS_DB: (File, default="gs://fc-385e9b4e-43ff-44b3-8cf7-036a2a96d102/reference_files/pdb_chain_uniprot.tsv") SIFTS database containing mapping between UNIPROT IDs and PDB IDs.
-
accession_col: (String, default="description") GCT rdesc column with protein accession IDs; must use the same ID type as the providedFASTA_ref_filefile. -
variable_sites_col: (String, default="variableSites") GCT rdesc column with PTM variable site(s) (e.g. 'T527t') -
mapping_file: (.tsvfile) output file with mapping results from panoply_mapping. If provided directly,panoply_mappingwill be skipped; useful when re-analyzing a dataset that has previously been mapped. -
mapping_params: (.yamlfile) parameters file from panoply_clumps_ptm_mapping, containing parameters used for PTM-mapping; allows mapping parameters to be listed in the report.
panoply_clumps_ptm_workflow produces the follow outputs:
-
clumps_ptm_tar: (.tar) An output tar file that contains the clumps_ptm analysis results -
clumps_ptm_report: (.htmlfile) Interactive R Markown report.
- Home
- PANOPLY Tutorial
- Data Preparation Modules
-
Data Analysis Modules
- panoply_association
- panoply_blacksheep
- panoply_clumps_ptm_diffexp
- panoply_clumps_ptm
- panoply_clumps_ptm_postprocess
- panoply_cmap_analysis
- panoply_cna_correlation
- panoply_cons_clust
- panoply_immune_analysis
- panoply_metaboanalyst
- panoply_mimp
- panoply_nmf
- panoply_nmf_postprocess
- panoply_omicsev
- panoply_quilts
- panoply_rna_protein_correlation
- panoply_sankey
- panoply_ssgsea
-
Report Modules
- panoply_association_report
- panoply_blacksheep_report
- panoply_clumps_ptm_report
- panoply_cna_correlation_report
- panoply_cons_clust_report
- panoply_immune_analysis_report
- panoply_metaboanalyst_report
- panoply_mimp_report
- panoply_nmf_report
- panoply_normalize_ms_data_report
- panoply_rna_protein_correlation_report
- panoply_sampleqc_report
- panoply_sankey_report
- panoply_ssgsea_report
- Support Modules
- Navigating Results
- PANOPLY without Terra
- Customizing PANOPLY
-
Workflows
- panoply_association_workflow
- panoply_blacksheep_workflow
- panoply_clumps_ptm_workflow
- panoply_immune_analysis_workflow
- panoply_metaboanalyst_workflow
- panoply_nmf_workflow
- panoply_nmf_internal_workflow
- panoply_normalize_filter_workflow
- panoply_process_SM_table
- panoply_sankey_workflow
- panoply_ssgsea_workflow
- Pipelines