- 
                Notifications
    
You must be signed in to change notification settings  - Fork 16
 
Workflows: panoply_clumps_ptm_workflow
Workflow adaptation of the Clumps-PTM tool, which is a top-down spatial-proteomics analysis tool that identifies proteins with nearby-clusters of differentially regulated PTM-sites. The workflow first generates differential-expression results for every feature and maps PTM variable-sites to atomic coordinates, after which it performs Clumps-PTM analysis to identify proteins with clusters of differentially-regulated sites. After analysis is completed the panoply_clumps_ptm_postprocess generates summary figures and PyMol protein-structures, and panoply_clumps_ptm_report generates an interactive report.
This workflow is time-intensive and can become prohibitively expensive if too many annotations are chosen; it is recommended to select relatively few annotations when providing a groupsFile. Additionally, once panoply_mapping has been run once for a given dataset, it can be skipped in future runs by directly providing the output file in mapping_file (assuming the features' accession-numbers and variable-sites IDs have not changed).
| module | description | 
|---|---|
panoply_clumps_ptm_diffexp | 
performs differential-expression analysis on provided PTM datasets for chosen annotations | 
panoply_clumps_ptm_mapping | 
maps PTM sites to atomic coordinates | 
panoply_clumps_ptm | 
runs ClumpsPTM algorithm | 
panoply_clumps_ptm_postprocess | 
generates figures for ClumpsPTM analysis results | 
panoply_clumps_ptm_report | 
creates an interactive R Markdown report of the clumps_ptm results | 
- 
pSTY_gct: (.gctfile) phosphoproteome data matrix - 
acK_gct: (.gctfile) acetylome data matrix - 
ubK_gct: (.gctfile) ubiquitylome data matrix - 
groupsFile: (.csvfile) annotation file, subsetted to annotations of interest for this analysis - 
output_prefix: (String, default="results") prefix used to name the output tar file - 
yaml_file: (.yamlfile) master-parameters.yaml 
- 
PDB_ref_bucket: (String, default="gs://fc-385e9b4e-43ff-44b3-8cf7-036a2a96d102/pdbs_2025_tars/") Google-Cloud Bucket containing a tarred copy of the PDB structural archive (i.e.https://files.wwpdb.org/pub/pdb/data/structures/divided/pdb/) - 
UNIPROT_SWISSPROT: (File, default="gs://fc-385e9b4e-43ff-44b3-8cf7-036a2a96d102/reference_files/uniprot_sprot.fasta") Reference FASTA file with all relevant UNIPROT sequences, to which your sequences will be BLASTed. - 
SIFTS_DB: (File, default="gs://fc-385e9b4e-43ff-44b3-8cf7-036a2a96d102/reference_files/pdb_chain_uniprot.tsv") SIFTS database containing mapping between UNIPROT IDs and PDB IDs. 
- 
accession_col: (String, default="description") GCT rdesc column with protein accession IDs; must use the same ID type as the providedFASTA_ref_filefile. - 
variable_sites_col: (String, default="variableSites") GCT rdesc column with PTM variable site(s) (e.g. 'T527t') - 
mapping_file: (.tsvfile) output file with mapping results from panoply_mapping. If provided directly,panoply_mappingwill be skipped; useful when re-analyzing a dataset that has previously been mapped. - 
mapping_params: (.yamlfile) parameters file from panoply_clumps_ptm_mapping, containing parameters used for PTM-mapping; allows mapping parameters to be listed in the report. 
panoply_clumps_ptm_workflow produces the follow outputs:
- 
clumps_ptm_tar: (.tar) An output tar file that contains the clumps_ptm analysis results - 
clumps_ptm_report: (.htmlfile) Interactive R Markown report. 
- Home
 - PANOPLY Tutorial
 - Data Preparation Modules
 - 
Data Analysis Modules
- panoply_association
 - panoply_blacksheep
 - panoply_clumps_ptm_diffexp
 - panoply_clumps_ptm
 - panoply_clumps_ptm_postprocess
 - panoply_cmap_analysis
 - panoply_cna_correlation
 - panoply_cons_clust
 - panoply_immune_analysis
 - panoply_metaboanalyst
 - panoply_mimp
 - panoply_nmf
 - panoply_nmf_postprocess
 - panoply_omicsev
 - panoply_quilts
 - panoply_rna_protein_correlation
 - panoply_sankey
 - panoply_ssgsea
 
 - 
Report Modules
- panoply_association_report
 - panoply_blacksheep_report
 - panoply_clumps_ptm_report
 - panoply_cna_correlation_report
 - panoply_cons_clust_report
 - panoply_immune_analysis_report
 - panoply_metaboanalyst_report
 - panoply_mimp_report
 - panoply_nmf_report
 - panoply_normalize_ms_data_report
 - panoply_rna_protein_correlation_report
 - panoply_sampleqc_report
 - panoply_sankey_report
 - panoply_ssgsea_report
 
 - Support Modules
 - Navigating Results
 - PANOPLY without Terra
 - Customizing PANOPLY
 - 
Workflows
- panoply_association_workflow
 - panoply_blacksheep_workflow
 - panoply_clumps_ptm_workflow
 - panoply_immune_analysis_workflow
 - panoply_metaboanalyst_workflow
 - panoply_nmf_workflow
 - panoply_nmf_internal_workflow
 - panoply_normalize_filter_workflow
 - panoply_process_SM_table
 - panoply_sankey_workflow
 - panoply_ssgsea_workflow
 
 - Pipelines