-
Notifications
You must be signed in to change notification settings - Fork 16
Data Analysis Modules: panoply_omicsev
wcorinne edited this page Aug 25, 2025
·
3 revisions
A tool for large scale omics data tables evaluation. Find full OmicsEV documentation here.
General mandatory inputs:
-
STANDALONE
(String): Either"true"
or"false"
. Determines whether OmicsEV should run as part ofpanoply_main
pipeline with inputs frompanoply_harmonize
, or if OmicsEV should run independently with .gct inputs. Preference is to run withSTANDALONE == "false"
if possible. -
yaml_file
(File): Parameters yaml file from PANOPLY setup. Must contain gene.id.col, groups information, an defaults for OmicsEV parameters. -
label
(String): Label for this data set (e.g. "BRCA.proteome"). HTML report will include this label in the file name.
Mandatory inputs if STANDALONE = "false"
:
-
panoply_harmonize_tar_file
(File): Tar output from panoply_harmonize module. -
ome_type
(String): The PANOPLY omics type of the input (e.g."proteome"
).
Mandatory inputs if STANDALONE = "true"
:
-
data_files
(Array[File]): Array of .gct files to be used for OmicsEV. These are typically all the same -ome from the same samples with some variation in how abundance values were calculated. For example, multiple proteome files generated using different batch correction methods. Array[File] format is["/path/to/file-1.gct","/path/to/file-2.gct","/path/to/file-3.gct"]
. Make sure there is no space between each file, just a single comma. A single data file input is acceptable (e.g.["/path/to/file-1.gct"]
). -
sample_anno_file
(File): A .csv file with sample annotations. File must include same samples as are indata_files
. This file must have columns labeled withclass_column_name
andbatch_column_name
(or their PANOPLY defaults, see below).
General optional inputs:
-
class_column_name
(String): Column in sample annotation file that contains class information. This should be a categorical annotation (e.g. sex, cancer type, etc.) to assess differences between phenotypes. The default in the PANOPLY yaml file is"Type"
. Ifclass_column_name
is not found in the sample annotation file, the default behavior is to include all samples in an artificial class called "Default". -
batch_column_name
(String): Column in sample annotation file that contains batch information. This is used to assess batch effects. Must be integer values with no NA's. IfSTANDALONE = true
, this can also be a column in the column-description of each .gct file indata_files
. The default in the PANOPLY yaml file is"Experiment"
, which is found in outputs from SpectrumMill. Ifbatch_column_name
is not found in the sample annotation file, the default behavior is to include all samples in an artificial batch (batch = 1). -
data_log_transformed
(Boolean): Indicates whether the protein-level data has been log transformed. -
rna_log_transformed
(Boolean): Indicates whether the rna data has been log transformed. -
do_function_prediction
(Boolean): Whether or not to perform gene function prediction analysis. If this is not necessary, it is suggested to turn this off. Gene function prediction takes a considerable amount of time. Default in PANOPLY yaml file isfalse
.
Optional inputs if STANDALONE = true
:
-
rna_file
(File): A .gct file of RNAseq data. Used only for RNA-protein correlation at the end of OmicsEV.
-
report
(File): An HTML report of OmicsEV evaluation metrics for each dataset. -
outputs
(File): A tarball with the full OmicsEV outputs, including intermediary tables and figures.
- Home
- PANOPLY Tutorial
- Data Preparation Modules
-
Data Analysis Modules
- panoply_association
- panoply_blacksheep
- panoply_clumps_ptm_diffexp
- panoply_clumps_ptm
- panoply_clumps_ptm_postprocess
- panoply_cmap_analysis
- panoply_cna_correlation
- panoply_cons_clust
- panoply_immune_analysis
- panoply_metaboanalyst
- panoply_mimp
- panoply_nmf
- panoply_nmf_postprocess
- panoply_omicsev
- panoply_quilts
- panoply_rna_protein_correlation
- panoply_sankey
- panoply_ssgsea
-
Report Modules
- panoply_association_report
- panoply_blacksheep_report
- panoply_clumps_ptm_report
- panoply_cna_correlation_report
- panoply_cons_clust_report
- panoply_immune_analysis_report
- panoply_metaboanalyst_report
- panoply_mimp_report
- panoply_nmf_report
- panoply_normalize_ms_data_report
- panoply_rna_protein_correlation_report
- panoply_sampleqc_report
- panoply_sankey_report
- panoply_ssgsea_report
- Support Modules
- Navigating Results
- PANOPLY without Terra
- Customizing PANOPLY
-
Workflows
- panoply_association_workflow
- panoply_blacksheep_workflow
- panoply_clumps_ptm_workflow
- panoply_immune_analysis_workflow
- panoply_metaboanalyst_workflow
- panoply_nmf_workflow
- panoply_nmf_internal_workflow
- panoply_normalize_filter_workflow
- panoply_process_SM_table
- panoply_sankey_workflow
- panoply_ssgsea_workflow
- Pipelines