This program is released as open source software under the terms of GNU GPL-v3.0 License.
Please refer to our wiki for how to install and use RUMP
RUMP can display usage information on the command line:
$ Nextflow main.nf --help true
N E X T F L O W ~ version 19.01.0
Launching `main.nf` [romantic_celsius] - revision: 9004e52396
Project : /Users/xinsongdu/mnt/projects/RUMP
Git info: null - null [null]
Cmd line: /Users/xinsongdu/.pyenv/shims/Nextflow main.nf --help true
Manifest's pipeline version: 0.0.0
RUMP: A Reproducible Untargeted Metabolomics Data Processing Pipeline - Version: 0.0.0 (20200226)
This pipeline is distributed in the hope that it will be useful
but WITHOUT ANY WARRANTY. See the GNU GPL v3.0 for more details.
Please report comments and bugs to [email protected]
or at https://github.com/lemaslab/RUMP/issues.
Check https://github.com/lemaslab/RUMP for updates, and refer to
https://github.com/lemaslab/RUMP/wiki
Usage:
nextflow run_all.nf [options] -with-docker xinsongdu/lemaslab_rump:v0.0.0
Arguments (it is mandatory to change `input_file` and `mzmine_dir` before running:
----------------------------- common parameters ----------------------------------
--input_dir_pos folder location for positive data, default is 'data/POS'
--input_dir_neg folder location for positive data, default is 'data/NEG'
--POS_design_path location for positive design file, default is 'data/pos_design.csv'
--NEG_design_path location for negative design file, default is 'data/neg_design.csv'
--cutoff cutoff p-value for mummichog pathway analysis, default is 0.05
--unknown_search whether do unknown search for unidentified metabolites or not, default is '0', please set it to '1' when needed
--version whether to show version information or not, default is null
--help whether to show help information or not, default is null
Please refer to nextflow.config for more options.
Container:
Docker image to use with -with-docker|-with-singularity options is
'docker://xinsongdu/lemaslab_rump:v0.0.0'
RUMP supports .mzXML format files.
RUMP accepts .mzXML
and .mzXL
files. Files are processed in parallel using MZmine-2.53; several statists are calculated using Python3 codes; interactive report is generated with MultiQC; pathway analysis are done with mummichog; unknown metabolites search are done with CEU Mass Mediator. Note that the processes related to unknow search with CEU Mass Mediator is turned off by default due to their unstable server, it can be turned on by setting parameter --unknown_search
to "1".
- Student t-test: Test if there is a significant statistical difference of certain peak intensities between the two groups of samples.
- Venn diagram: Report the number of peaks that are significantly enriched in one of the groups, and the number of peaks that have no significant difference between two groups.
- Principal component analysis: Dimensional reduction using the peak intensities of the two group samples, and visualize the difference.
- Hierarchical clustering: Cluster all samples and plot a heatmap to show the difference between samples and peaks.
- Bar plot: plot the metabolites with top-10 and bottom-10 fold-change for the comparison between two groups. (note: the figure will display abnormally if there is an infinite fold change value)
Logs and error reports will be stored under logs/
folder after running.
Run the following command to clean all the files generated by Nextflow
bash clear.sh
RUMP returns the following exit status values:
- 3: Positive file groups are not the same as negative file groups, please check design files.
- 4: Not all input files are in .mzXML format, please check input data folders.
- 5: One or more input files does not exist.
- Other Linux reserved exit codes: https://tldp.org/LDP/abs/html/exitcodes.html
Please submit questions, bug reports and feature requests to the issue tracker on GitHub:
- SECIM Core provided valuable guidance related to parameter settings of data processing.
- Sample data is from Metabolomics Workbench PR000188
- Data used for continuous integration is from Metabolights MTBLS146