Skip to content
forked from lemaslab/rump

A Reproducible Untargeted Metabolomics Data Processing Pipeline

License

Notifications You must be signed in to change notification settings

nishachachad/RUMP

 
 

Repository files navigation

Codacy Badge DOI travis license Hits

RUMP

RUMP workflow

Licence

This program is released as open source software under the terms of GNU GPL-v3.0 License.

Usage

Please refer to our wiki for how to install and use RUMP

Help message

RUMP can display usage information on the command line:

$ Nextflow main.nf --help true
N E X T F L O W  ~  version 19.01.0
Launching `main.nf` [romantic_celsius] - revision: 9004e52396
Project : /Users/xinsongdu/mnt/projects/RUMP
Git info: null - null [null]
Cmd line: /Users/xinsongdu/.pyenv/shims/Nextflow main.nf --help true
Manifest's pipeline version: 0.0.0

RUMP: A Reproducible Untargeted Metabolomics Data Processing Pipeline - Version: 0.0.0 (20200226)
This pipeline is distributed in the hope that it will be useful
but WITHOUT ANY WARRANTY. See the GNU GPL v3.0 for more details.

Please report comments and bugs to [email protected]
or at https://github.com/lemaslab/RUMP/issues.
Check https://github.com/lemaslab/RUMP for updates, and refer to
https://github.com/lemaslab/RUMP/wiki

Usage:
   nextflow run_all.nf [options] -with-docker xinsongdu/lemaslab_rump:v0.0.0

Arguments (it is mandatory to change `input_file` and `mzmine_dir` before running:
----------------------------- common parameters ----------------------------------
    --input_dir_pos                         folder location for positive data, default is 'data/POS'
    --input_dir_neg                         folder location for positive data, default is 'data/NEG'
    --POS_design_path                       location for positive design file, default is 'data/pos_design.csv'
    --NEG_design_path                       location for negative design file, default is 'data/neg_design.csv'
    --cutoff                                cutoff p-value for mummichog pathway analysis, default is 0.05
    --unknown_search                        whether do unknown search for unidentified metabolites or not, default is '0', please set it to '1' when needed
    --version                               whether to show version information or not, default is null
    --help                                  whether to show help information or not, default is null
Please refer to nextflow.config for more options.

Container:
    Docker image to use with -with-docker|-with-singularity options is
    'docker://xinsongdu/lemaslab_rump:v0.0.0'

RUMP supports .mzXML format files.

Components

RUMP accepts .mzXML and .mzXL files. Files are processed in parallel using MZmine-2.53; several statists are calculated using Python3 codes; interactive report is generated with MultiQC; pathway analysis are done with mummichog; unknown metabolites search are done with CEU Mass Mediator. Note that the processes related to unknow search with CEU Mass Mediator is turned off by default due to their unstable server, it can be turned on by setting parameter --unknown_search to "1".

Currently included statistical analysis

  • Student t-test: Test if there is a significant statistical difference of certain peak intensities between the two groups of samples.
  • Venn diagram: Report the number of peaks that are significantly enriched in one of the groups, and the number of peaks that have no significant difference between two groups.
  • Principal component analysis: Dimensional reduction using the peak intensities of the two group samples, and visualize the difference.
  • Hierarchical clustering: Cluster all samples and plot a heatmap to show the difference between samples and peaks.
  • Bar plot: plot the metabolites with top-10 and bottom-10 fold-change for the comparison between two groups. (note: the figure will display abnormally if there is an infinite fold change value)

Logging

Logs and error reports will be stored under logs/ folder after running.

Clean repository

Run the following command to clean all the files generated by Nextflow

bash clear.sh

Exit status values

RUMP returns the following exit status values:

  • 3: Positive file groups are not the same as negative file groups, please check design files.
  • 4: Not all input files are in .mzXML format, please check input data folders.
  • 5: One or more input files does not exist.
  • Other Linux reserved exit codes: https://tldp.org/LDP/abs/html/exitcodes.html

Bug reporting and feature requests

Please submit questions, bug reports and feature requests to the issue tracker on GitHub:

RUMP issue tracker

References

About

A Reproducible Untargeted Metabolomics Data Processing Pipeline

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • HTML 97.9%
  • Python 1.2%
  • Other 0.9%