Pass the databases as parameters, skipping downloading #50

ypriverol · 2022-05-07T16:19:13Z

@DongdongdongW has reported that the COSMIC download sometimes fails. From the email:

During this process, I encountered some problems. For some reason, the COSMIC database cannot be downloaded. At the same time, the VCF file from ENSEMBL is missing in the pipeline. So I chose to download the files from these databases myself and generate the proteogenomics database via pypgatk. When selecting the COSMIC database and cBioportal, I only selected data for cell line A549 and lung cancer type. The size of the database containing the decoy generated by the most popular pypgatk is 3.21GB.

We can add the download logic using wget and also provide an option for the user to pass the COSMIC file as a parameter, in which case the pipeline does not need to download it.
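
A rough sketch of that decision, written in Python only to illustrate the control flow (the pipeline itself is Nextflow). The function name `resolve_cosmic_file`, the `download` callback, and the `cosmic_download.tsv` file name are all hypothetical; the callback stands in for whatever wget-based step the pipeline ends up using.

```python
from pathlib import Path
from typing import Callable, Optional


def resolve_cosmic_file(
    user_path: Optional[str],
    download: Callable[[Path], None],
    work_dir: str,
) -> Path:
    """Return the COSMIC file to use, downloading only when none was supplied.

    `download` is a hypothetical callback that fetches the file and writes it
    to the path it receives (for example, a thin wrapper around wget).
    """
    if user_path:
        path = Path(user_path)
        # Fail early with a clear message instead of silently downloading.
        if not path.is_file():
            raise FileNotFoundError(f"COSMIC file not found: {path}")
        return path

    # No file supplied: fall back to the download step.
    target = Path(work_dir) / "cosmic_download.tsv"  # assumed file name
    download(target)
    return target
```

Raising an error when a supplied path does not exist, rather than falling back to the download, seems like the safer choice here: the user asked for a specific file, and a silent download could hide a typo in the parameter.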

DongdongdongW · 2022-05-13T12:44:52Z

I have set the parameters to upload the COSMIC files.

husensofteng · 2022-06-02T15:58:42Z

I think it would actually be good to add a parameter e.g. downloaded_data_dir or similar where the user can put pre-downloaded files that are used by the pipeline.

At each download section in the pipeline we can skip downloading the files that already exist in the given directory. Though, I don't know if there is a nice way to implement this in DSL2.
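
To make the idea concrete, here is a minimal sketch of the "skip what already exists" check, in Python rather than Nextflow. The helper name `split_by_availability` is hypothetical, and treating empty files as missing is an assumption made here to guard against truncated downloads; it is not something the pipeline currently does.

```python
from pathlib import Path
from typing import Dict, Iterable, List, Tuple


def split_by_availability(
    data_dir: str, required: Iterable[str]
) -> Tuple[Dict[str, Path], List[str]]:
    """Split required file names into those already present and those missing.

    Returns (present, missing): `present` maps each file name to its path in
    `data_dir`, and `missing` lists the names that still need downloading.
    """
    base = Path(data_dir)
    present: Dict[str, Path] = {}
    missing: List[str] = []
    for name in required:
        candidate = base / name
        # Assumption: a zero-byte file is a failed download, so fetch it again.
        if candidate.is_file() and candidate.stat().st_size > 0:
            present[name] = candidate
        else:
            missing.append(name)
    return present, missing
```

Each download step would then run only for the names in `missing`. On the DSL2 side, Nextflow's `storeDir` directive may cover part of this natively, since it skips a process whose declared outputs already exist in the given directory, though whether it fits this pipeline would need checking.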

ypriverol added the enhancement Improvement for existing functionality label May 7, 2022

ypriverol assigned DongdongdongW May 7, 2022

ypriverol added the feature-request Request for a new pipeline feature label May 7, 2022

ypriverol added this to the DLS2 milestone May 7, 2022
