Merging template updates 2.1 #56

d4straub · 2021-08-02T15:20:34Z

This should solve #52. This PR is about making the pipeline work again, not adding new features.

As a byproduct, #51 is solved as well.

As far as I can see there is no test with appropriate data for running canu, PycoQC, or nanopolish.

PR checklist

d4straub · 2021-08-04T15:14:23Z

Unfortunately, the nf-core modules unicycler & spades both seem to allow only short-read assembly, which is quite disappointing. I might have to use custom modules then. I will postpone that.

d4straub · 2021-08-06T12:49:25Z

Short-read processing works.

nf-core modules that I was looking into:

- unicycler: allows only short-read assembly
- Nanoplot: accepts only .fastq.gz files ~~( I have .fastq, could be changed)~~ (the sample sheet specifies the incompatible "*.fq.gz")
- minimap2: ~~unclear, nf-core tools did not allow me to install it~~ installed minimap2/align
- kraken2: installed
- quast: installed
- prokka: ~~could not make it work when not using valid files in "path proteins" or "path prodigal"~~ installed
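
The Nanoplot suffix mismatch noted above can be illustrated with a minimal sketch. It assumes, as reported in this comment, that only names ending in `.fastq.gz` are accepted; the function names `has_accepted_suffix` and `incompatible_reads` are hypothetical and are not part of the pipeline or of any nf-core module.

```python
from pathlib import Path

# Assumption taken from the comment above: only this compound suffix is accepted.
ACCEPTED_SUFFIX = ".fastq.gz"


def has_accepted_suffix(filename: str, suffix: str = ACCEPTED_SUFFIX) -> bool:
    """Return True if the file name ends with the accepted compound suffix."""
    return Path(filename).name.endswith(suffix)


def incompatible_reads(filenames):
    """Return the read files whose names would not match the accepted suffix."""
    return [f for f in filenames if not has_accepted_suffix(f)]


if __name__ == "__main__":
    # Hypothetical file names as they might appear in a sample sheet.
    for name in incompatible_reads(["s1.fastq.gz", "s2.fq.gz", "s3.fastq"]):
        print(f"not accepted: {name}")
```

A check like this only inspects file names; it says nothing about whether the file content is actually gzip-compressed FASTQ.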

d4straub · 2021-08-20T08:01:13Z

The Nanopolish container does not include parallel, so the process fails (parallel: command not found). I am now testing without parallel (see last commit 545763a).
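
A "command not found" failure of this kind can be detected before a tool is invoked. The following is a minimal sketch using Python's standard `shutil.which`; the helper names `tool_available` and `choose_mode` are hypothetical, and this is not how the pipeline itself handles the issue (the commit referenced above simply removes parallel).

```python
import shutil


def tool_available(name: str) -> bool:
    """Return True if an executable with this name is found on PATH."""
    return shutil.which(name) is not None


def choose_mode(tool: str = "parallel") -> str:
    """Pick 'parallel' when the helper tool exists, otherwise fall back to 'serial'."""
    return "parallel" if tool_available(tool) else "serial"


if __name__ == "__main__":
    print(f"running in {choose_mode()} mode")
```

Run inside the container in question, such a check would report the missing executable up front instead of failing partway through the process.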

d4straub · 2021-08-20T14:01:14Z

Hi @apeltzer, I would appreciate suggestions/comments.
~~NanoPolish seems to work, real data is still running, but it passed previously failing steps. So all seems fine right now.~~ Update: NanoPolish works, everything is working.

What I see as still lacking:

- No data in test_full.config yet, therefore no AWS tests or AWS results. For the sake of testing, I could add test_hybrid there; would that be fine?
  Edit: alternatively, I could add Illumina-only data from PRJNA563526 (https://pubmed.ncbi.nlm.nih.gov/32561582/), either both genomes or just one.
- Unused parameters are still appearing, i.e. igenomes, params.fasta, params.genomes. Can I remove those, or might they have any value? Right now they are not interfering, but they are not used either.
- I attempted to complete the docs (they were actually incomplete before), but some things might have slipped through.
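
For the unused parameters mentioned above, a rough way to spot candidates is to compare the declared parameter names against the `params.<name>` references found in the workflow source. This is only a heuristic sketch: the function `unused_params` and the example inputs are hypothetical, and a plain text search will miss parameters that are accessed in other ways.

```python
import re


def unused_params(declared, source_text):
    """Return declared parameter names never referenced as params.<name> in the text."""
    used = set(re.findall(r"\bparams\.(\w+)", source_text))
    return sorted(set(declared) - used)


if __name__ == "__main__":
    # Hypothetical declared names and a hypothetical workflow snippet.
    declared = ["input", "fasta", "genomes"]
    workflow = "reads = Channel.fromPath(params.input)"
    print(unused_params(declared, workflow))
```

Anything such a search reports should still be checked by hand before it is removed.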

What's next when this PR is fine? I was thinking of addressing the most urgent matter(s) in #57; right now I think it's the structural changes. (You might remember that you and Stefan had some trouble with a bacterial assembly; this is already solved in this version, as the miniasm assembly produces 1 contig of the correct size. But that might not have been obvious, because all software combinations/options have to be specified individually. I'd recommend that by default all assemblers run based on the provided data, so that one can compare the performance of all methods.)

However, it might be best to release the pipeline before those structural changes, because I might need some more weeks/months to implement it?!

apeltzer · 2021-08-23T12:01:04Z

> No data in test_full.config yet, therefore no AWS tests or AWS results. For the sake of testing, I could add test_hybrid there; would that be fine?

Yes, I think this wasn't fully there in the past when I started working on the workflow (which I also took over from Andreas Wilm in the very beginning, to be honest!)

> Edit: alternatively, I could add Illumina-only data from PRJNA563526 (https://pubmed.ncbi.nlm.nih.gov/32561582/), either both genomes or just one.

Both are fine for me, large-scale tests can be large-scale ;-)

> Unused parameters are still appearing, i.e. igenomes, params.fasta, params.genomes. Can I remove those, or might they have any value? Right now they are not interfering, but they are not used either.

Drop everything that isn't used, this just confuses people.

> I attempted to complete the docs (they were actually incomplete before), but some things might have slipped through.

I assume this could all benefit from some extra work on that side. Happy to help out here and there on that and check that all is there.

> What's next when this PR is fine? I was thinking of addressing the most urgent matter(s) in #57; right now I think it's the structural changes. (You might remember that you and Stefan had some trouble with a bacterial assembly; this is already solved in this version, as the miniasm assembly produces 1 contig of the correct size. But that might not have been obvious, because all software combinations/options have to be specified individually. I'd recommend that by default all assemblers run based on the provided data, so that one can compare the performance of all methods.)

If this is working fine, one could do it. Until now it was more a matter of choosing 1 option, then running multiple runs with different tools and finally comparing, but we could do as you proposed, yes, I agree.

> However, it might be best to release the pipeline before those structural changes, because I might need some more weeks/months to implement it?!

Yup, a release should help address open points / issues & then moving forward with the restructure is a good idea 👍🏻

d4straub · 2021-08-24T07:24:35Z

Alright, all done & tested. I tested test_full on our cluster, but not on AWS; I will start the AWS test using GitHub Actions once the PR is merged.

apeltzer · 2021-08-25T13:51:05Z

So local runs finished here too - happy to see this on dev and proceed from there 👍🏻

apeltzer

Looks all good, but I will do a final review on the dev to master PR, as this is basically an entire rewrite of the original DSLv1 pipeline.

d4straub · 2021-08-25T13:52:15Z

Great, thanks!

nf-core-bot and others added 14 commits November 19, 2020 13:47

Template update for nf-core/tools version 1.12

8e43acd

Template update for nf-core/tools version 1.12.1

2e7540d

Template update for nf-core/tools version 1.13

225a827

Template update for nf-core/tools version 1.13.1

be7bbe1

Template update for nf-core/tools version 1.13.2

ceebac0

Template update for nf-core/tools version 1.13.3

0f9ba31

Template update for nf-core/tools version 1.14

b4b1d3a

Template update for nf-core/tools version 2.0

fc85b5d

Template update for nf-core/tools version 2.0.1

db29c73

Template update for nf-core/tools version 2.1

e19ae65

barebone template update 2.1

798cea3

fix CHANGELOG.md

e578b4f

fix linting

239a48f

update description and README

45cc713

d4straub self-assigned this Aug 2, 2021

d4straub added the WIP label Aug 2, 2021

d4straub linked an issue Aug 2, 2021 that may be closed by this pull request

Conversion to DSL2 & update of tools #52

Closed

d4straub added this to the 2.0.0 milestone Aug 2, 2021

d4straub added 6 commits August 3, 2021 13:22

update schema and version

b6d1a8f

change samplesheet to csv and fix parsing

5afb220

fix linting

2cc2e70

add skewer

335e175

fix linting

b513931

add more modules

c64afa0

d4straub added 4 commits August 5, 2021 17:25

from samplesheet csv to unicycler channels all fine

0941b47

add skewer version

4034bcf

add prokka, kraken2 and quast

b7cee69

fix linting

6c89976

d4straub added 8 commits August 19, 2021 14:17

remove TODO

34fc917

fix dfast

ad96a76

fix picoqc

9c0b316

fix linting and pycoqc

911186a

update output with medaka and nanopolish

097938e

remove samtools files from results folder

5ecd08e

update parameter descriptions

e7b9ec2

remove parallel from nanopolish

545763a

d4straub added 6 commits August 20, 2021 10:17

fix pycoqc

6c6f01a

amend output and update usage

787ae45

remove tabs

1df85e5

fix nanopolish input

8ba97d6

fix nanopolish input

ae2a20f

remove unused samplesheet check and solve some TODOs

1455d9d

d4straub removed the WIP label Aug 23, 2021

d4straub requested a review from apeltzer August 23, 2021 09:21

d4straub added 4 commits August 23, 2021 14:16

remove any connection to igenomes

61df69d

add full size test dataset

a1baa68

removed unused parameter from test_long

97a1ea8

ignore igenomes_base to avoid frequent warning with institutional pro…

002fcd0

…files

apeltzer approved these changes Aug 25, 2021

apeltzer merged commit c343fb9 into nf-core:dev Aug 25, 2021

d4straub deleted the merging-template-updates-2.1 branch August 25, 2021 13:52

Daniel-VM mentioned this pull request Sep 11, 2023

Merging template updates 2.9 and workflow update #86

Merged

20 tasks
