Skip to content

Commit bc81eeb

Browse files
authored
Merge pull request #187 from nf-core/variant_calling
address suggestions from Chris
2 parents 3255a97 + f3319f1 commit bc81eeb

File tree

5 files changed

+17
-14
lines changed

5 files changed

+17
-14
lines changed

CITATIONS.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@
6464
6565
- [pycoQC](https://doi.org/10.21105/joss.01236)
6666

67-
> Leger A, Leonardi T, (2019). pycoQC, interactive quality control for Oxford Nanopore Sequencing. Journal of Open Source Software, 4(34), 1236.
67+
> Leger A, Leonardi T, (2019). pycoQC, interactive quality control for Oxford Nanopore Sequencing. Journal of Open Source Software, 4(34), 1236, https://doi.org/10.21105/joss.01236
6868
6969
- [qcat](https://github.com/nanoporetech/qcat)
7070

conf/test_nobc_nodx_vc.config

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,6 +24,6 @@ params {
2424
skip_demultiplexing = true
2525
call_variants = true
2626

27-
variant_caller = 'deepvariant'
28-
structural_variant_caller = 'cutesv'
27+
variant_caller = 'medaka'
28+
structural_variant_caller = 'sniffles'
2929
}

docs/output.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ This document describes the output produced by the nf-core/nanoseq pipeline. Mos
88

99
## Pipeline overview
1010

11-
The nf-core/nanoseq pipeline is built using [Nextflow](https://www.nextflow.io/). There are many different potential outputs for the pipeline depending on what file inputs and parameters you use. Please see [`main README.md`](../README.md) for a condensed overview of the steps in the pipeline, and the bioinformatics tools used at each step.
11+
The nf-core/nanoseq pipeline is built using [Nextflow](https://www.nextflow.io/). There are many different potential outputs for the pipeline depending on what file inputs and parameters you use. Please see [`main README.md`](../README.md) for a condensed overview of the steps in the pipeline and the bioinformatics tools used at each step.
1212

1313
See [Oxford NanoPore website](https://nanoporetech.com/) for more information regarding the sequencing technology, protocol, and for an extensive list of additional resources.
1414

@@ -54,7 +54,7 @@ _Documentation_:
5454
[NanoLyse](https://github.com/wdecoster/nanolyse)
5555

5656
_Description_:
57-
If you would like to run NanoLyse on the raw FASTQ files you can provide `--run_nanolyse` when running the pipeline. By default, the pipeline will filter lambda phage reads, however you can provide your own FASTA file of "contaminants" with `--nanolyse_fasta`. The filtered FASTQ files will contain raw reads without the specified reference sequences (default: lambda phage sequences).
57+
If you would like to run NanoLyse on the raw FASTQ files you can provide `--run_nanolyse` when running the pipeline. By default, the pipeline will filter lambda phage reads. However, you can provide your own FASTA file of "contaminants" with `--nanolyse_fasta`. The filtered FASTQ files will contain raw reads without the specified reference sequences (default: lambda phage sequences).
5858

5959
## Sequencing QC
6060

@@ -71,7 +71,7 @@ _Documentation_:
7171
[PycoQC](https://github.com/a-slide/pycoQC), [NanoPlot](https://github.com/wdecoster/NanoPlot)
7272

7373
_Description_:
74-
_PycoQC_ and _NanoPlot_ can compute QC metrics and generate plots using the sequencing summary information generated by _Guppy_ e.g. distribution of read length, read length over time, number of reads per barcode and other general stats. _NanoPlot_ can also generates QC metrics directly from FASTQ files as described in the next section.
74+
_PycoQC_ and _NanoPlot_ can compute QC metrics and generate plots using the sequencing summary information generated by _Guppy_, e.g., distribution of read length, read length over time, number of reads per barcode and other general stats. _NanoPlot_ can also generate QC metrics directly from FASTQ files as described in the next section.
7575

7676
![PycoQC - Number of reads per barcode](images/pycoqc_readsperbarcode.png)
7777

@@ -90,11 +90,11 @@ _Documentation_:
9090
[NanoPlot](https://github.com/wdecoster/NanoPlot), [FastQC](http://www.bioinformatics.babraham.ac.uk/projects/fastqc/Help/)
9191

9292
_Description_:
93-
_NanoPlot_ can be used to produce general quality metrics from the per barcode FASTQ files generated by _Guppy_ e.g. quality score distribution, read lengths and other general stats.
93+
_NanoPlot_ can be used to produce general quality metrics from the per barcode FASTQ files generated by _Guppy_ e.g. quality score distribution, read lengths, and other general stats.
9494

9595
![Nanoplot - Read quality vs read length](images/nanoplot_readlengthquality.png)
9696

97-
_FastQC_ can give general quality metrics about your reads. It can provide information about the quality score distribution across your reads, the per base sequence content (%A/C/G/T). You can also generate information about adapter contamination and other over-represented sequences.
97+
_FastQC_ can give general quality metrics about your reads. It can provide information about the quality score distribution across your reads, and the per-base sequence content (%A/C/G/T). You can also generate information about adapter contamination and other over-represented sequences.
9898

9999
## Alignment
100100

@@ -131,7 +131,7 @@ _Documentation_:
131131
[BEDTools](https://bedtools.readthedocs.io/en/latest/), [bedGraphToBigWig](https://genome.ucsc.edu/goldenpath/help/bigWig.html#Ex3), [`bedToBigBed`](https://genome.ucsc.edu/goldenPath/help/bigBed.html#Ex2)
132132

133133
_Description_:
134-
The [bigWig](https://genome.ucsc.edu/goldenpath/help/bigWig.html) format is in an indexed binary format useful for displaying dense, continuous data in Genome Browsers such as the [UCSC](https://genome.ucsc.edu/cgi-bin/hgTracks) and [IGV](http://software.broadinstitute.org/software/igv/). This mitigates the need to load the much larger BAM file for data visualisation purposes which will be slower and result in memory issues. The bigWig format is also supported by various bioinformatics software for downstream processing such as meta-profile plotting.
134+
The [bigWig](https://genome.ucsc.edu/goldenpath/help/bigWig.html) format is an indexed binary format useful for displaying dense, continuous data in Genome Browsers such as the [UCSC](https://genome.ucsc.edu/cgi-bin/hgTracks) and [IGV](http://software.broadinstitute.org/software/igv/). This mitigates the need to load the much larger BAM file for data visualisation purposes which will be slower and result in memory issues. The bigWig format is also supported by various bioinformatics software for downstream processing such as meta-profile plotting.
135135

136136
[bigBed](https://genome.ucsc.edu/goldenPath/help/bigBed.html) are more useful for displaying distribution of reads across exon intervals as is typically observed for RNA-seq data. Therefore, these files will only be generated if `--protocol directRNA` or `--protocol cDNA` are defined.
137137

@@ -176,7 +176,7 @@ _Documentation_:
176176
[Sniffles](https://github.com/fritzsedlazeck/Sniffles), [cuteSV](https://github.com/tjiangHIT/cuteSV)
177177

178178
_Description_:
179-
If the `--protocol DNA` and the `--call_variants` parameters are defined small and structural variants.
179+
If the `--protocol DNA` and the `--call_variants` parameters are defined then both small and structural variant variant calls can be generated.
180180
Short variants can be called using _medaka_, _deepvariant_ or _pepper_margin_deepvariant_. The short variant caller is specified using the `--variant_caller` parameter.
181181
Structural variants can be called using either _cuteSV_ or _sniffles_. The structural variant caller is specified using the `--structural_variant_caller` parameter.
182182
The short variant and/or structural variant calling steps is skipped if using the `--skip_vc` and `--skip_sniffles` flags.
@@ -189,9 +189,9 @@ The short variant and/or structural variant calling steps is skipped if using th
189189
If bambu is used:
190190

191191
- `bambu/`
192-
- `extended_annotations.gtf`: a GTF file that contains both annotated and novel transcripts
193-
- `counts_gene.txt`: a TXT file containing gene expression estimates
194-
- `counts_transcript.txt`: a TXT file containing transcript expression estimates
192+
- `extended_annotations.gtf`: a GTF file that contains both annotated and novel transcripts.
193+
- `counts_gene.txt`: a TXT file containing gene expression estimates.
194+
- `counts_transcript.txt`: a TXT file containing transcript expression estimates.
195195

196196
If StringTie2 is used:
197197

modules/local/bam_rename.nf

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,9 @@ process BAM_RENAME {
55
//container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ?
66
// 'https://containers.biocontainers.pro/s3/SingImgsRepo/biocontainers/v1.2.0_cv1/biocontainers_v1.2.0_cv1.img' :
77
// 'quay.io/biocontainers/biocontainers:latest' }"
8+
container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ?
9+
'https://depot.galaxyproject.org/singularity/sed:4.7.0' :
10+
'quay.io/biocontainers/sed:4.7.0' }"
811

912
input:
1013
tuple val(meta), path(bam)

workflows/nanoseq.nf

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -92,7 +92,7 @@ if (params.call_variants) {
9292
if (!params.skip_sv && params.structural_variant_caller != 'sniffles' && params.structural_variant_caller != 'cutesv') {
9393
exit 1, "Invalid structural variant caller option: ${params.structural_variant_caller}. Valid options: 'sniffles', 'cutesv"
9494
}
95-
if (params.enable_conda && params.variant_caller != 'medaka') {
95+
if (!params.skip_vc && params.enable_conda && params.variant_caller != 'medaka') {
9696
exit 1, "Conda environments cannot be used when using the deepvariant or pepper_margin_deepvariant tools. Valid options: 'docker', 'singularity'"
9797
}
9898
}

0 commit comments

Comments
 (0)