-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Prepare data for wastewater materials #43
Comments
Command to run pipeline: nextflow run nf-core/viralrecon \
-profile singularity -r dev \
--max_memory '16.GB' --max_cpus 8 \
--input samplesheet.csv \
--genome 'MN908947.3' \
--platform illumina \
--protocol amplicon \
--primer_set artic \
--primer_set_version 4.1 \
--outdir results/viralrecon \
--variant_caller ivar \
--consensus_caller ivar \
--skip_picard_metrics \
--skip_asciigenome \
--skip_assembly |
Hi @tavareshugo, It took me 2h 11m 9s to run 3 samples on Ubuntu. So 5 samples may take minimum "4hrs". I think we may end up only using preprocessed results instead??? |
Bajuna tested running 3 samples with
|
I consistently get this error:-
I think the bed file even when trying to convert to BED6 as instructed here it is still missing one column apparently two strands columns as in artic bed file. I have found this swift_v2_masterfile here. So, I wonder if we need to extract the left and right primer sequences to the swift bed file hopefully to avoid this error???? Cheers! |
Now with SWIFT provided bed file which was converted to bed6 with this one liner
Should we start now preparing learning objectives for the sars-cov2-genomics wastewater surveillance section? |
We have a subset of 60 samples from Karthikeyan et al. that show a transition between delta and omicron variants.
To run it live in a workshop we need to have the pipeline finish in under 1h.
If that is not possible, we may need to instead use a
preprocessed
folder with results from 60 samples and in the workshop they only process 5-10 samples.--freyja_repeats 0
with viralrecon on a small number of samples. To see if it's possible to skip this step. If this throws error use 1 and see if that works. This is to save time running the pipeline.--primer_bed
option. Also download the reference FASTA file and GFF and pass directly with--fasta
and--gff
.sars-wastewater/participants
for this. This includes:data/reads
- FASTQ files for the 5 samples to be processedresources
- reference genome FASTA and GFF annotation (may be useful for some analysis)preprocessed
- with results from 30 samplesscripts
- shell scripts that they will fix in the exercisesutilities
- python scripts we provide, e.g. to prepare samplesheet or tidy freyja output filessample_info.csv
metadata table with "sample,date,country,location,latitude,longitude"The text was updated successfully, but these errors were encountered: