@@ -211,12 +211,12 @@ Note this will also download a test rice dataset. You can test everything is in
211
211
This should print something like this:
212
212
213
213
Checking required binaries and data sources, set in pangeneTools.pm or in command line:
214
- EXE_MINIMAP : OK (path:/home/contrera/ plant-scripts/pangenes/../lib/minimap2/minimap2)
214
+ EXE_MINIMAP : OK (path:plant-scripts/pangenes/../lib/minimap2/minimap2)
215
215
EXE_BEDTOOLS : OK (path:bedtools)
216
- EXE_GFFREAD : OK (path:/home/contrera/ plant-scripts/pangenes/bin/gffread/gffread)
217
- EXE_COLLINEAR : OK (path:/home/contrera/ plant-scripts/pangenes/_collinear_genes.pl)
218
- EXE_CUTSEQUENCES : OK (path:/home/contrera/ plant-scripts/pangenes/_cut_sequences.pl)
219
- EXE_CLUSTANALYSIS : OK (path:/home/contrera/ plant-scripts/pangenes/_cluster_analysis.pl)
216
+ EXE_GFFREAD : OK (path:plant-scripts/pangenes/bin/gffread/gffread)
217
+ EXE_COLLINEAR : OK (path:plant-scripts/pangenes/_collinear_genes.pl)
218
+ EXE_CUTSEQUENCES : OK (path:plant-scripts/pangenes/_cut_sequences.pl)
219
+ EXE_CLUSTANALYSIS : OK (path:plant-scripts/pangenes/_cluster_analysis.pl)
220
220
EXE_GZIP : OK (path:gzip)
221
221
EXE_BZIP2 : OK (path:bzip2)
222
222
EXE_SORT : OK (path:sort)
@@ -283,18 +283,18 @@ $ perl get_pangenes.pl -d ../files/test_rice
283
283
# get_pangenes.pl -d ../files/test_rice -o 0 -r 0 -t all -c 0 -z 0 -I 0 -m local -w 0 -g 0 -O 0.5 -Q 50 -N 5 -s '' -H 0 -W '' -G '' -B '' -S '' -n 4 -R 0
284
284
285
285
# version ...
286
- # results_directory=/home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes
286
+ # results_directory=plant-scripts/pangenes/test_rice_pangenes
287
287
# parameters: MINGFFLEN=100 GFFACCEPTEDFEATS=gene,mRNA,transcript,exon,CDS GFFVALIDGENEFEAT=gene,mRNA,transcript
288
288
289
289
# checking input files...
290
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.fna
291
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.gff
290
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.fna
291
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.gff
292
292
# ../files/test_rice/Oryza_indica.ASM465v1.chr1.fa.gz 45.84MB genes=5292 non-valid=0
293
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.fna
294
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.gff
293
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.fna
294
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.gff
295
295
# ../files/test_rice/Oryza_nivara_v1.chr1.fa.gz 41.54MB genes=5143 non-valid=0
296
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.fna
297
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.gff
296
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.fna
297
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.gff
298
298
# ../files/test_rice/Oryza_sativa.IRGSP-1.0.chr1.fa.gz 42.56MB genes=5271 non-valid=0
299
299
300
300
# 3 genomes, 15706 genes
@@ -517,14 +517,14 @@ When you run it you'll see a couple differences in the output:
517
517
518
518
```
519
519
# checking input files...
520
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.fna
521
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.gff
520
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.fna
521
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_indica.ASM465v1.chr1.gff
522
522
# ../files/test_rice/Oryza_indica.ASM465v1.chr1.fa.gz 45.84MB genes=5292 non-valid=0 chrs/contigs=1
523
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.fna
524
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.gff
523
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.fna
524
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_nivara_v1.chr1.gff
525
525
# ../files/test_rice/Oryza_nivara_v1.chr1.fa.gz 41.54MB genes=5143 non-valid=0 chrs/contigs=1
526
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.fna
527
- # re-using /home/contrera/github/ plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.gff
526
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.fna
527
+ # re-using plant-scripts/pangenes/test_rice_pangenes/_Oryza_sativa.IRGSP-1.0.chr1.gff
528
528
# ../files/test_rice/Oryza_sativa.IRGSP-1.0.chr1.fa.gz 42.56MB genes=5271 non-valid=0 chrs/contigs=1
529
529
530
530
...
@@ -1181,7 +1181,6 @@ As get_pangenes.pl includes 3 other scripts, logs are split in independent files
1181
1181
| _ cluster_analysis.pl| test_rice_pangenes/Oryza_nivara_v1chr1_alltaxa_algMmap_ .queue|
1182
1182
1183
1183
The main log of get_pangenes.pl might contain error messages such as:
1184
- 1125 /homes/bcontreras/panoryza/plant-scripts/pangenes/bin/gffread/gffread --keep-genes BarkeBaRT2v18.gff |less
1185
1184
1186
1185
* EXIT, folder_pangenes/_ oryza_sativa_arc.oryza_sativa_chaomeo.algMmap.overlap0.5.patch.tsv does not exist, WGA might have failed or hard drive is still writing it (please re-run). This can happen in HPC cluster jobs due to drive latency issues. The fix is to open the relevant specific log (_ collinear_genes.pl in this case) and look for the failing command, which in this example looks like:
1187
1186
0 commit comments