You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We need to check the average read length, and if it's above 250 or so, we'll need to use STARlong.
To do this, we need to make clean in the STAR/source directory, and then make STARlong to make STARlong. This doesn't build unless STAR is all cleaned out.
We also need less stringent parameters and more seeds per read.
The text was updated successfully, but these errors were encountered:
acesnik
changed the title
Aligning long reads (e.g. nanopore)
Aligning long reads with STAR (e.g. nanopore)
May 7, 2018
Base calling nanopore data is fairly straightforward with albacore. I used the docker container for it with docker run --rm -i -t -v G:/data:/mnt genomicpariscenter/albacore and then ran the command read_fast5_basecaller.py -t 11 -i /mnt/data/reads -s /mnt/data/AlbacoreResults/ -r -f FLO-MIN107 -k SQK-LSK108 where the last two are the chemistries we used for the run.
After basecalling, I tried aligning with STARlong and minimap2. I got more spliced reads with the latter, but they were mostly junk.
I tried correcting the indels with a tool called canu, but that didn't improve things much.
Some notes on PacBio:
The best tool for generating gene models based off of PacBio runs is called quiver, but this only works with PacBio BAM files. The ones generated from STARlong with FASTA read input didn't work.
The PacBio tools are available from bioconda, and downloading them was pretty straightforward.
We need to check the average read length, and if it's above 250 or so, we'll need to use
STARlong
.To do this, we need to
make clean
in theSTAR/source
directory, and thenmake STARlong
to makeSTARlong
. This doesn't build unlessSTAR
is all cleaned out.We also need less stringent parameters and more seeds per read.
The text was updated successfully, but these errors were encountered: