-
Notifications
You must be signed in to change notification settings - Fork 182
Home
Benjamin Buchfink edited this page Jun 21, 2023
·
29 revisions
# downloading the tool
wget http://github.com/bbuchfink/diamond/releases/download/v2.1.8/diamond-linux64.tar.gz
tar xzf diamond-linux64.tar.gz
# creating a diamond-formatted database file
./diamond makedb --in reference.fasta -d reference
# running a search in blastp mode
./diamond blastp -d reference -q queries.fasta -o matches.tsv
# running a search in blastx mode
./diamond blastx -d reference -q reads.fasta -o matches.tsv
# downloading and using a BLAST database
update_blastdb.pl --decompress --blastdb_version 5 swissprot
./diamond prepdb -d swissprot
./diamond blastp -d swissprot -q queries.fasta -o matches.tsv
Some important points to consider:
-
Repeat masking is applied to the query and reference
sequences by default. To disable it, use
--masking 0
. - DIAMOND is optimized for large input files of >1 million proteins. Naturally the tool can be used for smaller files as well, but the algorithm will not reach its full efficiency.
- The program may use quite a lot of memory and also temporary
disk space. Should the program fail due to running out of either
one, you need to set a lower value for the
block size parameter
-b
. - The sensitivity can be adjusted using the options
--fast
,--mid-sensitive
,--sensitive
,--more-sensitive
,--very-sensitive
and--ultra-sensitive
.