@@ -63,7 +63,7 @@ EMIRGE expects the following programs to be installed and available in your path
 -usearch (www.drive5.com/usearch/ -- tested with usearch version 6.0.203; versions earlier than this are incompatible).
 -samtools (http://samtools.sourceforge.net/ -- tested with version 0.1.18)
 -bowtie (http://bowtie-bio.sourceforge.net/index.shtml -- tested with version 0.12.7 and 0.12.8)
-
+ -vsearch (https://github.com/torognes/vsearch -- optional, but required if you use emirge_makedb.py)
 INSTALLATION
 ------------------------------
 
@@ -132,20 +132,22 @@ CANDIDATE SSU DATABASE
 
 You can download a standard candidate SSU database by running the
 following command:
- python emirge_download_candidate_db.py
-
- This script is included with EMIRGE. The current version of this
- database was made using Silva release SSURef_111_NR
- (http://www.arb-silva.de/). Sequences were clustered using uclust at
- 97% sequence identity, short and long sequences were removed, and
- non-standard characters were changed to be within {ACTG} (using
- utils/fix_nonstandard_chars.py).
-
- You can use any reference SSU database with emirge, though this one is
- recommended. No matter your choice, you should run
- utils/fix_nonstandard_chars.py on your fasta file. You will also need
- to first build a bowtie index, with something like:
+ python emirge_makedb.py
+
+ This script is included with EMIRGE, and by default downloads the
+ current version of the Silva small subunit rRNA database
+ (http://www.arb-silva.de/). Sequences are clustered by default using
+ vsearch at 97% sequence identity, short and long sequences are
+ removed, and non-standard characters are changed to be within {ACTG}.
+ See emirge_makedb.py --help for details.
+
+ You can use any reference SSU database with emirge, though SILVA is
+ recommended. If you use another database, you should run
+ utils/fix_nonstandard_chars.py on your fasta file to remove ambiguous
+ characters, which bowtie cannot map against. You will also need to
+ first build a bowtie index, with something like:
 $ bowtie-build SSU_candidate_db.fasta SSU_candidate_db_btindex
+
 You might also consider changing the offrate (see
 http://bowtie-bio.sourceforge.net/manual.shtml)
 
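
A note on the character-cleaning step mentioned in the new README text: the diff says non-standard characters are changed to be within {ACTG}, but it does not show how. The sketch below is one simple way to do this for FASTA input. It is not taken from emirge_makedb.py or utils/fix_nonstandard_chars.py. The function names, the fixed seed, the U-to-T conversion, and the choice to substitute a randomly chosen base are all assumptions made for illustration.

```python
import random

VALID_BASES = "ACGT"


def fix_nonstandard_chars(sequence, rng):
    """Return the sequence restricted to the ACGT alphabet.

    Assumption for this sketch: the sequence is uppercased, RNA-style U is
    converted to T, and any other character outside ACGT (for example N, R, Y)
    is replaced by a base drawn at random from ACGT.
    """
    cleaned = sequence.upper().replace("U", "T")
    return "".join(
        base if base in VALID_BASES else rng.choice(VALID_BASES)
        for base in cleaned
    )


def fix_fasta_lines(lines, seed=0):
    """Yield FASTA lines with sequence lines cleaned and headers untouched.

    The seed is a hypothetical parameter, used only to make runs repeatable.
    """
    rng = random.Random(seed)
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">") or not line:
            # Header lines and blank lines pass through unchanged.
            yield line
        else:
            yield fix_nonstandard_chars(line, rng)
```

To apply it to a file, iterate over an open file handle, for example `for line in fix_fasta_lines(open("my_db.fasta")): print(line)`, where `my_db.fasta` is a placeholder name. The cleaned output is what you would then pass to bowtie-build.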