Skip to content

Commit 483164c

Browse files
author
Chris Miller
committed
adjusted README to reflect change to using emirge_makedb.py
for building default SSU database.
1 parent b0f18fb commit 483164c

File tree

1 file changed

+16
-14
lines changed

1 file changed

+16
-14
lines changed

Diff for: README.txt

+16-14
Original file line numberDiff line numberDiff line change
@@ -63,7 +63,7 @@ EMIRGE expects the following programs to be installed and available in your path
6363
-usearch (www.drive5.com/usearch/ -- tested with usearch version 6.0.203; versions earlier than this are incompatible).
6464
-samtools (http://samtools.sourceforge.net/ -- tested with verison 0.1.18)
6565
-bowtie (http://bowtie-bio.sourceforge.net/index.shtml -- tested with version 0.12.7 and 0.12.8)
66-
66+
-vsearch (https://github.com/torognes/vsearch -- optional, but required if you use emirge_makedb.py)
6767
INSTALLATION
6868
------------------------------
6969

@@ -132,20 +132,22 @@ CANDIDATE SSU DATABASE
132132

133133
You can download a standard candidate SSU database by running the
134134
following command:
135-
python emirge_download_candidate_db.py
136-
137-
This script is included with EMIRGE. The current version of this
138-
database was made using Silva release SSURef_111_NR
139-
(http://www.arb-silva.de/). Sequences were clustered using uclust at
140-
97% sequence identity, short and long sequences were removed, and
141-
non-standard characters were changed to be within {ACTG} (using
142-
utils/fix_nonstandard_chars.py).
143-
144-
You can use any reference SSU database with emirge, though this one is
145-
recommended. No matter your choice, you should run
146-
utils/fix_nonstandard_chars.py on your fasta file. You will also need
147-
to first build a bowtie index, with something like:
135+
python emirge_makedb.py
136+
137+
This script is included with EMIRGE, and by default downloads the
138+
current version of the Silva small subunit rRNA database
139+
(http://www.arb-silva.de/). Sequences are clustered by default using
140+
vsearch at 97% sequence identity, short and long sequences are
141+
removed, and non-standard characters are changed to be within {ACTG}.
142+
See emirge_makedb.py --help for details.
143+
144+
You can use any reference SSU database with emirge, though SILVA is
145+
recommended. If you use another database, you should run
146+
utils/fix_nonstandard_chars.py on your fasta file to remove ambiguous
147+
characters, which bowtie cannot map against. You will also need to
148+
first build a bowtie index, with something like:
148149
$ bowtie-build SSU_candidate_db.fasta SSU_candidate_db_btindex
150+
149151
You might also consider changing the offrate (see
150152
http://bowtie-bio.sourceforge.net/manual.shtml)
151153

0 commit comments

Comments
 (0)