Skip to content

Commit

Permalink
add hmms to readme
Browse files Browse the repository at this point in the history
  • Loading branch information
oschwengers committed Sep 24, 2024
1 parent 4029e04 commit 0eb2bbe
Showing 1 changed file with 22 additions and 0 deletions.
22 changes: 22 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -274,6 +274,27 @@ dbxrefs | `<empty>`, `db:id`, `,` separated list | `VFDB:VF0511`

Protein sequences provided in short Fasta or GenBank format are searched with default thresholds of 90%, 80% and 80% for minimal identity, query and subject coverage, respectively.

#### User-provided HMMs

Bakta accepts user-provided trusted HMMs via `--hmms` in HMMER's text format. If set, Bakta will adhere to the *trusted cutoff* specified in the HMM header. In addition, a max. evalue threshold of 1e-6 is applied. By default, Bakta used the HMM description line as a product description. Further information can be provided via the HMM description line using the *short* format as explained above.

```bash
# default
HMMER3/f [3.1b2 | February 2015]
NAME id
ACC id
DESC product
LENG 435
TC 600 600

# short
NAME id
ACC id
DESC gene~~~product~~~dbxrefs
LENG 435
TC 600 600
```

### Output

Annotation results are provided in standard bioinformatics file formats:
Expand Down Expand Up @@ -389,6 +410,7 @@ Annotation:
Replicon information table (tsv/csv)
--regions REGIONS Path to pre-annotated regions in GFF3 or Genbank format (regions only, no functional annotations).
--proteins PROTEINS Fasta file of trusted protein sequences for CDS annotation
--hmms HMMS HMM file of trusted hidden markov models in HMMER format for CDS annotation
--meta Run in metagenome mode. This only affects CDS prediction.

Workflow:
Expand Down

0 comments on commit 0eb2bbe

Please sign in to comment.