-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How can I use metaeuk to annotation genome without reference #46
Comments
Hello, The way to do it is to download or construct a reference database to run against. What do you know about your genome? What taxonomic group is it? I could try to provide further advice based on your answer :) Best, |
Hello, thank you for developing this software. I would like to follow up this question. I obtained contigs with a length greater than 1kbp from the metagenome data of soil contaminated with heavy metals. I noticed that there are many reference datasets of mmseqs recommended by you, some of which are nucleic acid databases (https://github.com/soedinglab/MMseqs2/wiki#downloading-databases). May I ask which database is the most suitable for me (SILVA )? |
Hi, As a reference DB MetaEuk takes in either protein or protein profiles. Therefore the nucleotide DBs available thorough the Choosing the right protein/protein profile DB depends on your scientific goal. Here are two ideas I have, based on the details you provided:
You can also have a look at Busco if you are interested in estimating the geneomic completeness of specific organisms via single-copy marker genes of various phylogenetic groups. BUSCO uses MetaEuk internally. Best, |
Thank you very much for your detailed reply. I'll try the UniRef50 and SRC databases first, hopefully with good results. Best |
Dear developer and other users,
As the title says, I wanna use metaeuk to annotation genome without reference. But I cannot find how to deal with it in manual.
Best
Nana7m1
The text was updated successfully, but these errors were encountered: