Code to make it easy to import heterogeneous data into an EnsEMBL database.
The instructions below will help you get an Ensembl database and website up and running in an afternoon - with four Lepidopteran genomes mirrored from Ensembl Metazoa plus a fresh import of the genome of the winter moth Operophtera brumata direct from publicly hosted .gff and .fasta files.
This is a sister project to easy-mirror (included as a submodule), which makes it possible to set up a mirror of any Ensembl or Ensembl Genomes (including Bacteria, Metazoa, Fungi, Plants and Protists) species in four simple steps that can be run in less than an hour on a fresh Ubuntu installation.
The latest and most complete documentation for both projects is available at easy-import.readme.io
sudo apt-get update
sudo apt-get upgrade
sudo apt-get install git
cd ~
git clone --recursive https://github.com/lepbase/easy-import ei
cd ~/ei/em
sudo ./install-dependencies.sh ../conf/setup.inicd ~/ei/em
./setup-databases.sh ../conf/setup-db.inicd ~/ei/em
./update-ensembl-code.sh ../conf/setup.iniUsing core-import.ini will install a new core database for the winter moth Operophtera brumata
mkdir ~/import
cd ~/import
perl ../ei/core/summarise_files.pl ../ei/conf/core-import.inicd ~/import
perl ../ei/core/import_sequences.pl ../ei/conf/core-import.ini
perl ../ei/core/import_sequence_synonyms.pl ../ei/conf/core-import.inicd ~/import
perl ../ei/core/prepare_gff.pl ../ei/conf/core-import.inicd ~/import
perl ../ei/core/import_gene_models.pl ../ei/conf/core-import.inicd ~/import
perl ../ei/core/import_blastp.pl ../ei/conf/example.ini ../ei/conf/core-import-extra.ini
perl ../ei/core/import_repeatmasker.pl ../ei/conf/example.ini ../ei/conf/core-import-extra.ini
perl ../ei/core/import_interproscan.pl ../ei/conf/example.ini ../ei/conf/core-import-extra.ini
perl ../ei/core/import_cegma_busco.pl ../ei/conf/example.ini ../ei/conf/core-import-extra.inicd ~/import
perl ../ei/core/export_sequences.pl ../ei/conf/core-import.ini
perl ../ei/core/export_json.pl ../ei/conf/core-import.inicd ~/import
perl ../ei/core/verify_translations.pl ../ei/conf/core-import.inicd ~/import
perl ../ei/core/index_database.pl ../ei/conf/core-import.iniedit setup.ini to add operophtera_brumata_v1_core_31_84_1 to [DATA_SOURCE] SPECIES_DBS
cd ~/ei/em
./update-ensembl-code.sh ../conf/setup.inicd ~/ei/em
./reload-ensembl-site.sh ../conf/setup.ini