Getting started

Requirements

Docker - 20.10.10
Determined AI - 0.17.2
Polyxon CE - 1.11.2
RAM - 64 GB
CPU - 16 cores
Nvidia GPU with support for atleast CUDA 10.0 and atleast 16 GB memory (RTX A4000)

Folder Structure


`src/data`	Source code of the scripts used to generate data for experiments.
`src/models`	Source code of the unsupervised models described in the study.
`src/experiments`	Source code of the supervised models described in the study.
`configs`	Determined configuration files for running experiments on the determined cluster.
`polyaxon_configs`	Polyaxon configuration files for running experiments on the k8s cluster.
`main.py`	Entry point script used to run scripts locally.

Getting started.

Download and extract the data needed to run experiments

# download the data
wget https://research.bioinformatics.udel.edu/iptmnet_data/downloads/ikg_v2_data.tar.gz

# extract to /data/ml_data/ikg_v2_data
tar -xf ikg_v2_data.tar.gz r -C /data/ml_data/ikg_v2_data

Before starting the experiments, you need to build the docker containers.

# change to the docker directory
cd docker

# build the docker container
bash build.sh

Start the docker container and open an interactive shell into it.

# start the container
docker-compose up -d

# start an interactive shell into the container
docker exec -it ikg-dev /bin/bash

Generate an embedding using triple walk skip gram algorithm

det experiment create configs/triples_walk_embedder_const_mul_seeds.yaml .

Once the training is complete, retrieve the embeddings from checkpoint folder. The embeddings will be in the form protein_head_embeddings_{fold_number}.csv and protein_tail_embeddings_{fold_number}.csv.
To generate predictions use the polyaxon configs to run the prediction tasks on the kubernetes cluster.

# run the prediction task on the kubernetes cluster
polyaxon run -p ikg_v2 -f ./polyaxon_configs/make_predictions.yml -u

# run the prediction task locally
export POLYAXON_NO_OP=1
python src/main.py --c "make_predictions"

# After the task completes, check for a file named `predicted_edges.csv` in the root folder of this project.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
configs		configs
csrc		csrc
docker		docker
libs/Eigen		libs/Eigen
polyaxon_configs		polyaxon_configs
src		src
.detignore		.detignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.polyaxonignore		.polyaxonignore
LICENSE		LICENSE
README.md		README.md
build_ext.sh		build_ext.sh
clean_ray.sh		clean_ray.sh
data_hash.txt		data_hash.txt
docker-compose.yml		docker-compose.yml
environment.yml		environment.yml
freeze.sh		freeze.sh
ikg_native.cpython-37m-x86_64-linux-gnu.so		ikg_native.cpython-37m-x86_64-linux-gnu.so
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
run_docker.sh		run_docker.sh
setup.py		setup.py
tensorboard_clean.sh		tensorboard_clean.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Getting started

Requirements

Folder Structure

Getting started.

Data and predictions.

About

Releases 1

Packages

Languages

License

udel-cbcb/ikg_v2_public

Folders and files

Latest commit

History

Repository files navigation

Getting started

Requirements

Folder Structure

Getting started.

Data and predictions.

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages