ML_Toolkit

Final project for CS1660 Intro to Cloud Computing

Overview

This is toolkit of containerized applications that are often used in Machine Learning and Cloud Computing. This was completed as an open ended project with the only requirments being containerizing the following applications, and deploying them in some sort of micrservice architecture:

R Studio
Spyder
Git GUI tool
Jupyter Notebook
Orange
VSCode IDE
Apache Hadoop
Apache Spark
SonarQube and SonarScanner
Tensorflow
Markdown editor

Demo Video: https://pitt-my.sharepoint.com/:v:/g/personal/ngl8_pitt_edu/EXLmBRotcDREpNDAA3XJ1PgBOeCo53Y7QkQIv6sXBGHbmA?e=aqtqa0

System considerations

This has been tested on Ubuntu 20.04. There is no dedicated GPU or CUDA support at this moment, X forwarding has only been verified working on intel integrated graphics.

Dependencies

Docker
Docker-compose
golang-docker-credential-helpers (apt package)

Usage

A fresh build takes about 20 minutes. You can either buserver - Contains a Node webserver that listens for POST requests (button presses) and connects to the relevant service to trigger an application open. ild the images yourself or pull them from my dockerhub.

Please note that the SHARED folder is mounted in each container!

Setup instructions

First things first, run this command to let Docker display the GUIs:

xhost +local:root

Add a shared directory, and optionally, add any files or repos you'd like in the containers to the SHARED dir

cd ML_Toolkit
mkdir SHARED

Either build the images yourself (this will take a while), or pull them from my dockerhub (this will be way faster).

Build yourself

docker-compose build
docker-compose up

Use Dockerhub

docker-compose -f dockerhub.yml up

Open any web browser to http://localhost/main.html, here you'll be able to select different services to run.
For subsequent runs, be sure to run docker-compose down before running docker-compose up

Regards

Tableau, Sonarscanner, Hadoop, and Spark may take an extra minute to start up
Tableau needs a license to activate it
Tableau login - username: tableau | password: tableau
Dockerhub - https://hub.docker.com/repository/docker/noah710/ml_toolkit

Implementation details

comm_base - This dir holds a listener which each services uses to listen for triggers to open the application.
server - Contains a Node webserver that listens for POST requests (button presses) and connects to the relevant service to trigger an application open.
services - Contains dirs for each service, containing at minimum a Dockerfile and open.sh script. open.sh script is called when the node container makes a request to the services container. This is the command that makes the application window open.
compose-common.yml - Contains default configuration for a GUI container. Services can extend this for less repetition.

When each service spins up, it runs the comm_base listener which listens for triggers from the web server.
When the listener hears a trigger, it calls a subprocess to run the open.sh script, which is manually configured for each service.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
comm_base		comm_base
server		server
services		services
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compose-common.yml		compose-common.yml
demo.png		demo.png
docker-compose.yml		docker-compose.yml
dockerhub.yml		dockerhub.yml
fix_docker-credential-secretservice.sh		fix_docker-credential-secretservice.sh
questions.txt		questions.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML_Toolkit

Overview

System considerations

Usage

Regards

Implementation details

About

Releases

Packages

Languages

License

noah710/ML_Toolkit

Folders and files

Latest commit

History

Repository files navigation

ML_Toolkit

Overview

System considerations

Usage

Regards

Implementation details

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages