Skip to content

[CLOUD'22] Learning to Dynamically Select the Optimal Scheduler in Cloud Computing Environments

License

Notifications You must be signed in to change notification settings

imperial-qore/MetaNet

Repository files navigation

License Python 3.7, 3.8 Hits

Open in gitpod

MetaNet

The operational cost of a cloud computing platform is one of the most significant Quality of Service (QoS) criteria for schedulers, crucial to keep up with the growing computational demands. Several data-driven deep neural network (DNN)-based schedulers have been proposed in recent years that outperform alternative approaches by providing scalable and effective resource management for dynamic workloads. However, state-of-the-art schedulers rely on advanced DNNs with high computational requirements, implying high scheduling costs. In non-stationary contexts, the most sophisticated schedulers may not always be required, and it may be sufficient to rely on low-cost schedulers to temporarily save operational costs. In this work, we propose MetaNet, a surrogate model that predicts the operational costs and scheduling overheads of a large number of DNN-based schedulers and chooses one on-the-fly to jointly optimize job scheduling and execution costs. This facilitates improvements in execution costs, energy usage and service level agreement violations of up to 11%, 43% and 13% compared to the state-of-the-art methods.

Quick Start Guide

Installation.

Use the below installation instructions or spin a Gitpod container with pre-installed dependencies.

# install prerequisites
sudo apt -y update && sudo apt install -y rsync python3-pip
pip3 install --upgrade pip
pip3 install matplotlib scikit-learn
pip3 install -r requirements.txt
export PATH=$PATH:~/.local/bin
sudo chmod 400 keys/id_rsa

# install Azure CLI
curl -sL https://aka.ms/InstallAzureCLIDeb | sudo bash

Running the MetaNet model

python3 metanet.py

Arxiv Link

A preliminary version of this work was accepted as a poster in ACM SIGMETRICS 2022: https://arxiv.org/pdf/2205.10640.pdf.

The conference version is available here: https://arxiv.org/pdf/2205.10642.pdf.

Cite this work

Our work is published in IEEE CLOUD Conference. Cite using the following bibtex entry.

@article{tuli2022metanet,
  author={Tuli, Shreshth and Casale, Giuliano and Jennings, Nicholas R.},
  journal={IEEE CLOUD}, 
  title={{MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments}}, 
  year={2022}
}

License

BSD-3-Clause. Copyright (c) 2022, Shreshth Tuli. All rights reserved.

See License file for more details.

About

[CLOUD'22] Learning to Dynamically Select the Optimal Scheduler in Cloud Computing Environments

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages