FDxKafka

Repository for our proof of concept (PoC) of a scalable federated learning implementation using Kafka.

Report

Please take a look at our written report, which goes into more detail about our proposal and implementation.

Get Started

Install dependencies

Dependencies are stored in a requirements.txt file.

pip install -r requirements.txt

Start the server

Kafka

python fd_engine/server.py --numrounds 5 --broker 10.138.0.6:9092

gRPC

python fd_engine/server.py --grpc --numrounds 5 --broker "[::]:8080"

Usage: server.py [-h] [--broker BROKER] [--minclients MINCLIENTS]
                 [--numrounds NUMROUNDS] [--grpc]

Arguments:
  -h, --help            Show this help message and exit
  --broker BROKER       Address of server
  --minclients MINCLIENTS
                        Minimum number of clients for training
  --numrounds NUMROUNDS
                        minimum number of training rounds
  --grpc                Use gRPC as network channel. Default False
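The usage text above corresponds to an argument parser along the following lines. This is a minimal sketch reconstructed from the help output only, not the actual contents of fd_engine/server.py. The build_parser name is hypothetical, and no defaults are set for --broker, --minclients or --numrounds because the help text documents a default only for --grpc.

```python
import argparse


def build_parser():
    """Build a parser mirroring the flags shown in the server usage text."""
    parser = argparse.ArgumentParser(prog="server.py")
    parser.add_argument("--broker", help="Address of server")
    parser.add_argument("--minclients", type=int,
                        help="Minimum number of clients for training")
    parser.add_argument("--numrounds", type=int,
                        help="minimum number of training rounds")
    # store_true makes the flag default to False, as the help text states.
    parser.add_argument("--grpc", action="store_true",
                        help="Use gRPC as network channel. Default False")
    return parser


# Parse the gRPC example command line from this README.
args = build_parser().parse_args(
    ["--grpc", "--numrounds", "5", "--broker", "[::]:8080"]
)
print(args)
```

Flags that are not passed, such as --minclients here, come back as None in this sketch; the real server may apply its own defaults.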

Start training on the client

python fd_engine/client.py --broker 10.138.0.6:9092

Local testing in Docker

Start local Kafka and ZooKeeper

docker-compose -f kafka_cluster/docker-compose.yml up -d

Stop Kafka

docker-compose -f kafka_cluster/docker-compose.yml down

The broker URL will be 127.0.0.1:9091
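Before starting the server or a client against the local broker, it can help to confirm that the port is accepting connections. The following is a minimal sketch using only the Python standard library; split_broker and broker_reachable are hypothetical helper names, not part of this repository. A successful TCP connection only shows that the port is open, not that Kafka is fully ready.

```python
import socket


def split_broker(address):
    """Split a 'host:port' broker address into (host, port)."""
    host, _, port = address.rpartition(":")
    return host, int(port)


def broker_reachable(address, timeout=2.0):
    """Return True if a TCP connection to the broker address succeeds."""
    host, port = split_broker(address)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False
```

For the Docker setup above, broker_reachable("127.0.0.1:9091") should return True once the containers are up.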

Read Kafka server logs

docker logs broker -f

Evaluation

To evaluate our implementation, we use the existing gRPC channel as the baseline and run the following setup.

Spin up a server with:
1. 5 training rounds
2. A minimum of 100 devices

Spin up clients in the following way:
1. 50 clients using cloud functions
2. 50 clients using a local setup and GCP instances
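Starting many local clients by hand is tedious, so the local half of the setup can be scripted. The sketch below is one possible way to do that and is not the project's stress_test.py; client_command and launch_clients are hypothetical names, and the only thing taken from this README is the client command line itself.

```python
import subprocess
import sys


def client_command(broker):
    """Build the documented client command for the given broker address."""
    return [sys.executable, "fd_engine/client.py", "--broker", broker]


def launch_clients(broker, count):
    """Start `count` client processes and return their Popen handles."""
    return [subprocess.Popen(client_command(broker)) for _ in range(count)]
```

Run from the repository root, launch_clients("10.138.0.6:9092", 50) would start 50 client processes; calling wait() on each returned handle blocks until that client exits.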

In our evaluation, we measure the following for both the gRPC and Kafka channels:

Total # of transmitted messages
Total time to finish 5 rounds of training
Total time for model to be updated on all devices

Running evaluations

1. Start up the server as outlined above.
2. Spin up the cloud function clients (make sure you are in the evaluations directory): npm test
3. From the root directory, run python evaluations/stress_test.py. This terminal window will report the above metrics.

Results

gRPC

Total time: FL finished in 178.526
Training start time: 22:35:15,361
Training end time: 22:35:28
Total training time: 13s

Kafka

Total time: FL finished in 188.701
Training start time: 22:46:27,791
Training end time: 22:46:45,249
Total training time: 18s
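The figures above can be cross-checked with a little arithmetic. The sketch below recomputes the training durations from the logged start and end times and compares the two total times. It assumes that each start and end pair falls on the same day and that both totals are in the same unit; parse_clock and elapsed_seconds are hypothetical helper names.

```python
import math
from datetime import datetime


def parse_clock(text):
    """Parse 'HH:MM:SS' or 'HH:MM:SS,mmm' as printed in the results."""
    fmt = "%H:%M:%S,%f" if "," in text else "%H:%M:%S"
    return datetime.strptime(text, fmt)


def elapsed_seconds(start, end):
    """Seconds between two clock readings taken on the same day."""
    return (parse_clock(end) - parse_clock(start)).total_seconds()


grpc_training = elapsed_seconds("22:35:15,361", "22:35:28")
kafka_training = elapsed_seconds("22:46:27,791", "22:46:45,249")

# Relative difference between the two reported total times.
overhead = (188.701 - 178.526) / 178.526

print(f"gRPC training:  {grpc_training:.3f} s, rounded up {math.ceil(grpc_training)} s")
print(f"Kafka training: {kafka_training:.3f} s, rounded up {math.ceil(kafka_training)} s")
print(f"Kafka total time relative to gRPC: +{overhead:.1%}")
```

The timestamps give 12.639 s for gRPC and 17.458 s for Kafka, which match the reported 13s and 18s when rounded up to whole seconds. The Kafka run's total time is about 5.7% longer than the gRPC run's.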

Cloud functions

To deploy the current directory to GCP as a cloud function, run the following:

gcloud functions deploy kafkaclient --trigger-http --allow-unauthenticated --runtime python37 --entry-point handler --region us-west1

The entry point is the handler function in main.py.

Testing parameters in JSON format are as follows:

{
  "broker" : "34.105.38.178:9091",
  "client_id" : "12345",
  "channel" : "kafka"
}
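One way to send these parameters to the deployed function is an HTTP request with a JSON body. The sketch below only builds the request and does not send it. The URL is a placeholder, build_invocation is a hypothetical helper, and the use of POST with a JSON body is an assumption about how the handler reads its input.

```python
import json
import urllib.request


def build_invocation(url, broker, client_id, channel="kafka"):
    """Build a POST request carrying the testing parameters as JSON."""
    payload = {"broker": broker, "client_id": client_id, "channel": channel}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Placeholder URL: substitute the trigger URL printed by gcloud after deploying.
request = build_invocation(
    "https://example.invalid/kafkaclient", "34.105.38.178:9091", "12345"
)
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

Passing the request to urllib.request.urlopen would perform the call once a real trigger URL is filled in.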

Logs

Connect to the Kafka server through SSH

$ ssh [email protected]

Check Kafka logs

$ less /opt/bitnami/kafka/logs/server.log

Check ZooKeeper logs

$ less /opt/bitnami/zookeeper/logs/zookeeper.out
