The project is based on the code of SAFEFL and QMP-SPDZ and follows their general structure.
The main branch is an unchanged copy of SAFEFL.
The dev branch contains the implementation of the QFL framework, which includes the following changes:
- Integration of the QSMC module, in which three advanced quantum communication technologies (QRNG, QKD, and QOKD) are deployed along with a Key Management System (KMS) to enhance security.
- Integration of a GCN to predict the aqueous solubility of drug molecules using the ESOL dataset.
The design of the QFL framework is described in the paper Quantum-Secured Federated Learning for Solubility Prediction in Drug Molecules.
- Clone the QFL repository and check out the dev branch:
git clone https://github.com/Quantum-SMC/QFL.git
cd QFL
git checkout dev
- Run the qsmc_install.sh script to install the QSMC module:
./qsmc_install.sh
- Clone the KMS repository and build it as follows:
sudo apt install cmake
sudo apt install libexplain-dev
git clone https://github.com/diogoftm/minimal-etsi-qkd-004.git
cd minimal-etsi-qkd-004
cd etsi-gs-qkd-004-c
mkdir cmake-build-dir
cd cmake-build-dir
cmake ..
make
Generate self-signed certificates for the server. Any TLS server needs certificate files (a private key and a certificate):
# Generate CA
CA_NAME_=your_name_CA # CA name
openssl req -x509 -nodes -newkey rsa:4096 -sha256 -days 3650 -keyout $CA_NAME_.key -out $CA_NAME_.pem -subj "/CN=$CA_NAME_"
# Generate certificate
CN_=127.0.0.1 #localhost
openssl req -new -newkey rsa:4096 -nodes -keyout $CN_.key -out $CN_.csr -subj "/CN=$CN_" -addext "subjectAltName=IP:$CN_" || openssl req -new -newkey rsa:4096 -nodes -keyout $CN_.key -out $CN_.csr -subj "/CN=$CN_" -addext "subjectAltName=DNS:$CN_"
# Sign certificate with the CA
openssl x509 -req -in $CN_.csr -CA $CA_NAME_.pem -CAkey $CA_NAME_.key -CAcreateserial -out $CN_.pem -days 3650 -sha256
# Check certificate
openssl x509 -in $CN_.pem -text -noout # openssl x509 -in file.pem -enddate -noout
Run server_example and keep the terminal open, as a client (QSMC) will connect to it:
./server_example
Go to the ssl folder and run the following:
cd ..
cd ssl
./generate_ca_and_selfsigned_cert.sh
After generating the certificates inside the ssl directory, you need to provide the paths of these files in ENV.env during the next step.
- Set the environment variables in ENV.env by providing the paths of the certificates. The ENV.env file should look as follows:
# Make sure to update all the variables to suit your setup
export KEY_REQUEST_INTERFACE='004'
# Both for ETSI 004
export KMS_URI='127.0.0.1:25575'
export SENDER_SAE_CRT='./ssl/127.0.0.1.pem'
export SENDER_SAE_KEY='./ssl/127.0.0.1.key'
export RECEIVER_SAE_CRT='./ssl/127.0.0.1.pem'
export RECEIVER_SAE_KEY='./ssl/127.0.0.1.key'
# Extra for ETSI 004
export SENDER_SAE_ID='qkd//app1@aaaaaaaa-aaaa-aaaa-aaaa-aaaaaaaaaaaa'
export RECEIVER_SAE_ID='qkd//app2@bbbbbbbb-bbbb-bbbb-bbbb-bbbbbbbbbbbb'
- Now, set the environment variables by running:
source ENV.env
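To verify that the variables are visible to child processes, a quick check can be run from the same shell. This is a minimal sketch; the variable names are taken from the ENV.env example above:

```python
import os

# Print each variable set in ENV.env; "<unset>" indicates that
# `source ENV.env` was not run in this shell session.
for name in ("KEY_REQUEST_INTERFACE", "KMS_URI",
             "SENDER_SAE_CRT", "SENDER_SAE_KEY",
             "RECEIVER_SAE_CRT", "RECEIVER_SAE_KEY",
             "SENDER_SAE_ID", "RECEIVER_SAE_ID"):
    print(f"{name} = {os.environ.get(name, '<unset>')}")
```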
- Build the QSMC module:
cd qsmc
make -j 8 tldr # This command will rename OTKeys_004/ to OTKeys.
make -j 8 mascot-party.x
- Create an IP file called players.txt. In this file, the IP address, base port, and KMS application SAE ID of each party need to be defined (a parsing sketch follows the example). For example:
# players.txt
127.0.0.1:1234 qkd//app1@aaaaaaaa-aaaa-aaaa-aaaa-aaaaaaaaaaaa - -
127.0.0.1:1238 qkd//app2@bbbbbbbb-bbbb-bbbb-bbbb-bbbbbbbbbbbb 550e8400-e29b-41d4-a716-446655440000 2
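For reference, a small parser illustrating how the file can be read. This is a sketch only; interpreting the third and fourth fields as a key-stream ID and an index, with '-' marking an unused field, is our assumption:

```python
# Sketch: parse players.txt; field meanings beyond IP:port and the SAE ID
# are assumptions, not confirmed by the project.
with open("players.txt") as fh:
    for line in fh:
        line = line.split("#")[0].strip()  # drop comments and blank lines
        if not line:
            continue
        endpoint, sae_id, stream_id, index = line.split()
        host, base_port = endpoint.split(":")
        print(host, base_port, sae_id, stream_id, index)
```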
- Download the ESOL dataset into the QFL directory.
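As an illustration only, assuming the project consumes the standard Delaney/ESOL CSV; the actual loading script and source URL may differ:

```python
import urllib.request

# Hypothetical download of the Delaney (ESOL) dataset from the
# DeepChem repository; adjust the URL/filename to your setup.
url = ("https://raw.githubusercontent.com/deepchem/deepchem/"
       "master/datasets/delaney-processed.csv")
urllib.request.urlretrieve(url, "delaney-processed.csv")
```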
- Finally, run main.py to start the training process:
python main.py
QFL requires the following packages:
- Perl
- GMP
- MPIR
- OpenSSL
- Boost
- Python
- NumPy
- Pandas
- PyTorch
- Torchvision
- Torch Geometric
- Matplotlib
- Scikit-Learn
- HDBSCAN
- RDKit
The PyTorch module in QFL is based on the code of SAFEFL from the corresponding paper.
The QSMC module in QFL is based on the code of QMP-SPDZ.
The GCN model is based on the code of GCN.
This project implements several federated learning aggregation rules and attacks. We added support for linear regression on the HAR dataset.
Additionally, we implemented FLTrust and FedAvg in the MP-SPDZ Multi-Party Computation Framework.
The project is based on code by the authors of FLTrust and follows their general structure. The original code is available here and uses the machine learning framework MXNet. We adapted the existing code to use PyTorch and extended it.
The following aggregation rules have been implemented:
- FedAvg
- Krum
- Trimmed mean
- Median
- FLTrust
- FLAME
- FLOD
- ShieldFL
- DnC
- FoolsGold
- CONTRA
- FLARE
- Romoa
- SignGuard
All aggregation rules are located in aggregation_rules.py as individual functions and operate on the local gradients rather than on the actual local models. Working with the gradients is equivalent to working with the models as long as the global model is known: for a plain gradient-descent step with learning rate η, each local model can be recovered from its gradient as w_i = w_global − η·g_i, and vice versa. All aggregation rules that normally work on the local models have therefore been modified to work on the local gradients instead.
To add an aggregation rule, add its implementation to aggregation_rules.py. To actually use it during training, you must also add a case for it in the main function of main.py; this case calls the aggregation rule and must return the aggregated gradients. A minimal sketch is shown below.
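A minimal sketch of such a function. The name my_aggregation and the parameter list are illustrative; match the signatures already used in aggregation_rules.py:

```python
import torch

def my_aggregation(gradients, net, lr, f, device):
    """Illustrative aggregation rule: plain coordinate-wise averaging.

    gradients: list of flattened local gradient tensors, one per client.
    net, lr, f, device mirror the parameters other rules receive (assumed).
    """
    stacked = torch.stack([g.squeeze() for g in gradients], dim=0)
    return torch.mean(stacked, dim=0)  # aggregated gradient returned to main.py
```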
To evaluate the robustness of the aggregation rules, we also added several attacks.
The implementation of the attacks are all located in attacks.py as individual functions.
To add a new attack, simply add its implementation as a new function in this file. For attacks that are called during aggregation, the function signature must follow the same format as the other attacks, because the attack call in the training process is resolved only at runtime. The attack name must also be added to the get_byz function in main.py. Attacks that only manipulate training data just need to be called before training starts and don't need a specific signature. A skeleton is sketched below.
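A skeleton of an attack called during aggregation. The signature is illustrative; copy the exact one from the existing functions in attacks.py:

```python
def my_attack(v, net, lr, f, device):
    """Illustrative gradient-manipulation attack: sign flip.

    v: list of local gradients; treating the first f entries as belonging
    to the byzantine clients is an assumption made for this sketch.
    """
    for i in range(f):
        v[i] = -v[i]  # malicious clients submit inverted gradients
    return v
```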
We implemented a multiclass linear regression classifier.
The model is in a separate file in the models folder of this project.
To add a model, add a new file containing a class that defines the classifier. Additionally, the get_net function in main.py needs to be expanded to enable selection of the model; see the sketch below.
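A minimal sketch of what such a model file and the get_net extension could look like. The class name, dimensions, and the "linear_regression" case are illustrative:

```python
import torch.nn as nn

class LinearRegressionModel(nn.Module):
    """Multiclass linear model: a single fully connected layer."""
    def __init__(self, num_inputs, num_outputs):
        super().__init__()
        self.linear = nn.Linear(num_inputs, num_outputs)

    def forward(self, x):
        return self.linear(x)

# In main.py, get_net would gain a corresponding case (sketch):
# if net_type == "linear_regression":
#     net = LinearRegressionModel(num_inputs, num_outputs)
```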
We implemented support for the HAR dataset; as it is not provided by PyTorch by default, it must be downloaded with the provided loading script in the data folder.
Adding a new dataset requires adding the loading to the load_data function in data_loading.py. This can either be done by using an existing dataloader from PyTorch or may require custom data loading, as in the case of the HAR dataset. Additionally, the size of the data examples and the number of classes need to be added to the get_shapes function to properly configure the model, and the assign_data function needs to be extended to enable assigning the test and train data to the individual clients. Should the evaluation require running the new dataset with the scaling attack, which adds backdoor trigger patterns to the data examples, the following functions also need to be extended (a sketch of the dataset hooks follows the list):
- scaling_attack_insert_backdoor
- add_backdoor
Both of these are located in attacks.py.
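A sketch of the dataset hooks described above. "mydataset", the shapes, and the random tensors are placeholders standing in for real loading code:

```python
import torch
from torch.utils.data import TensorDataset, DataLoader

def load_mydataset(batch_size=32):
    # Placeholder data; real code would read files, as the HAR loader does.
    x = torch.randn(1000, 20)            # 1000 examples, 20 features
    y = torch.randint(0, 5, (1000,))     # 5 classes
    train = TensorDataset(x[:800], y[:800])
    test = TensorDataset(x[800:], y[800:])
    return (DataLoader(train, batch_size=batch_size, shuffle=True),
            DataLoader(test, batch_size=batch_size))

def get_shapes(dataset):
    # Register the example shape and class count of the new dataset (sketch).
    if dataset == "mydataset":
        return (20,), 5
```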
To run the MPC implementation, the code for MP-SPDZ needs to be downloaded separately using the installation script mpc_install.sh. The following protocols are supported:
- Semi2k uses 2 or more parties in a semi-honest, dishonest majority setting
- SPDZ2k uses 2 or more parties in a malicious, dishonest majority setting
- Replicated2k uses 3 parties in a semi-honest, honest majority setting
- PsReplicated2k uses 3 parties in a malicious, honest majority setting
The project can simply be cloned from git and then requires downloading the HAR dataset as described in the dataset section.
The project takes multiple command-line arguments that determine the training parameters, the attack, the aggregation rule, etc. If no arguments are provided, the project runs with the default arguments. A description of all arguments can be displayed by executing:
python main.py -h
The project requires the following packages to be installed:
- Python 3.8.13
- PyTorch 1.11.0
- Torchvision 0.12.0
- NumPy 1.21.5
- Matplotlib 3.5.1
- HDBSCAN 0.8.28
- Perl 5.26.2
All requirements can be found in the requirements.txt.
This project is based on code by Cao et al., the authors of FLTrust; the original code is available here.
We thank the authors of Romoa for providing an implementation of their aggregation.
We used the open-sourced implementations of the Min-Max and Min-Sum attacks.
For the implementation of FLAME we used the scikit-learn implementation of HDBSCAN by McInnes et al.
The MPC Framework MP-SPDZ was created by Marcel Keller.