-
Notifications
You must be signed in to change notification settings - Fork 21
Add more model extraction attacks #68
Description
Is your feature request related to a problem? Please describe.
We want every model extraction attack to be achievable in PrivacyRaven. This does not include side channel, white-box, full or partial prediction, or explanation-based attacks.
Describe the solution you'd like.
PrivacyRaven has three interfaces for attacks:
- The core interface defines each attack parameter individually.
- The specific interface runs a predefined attack configuration.
- The cohesive interface runs every possible attack.
A user should be able to run the attack in every interface; this means that all the building blocks for the attack should be contained within PrivacyRaven. For example, new synthesizers or subset selection strategies for a specific attack should be added, so that it can be applied using the core interface.
If you would like to implement an attack, comment with the name of the paper. Then, create a new issue referencing this issue with the name of the paper in the title.
Detail any additional context.
This is a list of papers describing model extraction attacks that should be added to PrivacyRaven.
- Knockoff nets: Stealing functionality of black-box models: Blocked on Add retraining and subset sampling to extraction #10
- PRADA: protecting against DNN model stealing attacks: Missing synthesizer
- CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples: Missing some synthesizers
- ACTIVETHIEF: Model Extraction Using Active Learning and Unannotated Public Data: Blocked on Add retraining and subset sampling to extraction #10
- Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack
- Special-Purpose Model Extraction Attacks: Stealing Coarse Model with Fewer Queries
- Model Extraction Attacks on Graph Neural Networks: Taxonomy and Realization
- ES Attack: Model Stealing against Deep Neural Networks without Data Hurdles
- Simulating Unknown Target Models for Query-Efficient Black-box Attacks
- Thieves on Sesame Street! Model Extraction of BERT-based APIs