This repository contains the official PyTorch implementation of σ-zero: Gradient-based Optimization of ℓ0-norm Adversarial Examples.
The leftmost plot shows an instance of σ-zero’s execution on a two-dimensional problem. The initial point x (red dot) is modified via gradient descent to find the adversarial example x* (green star) while minimizing the number of perturbed features (i.e., the ℓ0 norm of the perturbation). The gray lines surrounding x demarcate the regions where the ℓ0 norm is minimized. The rightmost plot shows the adversarial images (top row) and the corresponding perturbations (bottom row) found by σ-zero during the three steps highlighted in the leftmost plot, alongside their predictions and ℓ0 norms.
- Python ≥ 3.11.*
- PyTorch ≥ 2.0.*
- torchvision ≥ 0.15.*
- RobustBench ≥ 1.1
- adversarial-library ≥ 0.2.0
- torchattacks ≥ 3.5.1
To improve the reproducibility of our experiments, we release our Anaconda environment, which contains all dependencies with their corresponding versions. It can be installed by running the following command:
```bash
conda env create -f env.yml
```
Once the environment is created, it can be activated with `conda activate sigmazero`.
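As an optional sanity check (this step is not part of the original setup instructions), you can verify that the main dependencies resolve inside the activated environment:

```python
# Quick sanity check: confirm that the core dependencies are importable.
import torch
import torchvision
import robustbench
import torchattacks

print(torch.__version__, torchvision.__version__)
```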
The code is structured as follows:
- configs/, where experimental configurations are stored.
- data/, where datasets are downloaded and stored.
- imagenet/val/, where the ImageNet validation set must be stored.
- models/, where models are downloaded and stored.
- results/, where the results of the experiments produced by the main script are stored.
- utils/, contains wrappers for models and attacks.
- datasets.py, used to load datasets.
- model.py, used to download the models to be tested.
- sigma_zero.py, contains the official implementation of the σ-zero attack.
- utilities.py, contains utility functions used to run the experiments.
- main.py, executes the experiments specified in `{args.config}` on device `{args.device}`.
To execute an experiment in which one or more attacks generate adversarial examples for a selected model, a configuration file must first be created:
```json
{
    "seed": 1233,
    "experiments": [
        {
            "attack": {
                "name": "sigma_zero",
                "params": {
                    "steps": 100
                }
            },
            "dataset": "mnist",
            "model": "smallcnn_ddn",
            "n_samples": 100,
            "batch_size": 16
        }
    ]
}
```
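The same file can list several experiments at once. As an illustration (a hypothetical sketch that simply reuses σ-zero with a different number of steps; no new attack, dataset, or model names are introduced), a multi-experiment configuration could look like:

```json
{
    "seed": 1233,
    "experiments": [
        {
            "attack": {"name": "sigma_zero", "params": {"steps": 100}},
            "dataset": "mnist",
            "model": "smallcnn_ddn",
            "n_samples": 100,
            "batch_size": 16
        },
        {
            "attack": {"name": "sigma_zero", "params": {"steps": 1000}},
            "dataset": "mnist",
            "model": "smallcnn_ddn",
            "n_samples": 100,
            "batch_size": 16
        }
    ]
}
```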
The experiments field is an array that can contain multiple attack configurations like those above. Experiments are then run by calling the main script:
```bash
python main.py --device=cpu --config=configs/config_single_attack.json
```
After the main script has been executed, a folder structure will be created inside results/, containing the results, summary statistics, and some of the resulting adversarial images.
The σ-zero attack is implemented as a function, so it can be called directly as follows:

```python
from sigma_zero import sigma_zero

adv_samples = sigma_zero(model=model, inputs=inputs, labels=labels)
```
with required parameters:

- `model`: the model that produces logits for inputs in $[0, 1]$;
- `inputs`: the samples to attack, with values in $[0, 1]$;
- `labels`: the ground-truth labels of the samples;

and optional parameters:

- `steps`: number of iterations performed by the attack;
- `lr`: learning rate of the optimizer;
- `sigma`: $\sigma$ parameter of the $\ell_0$-norm approximation;
- `threshold`: initial value for the dynamic thresholding;
- `verbose`: flag used to display information during the optimization process;
- `epsilon_budget`: threshold for the early-stopping mechanism, which stops the optimization of an adversarial example once one with a perturbation budget lower than $\epsilon$ is found;
- `grad_norm`: norm used to normalize the gradients.
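For a fuller picture, here is a hypothetical end-to-end sketch (the model path, batch size, and parameter values are illustrative placeholders; any PyTorch classifier producing logits for inputs in $[0, 1]$ would work):

```python
import torch
from torchvision import datasets, transforms

from sigma_zero import sigma_zero

# Hypothetical setup: load a pre-trained classifier (placeholder path).
device = "cuda" if torch.cuda.is_available() else "cpu"
model = torch.load("models/smallcnn_ddn.pt", map_location=device)
model.eval().to(device)

# Load a small batch of MNIST test samples (ToTensor keeps inputs in [0, 1]).
test_set = datasets.MNIST("data/", train=False, download=True,
                          transform=transforms.ToTensor())
loader = torch.utils.data.DataLoader(test_set, batch_size=16, shuffle=False)
inputs, labels = next(iter(loader))
inputs, labels = inputs.to(device), labels.to(device)

# Run the attack, setting a few of the optional parameters explicitly.
adv_samples = sigma_zero(model=model, inputs=inputs, labels=labels,
                         steps=100, verbose=True)

# Count perturbed features per sample, i.e., the l0 norm of each perturbation.
l0_norms = (adv_samples - inputs).flatten(1).ne(0).sum(dim=1)
print(f"median l0 norm: {l0_norms.median().item()}")
```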
The authors would like to thank the contributors of adversarial-library, RobustBench, and Torchattacks for facilitating the development of this project.