Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature (NeurIPS '24 Spotlight)

This is the official implementation of the paper "Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature", accepted at NeurIPS 2024 (Spotlight). If you use this repository, please consider citing our work:

@inproceedings{
    ravikumar2024curvature,
    title={Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature},
    author={Ravikumar, Deepak and Soufleri, Efstathia and Roy, Kaushik},
    booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
    year={2024},
    url={https://openreview.net/forum?id=ZEVDMQ6Mu5}
}

You can find our paper here.

Environment

This code has been tested and validated with Python 3.9 and 3.11. To replicate our environment, use the provided environment.yml file by running

conda env update -n py3.9_curv_clues --file environment.yml

Code Flow

1. Training Shadow Models: The first step is to train shadow models; the code is in the ./train directory.
2. Compute Scores: Next, we compute the scores for various membership inference attack (MIA) methods; the code is in the ./precompute_scores directory. It uses Azure blob storage to save the scores, so modifications may be needed to save them locally.
3. Results: Finally, we fetch the precomputed scores to produce the results. The code for this step is in the notebooks (*.ipynb files) in the root directory.

Note: we have released the pretrained shadow models and the scores, so you can skip steps 1 and 2 by downloading the precomputed scores from our project page here.

Training Shadow Models

Setup

Create the folders ./pretrained/<dataset name> and ./pretrained/<dataset name>/temp, e.g. for CIFAR100:

mkdir pretrained
mkdir pretrained/cifar100
mkdir pretrained/cifar100/temp

and copy the training file for the dataset in question to the root directory.
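The folder setup above can also be scripted. The following is a minimal sketch using only the Python standard library; the helper name `make_pretrained_dirs` and its `root` parameter are hypothetical and not part of this repository.

```python
from pathlib import Path


def make_pretrained_dirs(dataset_name, root="."):
    """Create <root>/pretrained/<dataset name>/temp and return the temp path.

    Hypothetical helper: it only mirrors the mkdir commands shown above.
    """
    temp_dir = Path(root) / "pretrained" / dataset_name / "temp"
    # parents=True also creates pretrained/ and pretrained/<dataset name>/;
    # exist_ok=True makes the call safe to repeat.
    temp_dir.mkdir(parents=True, exist_ok=True)
    return temp_dir
```

Calling `make_pretrained_dirs("cifar100")` from the repository root should produce the same three folders as the commands above.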

Training CIFAR10 shadow models

To train CIFAR10 shadow models, set the data_dir variable in ./scripts/train_cifar10_shadow_models.sh and run

sh train_cifar10_shadow_models.sh

Training CIFAR100 shadow models

To train CIFAR100 shadow models, set the data_dir variable in ./scripts/train_cifar100_shadow_models.sh and run

sh train_cifar100_shadow_models.sh

ImageNet shadow models

For ImageNet, we use the pre-trained models from Feldman and Zhang (see the Feldman and Zhang ImageNet ResNet50 redirect link). We provide code to convert these models to PyTorch in the ./train/imagenet_shadow_models directory. Set the path to ImageNet in libdata/indexed_tfrecords.py.

1. Download imagenet_index.npz from Feldman and Zhang and place it in build_fz_imagenet/.
2. Use build_imagenet.py in the build_fz_imagenet directory to convert the dataset to TFRecord format.
3. Place the datasets in the following directory structure
4. Download the models.
5. Copy the files from ./train/imagenet_shadow_models to the root directory and run

python convert_imagenet_models_tf_2_torch.py --model_dir <path to where models were downloaded from Feldman and Zhang>

The train directory also contains code to train differentially private (DP) models (train/train_dp.py), models on random subsets (train/train_cifar100_random_samples.py), and models on curvature subsets (train/train_cifar100_low_curv_samples.py) of CIFAR100.

Compute Scores

CIFAR100

To calculate the scores on CIFAR100, set the data_dir variable in ./scripts/precompute_cifar100_scores.sh, copy the files to the root directory, and run

sh precompute_cifar100_scores.sh

CIFAR10

To calculate the scores on CIFAR10, set the data_dir variable in ./scripts/precompute_cifar10_scores.sh, copy the files to the root directory, and run

sh precompute_cifar10_scores.sh

ImageNet

To calculate the scores on ImageNet, copy the files to the root directory and run

sh precompute_imagenet_scores.sh

Reproducing our Results

Setup

To reproduce our results we have provided the assets here.

Extract "precomputed scores" so that it has the following structure, and set "precomputed_scores_dir" in config.json to <path_to_score>.

<path_to_score>/precomputed_scores/
├── cifar10
├── cifar100
├── cifar100_dp
├── cifar100_random
├── cifar100_top
└── imagenet
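Before running the notebooks, it may help to confirm that the extracted folder matches the tree shown above. The following is a minimal sketch; the helper name `missing_score_dirs` is hypothetical, and it only checks that the six subfolders exist, not what they contain.

```python
from pathlib import Path

# Subfolder names copied from the tree shown above.
EXPECTED_SCORE_DIRS = (
    "cifar10",
    "cifar100",
    "cifar100_dp",
    "cifar100_random",
    "cifar100_top",
    "imagenet",
)


def missing_score_dirs(path_to_score):
    """Return the expected subfolders absent from <path_to_score>/precomputed_scores/."""
    base = Path(path_to_score) / "precomputed_scores"
    return [name for name in EXPECTED_SCORE_DIRS if not (base / name).is_dir()]
```

An empty list means every expected subfolder was found under `<path_to_score>/precomputed_scores/`.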

Download the CIFAR subset indices to <path_to_subset_indices> and set "subset_idxs_dir" in config.json to that path.

For ImageNet results, download the ImageNet models from Feldman and Zhang (Feldman and Zhang ImageNet ResNet50 redirect link) and set "imagenet_models_dir" in config.json to <path_to_fz_imagenet_models>. The extracted models should follow the folder structure below.

<path_to_fz_imagenet_models>
└── imagenet-resnet50
    └── 0.7
        └── 0
        .   ├── aux_arrays.npz
        .   └── checkpoints
        .
        .
        └── 999
            ├── aux_arrays.npz
            └── checkpoints

Download imagenet_index.npz from Feldman and Zhang's website and set "imagenet_index_dir" in config.json to <path_to_imagenet_index.npz>.
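The four config.json entries named in this section ("precomputed_scores_dir", "subset_idxs_dir", "imagenet_models_dir" and "imagenet_index_dir") can be edited by hand or set from a script. The following is a minimal sketch, assuming config.json is a flat JSON object; the helper name `update_config` is hypothetical, and any keys already in the file are left untouched.

```python
import json
from pathlib import Path


def update_config(config_path, **paths):
    """Set the given keys in a JSON config file, keeping all other keys.

    Hypothetical helper: assumes the file holds a single flat JSON object.
    """
    config_path = Path(config_path)
    # Start from the existing file so unrelated settings are preserved.
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    config.update({key: str(value) for key, value in paths.items()})
    config_path.write_text(json.dumps(config, indent=4) + "\n")
    return config
```

For example, `update_config("config.json", precomputed_scores_dir="<path_to_score>")` sets only that key; pass the other three keys the same way.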

The files and what they implement are described below.

File	Description
`conditonal_mia_aug_cifar10.ipynb`	Provides the MIA results for the various methods and reproduces the results from Table 1 for CIFAR10
`conditonal_mia_aug_cifar100.ipynb`	Provides the MIA results for the various methods and reproduces the results from Table 1 for CIFAR100
`conditonal_mia_aug_imagenet.ipynb`	Provides the MIA results for the various methods and reproduces the results from Table 1 for ImageNet
`conditonal_mia_aug_v_m_random.ipynb`	Provides the results for the experiments in the `Effect of Dataset Size` section of the paper when models are trained on random subsets
`conditonal_mia_aug_v_m_top.ipynb`	Provides the results for the experiments in the `Effect of Dataset Size` section of the paper when models are trained on the most memorized subsets according to Feldman and Zhang
`conditonal_mia_aug_dp.ipynb`	Provides the results for the experiments in the `Effect of Privacy` section of the paper

For ease of reproducibility, we have released the pretrained shadow models and the precomputed scores used by the .ipynb files on our project page, linked here. Extract the precomputed scores and set their locations in config.json.
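To illustrate how per-sample attack scores turn into the kind of numbers an MIA evaluation reports, the sketch below computes an ROC curve, its area (AUROC), and the true positive rate at a fixed false positive rate. These are metrics commonly used in membership inference work; whether the notebooks compute exactly these quantities is an assumption. The function names are hypothetical, and the sketch assumes that a higher score means "more likely a training member" and that both members and non-members are present.

```python
def roc_points(scores, is_member):
    """Return (fpr, tpr) pairs obtained by lowering the decision threshold.

    scores: per-sample attack scores (assumed: higher = more likely a member).
    is_member: matching 0/1 (or bool) ground-truth membership labels.
    """
    n_pos = sum(1 for m in is_member if m)
    n_neg = len(is_member) - n_pos
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(order):
        # Samples with equal scores share a threshold, so count them together.
        j = i
        while j < len(order) and scores[order[j]] == scores[order[i]]:
            if is_member[order[j]]:
                tp += 1
            else:
                fp += 1
            j += 1
        points.append((fp / n_neg, tp / n_pos))
        i = j
    return points


def auroc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


def tpr_at_fpr(points, max_fpr):
    """Largest TPR among thresholds whose FPR does not exceed max_fpr."""
    return max(tpr for fpr, tpr in points if fpr <= max_fpr)
```

The pure-Python loops keep the sketch dependency-free; for large score arrays a vectorized implementation would be the more practical choice.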
