
Continual Forgetting for Pre-trained Vision Models (CVPR2024)


This is the official implementation of GS-LoRA (CVPR 2024). GS-LoRA is an effective, parameter-efficient, data-efficient, and easy-to-implement approach to continual forgetting, where selected information is continuously removed from a pre-trained model while the rest is maintained. The core idea is to combine LoRA with group Lasso to achieve fast model editing (a minimal illustrative sketch is given after the references below). For more details, please refer to:

Continual Forgetting for Pre-trained Vision Models [paper] [video] [video on Bilibili]

Hongbo Zhao, Bolin Ni, Junsong Fan, Yuxi Wang, Yuntao Chen, Gaofeng Meng, Zhaoxiang Zhang

Practical Continual Forgetting for Pre-trained Vision Models (an extension of this paper, in which we consider more practical scenarios, including few-shot and missing-class settings) [paper]

Hongbo Zhao, Fei Zhu, Bolin Ni, Feng Zhu, Gaofeng Meng, Zhaoxiang Zhang
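As background, the sketch below shows a minimal LoRA-augmented linear layer in PyTorch: the pre-trained weight stays frozen and only a low-rank update is trained. It is an illustration under common LoRA conventions, not the code of this repository; the class name, rank, and scaling factor are assumptions.

import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    # Illustrative only: a frozen pre-trained linear layer plus a trainable low-rank update.
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # the pre-trained weights stay frozen
            p.requires_grad = False
        self.lora_A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, rank))  # zero init: no change at start
        self.scaling = alpha / rank

    def forward(self, x):
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling

In GS-LoRA, such modules sit in the FFN linear layers of the Transformer blocks, and only these low-rank parameters are optimized while the backbone stays fixed.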

Method


We incorporate a set of LoRA modules for each continual forgetting task and propose a sparse structure selection strategy and a prototype regularization to achieve accurate and minimal modifications. (Left) LoRA modules are added to the linear layers of the FFN in each Transformer block, and we regard the LoRA modules within one Transformer block as one group. We use a group sparse regularization ($\mathcal{L}_{structure}$) to automatically select LoRA groups: the purple groups are selected for modification, while the white groups are neglected. The pre-trained model (the Transformer blocks and all other parts) is frozen; only the LoRA groups are trainable. (Right) To achieve selective forgetting, we use a selective forgetting and knowledge retention loss ($\mathcal{L}_{data}$). To further extend our method to more practical scenarios, we introduce a prototype regularization $\mathcal{L}_{pro}$: we use the original model to compute a prototype for each class, then push the logits of each forgotten class away from its original prototype and pull the logits of each remaining class toward its own prototype.
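The sketch below is a rough PyTorch illustration (not the repository's implementation) of the two regularizers described above: a group-Lasso penalty over the LoRA parameters of each Transformer block ($\mathcal{L}_{structure}$) and a prototype push/pull term ($\mathcal{L}_{pro}$). The grouping convention, the hinge-with-margin form of the push term, and all names and coefficients are assumptions.

import torch

def group_lasso_reg(lora_groups):
    # lora_groups: a list of parameter lists, one list per Transformer block (one LoRA group).
    # Group Lasso: sum over groups of the L2 norm of all parameters in that group,
    # which drives whole groups toward exactly zero so that unneeded blocks are left unmodified.
    reg = 0.0
    for group in lora_groups:
        flat = torch.cat([p.reshape(-1) for p in group])
        reg = reg + flat.norm(p=2)
    return reg

def prototype_reg(logits, labels, prototypes, forget_ids, margin=10.0):
    # prototypes: per-class prototypes computed once with the original (frozen) model.
    # forget_ids: 1-D tensor of class ids to be forgotten.
    own_proto = prototypes[labels]                     # prototype of each sample's own class
    dist = (logits - own_proto).norm(dim=1)
    is_forget = torch.isin(labels, forget_ids)
    zero = logits.new_zeros(())
    # Forgotten classes: push logits away from their original prototype (up to a margin).
    loss_forget = torch.clamp(margin - dist[is_forget], min=0).mean() if is_forget.any() else zero
    # Remaining classes: pull logits toward their own prototype.
    loss_retain = dist[~is_forget].mean() if (~is_forget).any() else zero
    return loss_forget + loss_retain

# The total objective would combine these with the data term, e.g.
# loss = L_data + alpha * group_lasso_reg(groups) + beta * prototype_reg(logits, labels, protos, forget_ids)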

Experimental results

Single-step Forgetting


Continual Forgetting

a. Continual forgetting results for face recognition


b. Continual forgetting results on ImageNet100


Practical forgetting (missing class settings)


Getting Started

Installation

a. Clone this repository

git clone https://github.com/bjzhb666/GS-LoRA.git
cd GS-LoRA

b. Install the environment

conda create -n GSlora python=3.9
conda activate GSlora
pip install -r requirements.txt

c. Prepare the datasets

c.1 CASIA-100 dataset

mkdir data
cd data

Download our CASIA-100 from https://drive.google.com/file/d/16CaYf45UsHPff1smxkCXaHE8tZo4wTPv/view?usp=sharing, put the downloaded data.zip into the data folder, and unzip it:

unzip data.zip

Note: CASIA-100 is a subset of CASIA-WebFace. The train/test split is already provided in our Google Drive.

c.2 ImageNet100 dataset

We obtain the ImageNet100 dataset from ImageNet100.

c.3 Final File structure
.
├── faces_webface_112x112_sub100_train_test
│   ├── test
│   └── train
├── imagenet100
│   ├── imagenet_classes.txt
│   ├── test
│   └── train
└── Labels.json
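
As a quick sanity check of this layout (not the repository's data pipeline), folders organized like this can be read with torchvision's ImageFolder; the paths, image size, and batch size below are assumptions based on the structure shown above.

from torchvision import datasets, transforms
from torch.utils.data import DataLoader

# One sub-folder per identity/class, as in the tree above.
tf = transforms.Compose([
    transforms.Resize((112, 112)),  # the CASIA-100 crops are 112x112
    transforms.ToTensor(),
])
train_set = datasets.ImageFolder("data/faces_webface_112x112_sub100_train_test/train", transform=tf)
test_set = datasets.ImageFolder("data/faces_webface_112x112_sub100_train_test/test", transform=tf)
train_loader = DataLoader(train_set, batch_size=128, shuffle=True, num_workers=4)
print(len(train_set.classes), "identities,", len(train_set), "training images")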

Pretrain a Face Transformer

mkdir result
cd result

You can use our pre-trained Face Transformer directly. Download the pre-trained weights and put them into the result folder.

Or you can train your own pre-trained models.

Your result folder should look like this:

result
└── ViT-P8S8_casia100_cosface_s1-1200-150de-depth6new
    ├── Backbone_VIT_Epoch_1185_Batch_45020_Time_2024-09-26-03-26_checkpoint.pth
    └── config.txt
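
If you only want to verify that the download is intact, the checkpoint can be opened with torch.load as a quick sanity check (this is not how the training scripts load the model, and the expected contents are an assumption):

import torch

ckpt = torch.load(
    "result/ViT-P8S8_casia100_cosface_s1-1200-150de-depth6new/"
    "Backbone_VIT_Epoch_1185_Batch_45020_Time_2024-09-26-03-26_checkpoint.pth",
    map_location="cpu",
)
# Most likely a state_dict; print a few keys to confirm the file loaded correctly.
state = ckpt if isinstance(ckpt, dict) else ckpt.state_dict()
print(len(state), "entries;", list(state)[:3])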

The pre-training code is in run_sub.sh:

bash scripts/run_sub.sh

test_sub.sh is the test script for our Face Transformer. You can use it to test the pre-trained model:

bash scripts/test_sub.sh

Continual Forgetting and Single Step Forgetting

a. Continual Forgetting and Few-shot Continual Forgetting

We provide the code for all baselines and for our methods GS-LoRA and GS-LoRA++ mentioned in our paper.

run_cl_forget.sh covers LIRF, SCRUB, SCRUB-S, EWC, MAS, L2, DER, DER++, FDR, LwF, Retrain, GS-LoRA, and GS-LoRA++ (Main Table).

b. Single-step forgetting and Few-shot single-step forgetting

For baseline methods, you can still use run_cl_forget.sh and change --num_tasks to 1. (There are examples in run_cl_forget.sh.)

For single-step GS-LoRA, we recommend using run_forget.sh.

c. Continual Forgetting on ImageNet100

For continual forgetting on ImageNet100 dataset, please use run_cl_forget_image.sh.

d. Missing class forgetting

Please use run_cl_forget_open.sh.

Citation

If you find this project useful in your research, please consider citing:

@article{zhao2024continual,
  title={Continual Forgetting for Pre-trained Vision Models},
  author={Zhao, Hongbo and Ni, Bolin and Wang, Haochen and Fan, Junsong and Zhu, Fei and Wang, Yuxi and Chen, Yuntao and Meng, Gaofeng and Zhang, Zhaoxiang},
  journal={arXiv preprint arXiv:2403.11530},
  year={2024}
}

Contact

Please contact us or post an issue if you have any questions.

Acknowledgement

This work is built upon zhongyy/Face-Transformer (Face Transformer for Recognition).

License

This project is released under the MIT License.