DM-CLIP

DM-CLIP: Knowledge Distillation from Transformer to Mamba for Efficient CLIP

Abstract

This study addresses the challenge of deploying Contrastive Learning-based CLIP models, which learn the relationship between images and text, in resource-constrained environments due to their high computational complexity and large model size. To overcome this, we propose an approach that enhances the performance of Mamba-based image encoders by applying Knowledge Distillation from Transformer-based ViT models. Experimental results show that the Mamba-based encoder reduces image encoder latency by 49.58% and overall model latency by 40.82%, with only a 0.12% performance loss. Additionally, it demonstrates 6.6% and 19.4% improvements on the SVHN and EuroSAT datasets, respectively, showcasing strengths in sequential pattern processing and high-resolution spatial information learning. This study validates that the lightweight CLIP encoder can be effectively utilized in mobile and edge device environments and suggests future research directions for developing Mamba-based text encoders and enhancing knowledge distillation techniques.
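
As a concrete illustration of the distillation objective described above, the sketch below shows one common way to align a Mamba-based image encoder (student) with a frozen Transformer-based ViT image encoder (teacher) in PyTorch. The function name, loss weighting, and relational term are illustrative assumptions rather than the repository's actual implementation; only the image encoder is replaced here, with the CLIP text encoder kept unchanged, consistent with the abstract's note that a Mamba-based text encoder is left to future work.

import torch
import torch.nn.functional as F

def distillation_loss(teacher_vit, student_mamba, images, temperature=0.07, alpha=0.5):
    # Illustrative embedding-level distillation: teacher_vit and student_mamba are
    # placeholder encoders mapping a batch of images to embeddings of the same dimension.
    with torch.no_grad():  # the ViT teacher stays frozen
        t_emb = F.normalize(teacher_vit(images), dim=-1)
    s_emb = F.normalize(student_mamba(images), dim=-1)

    # (1) direct embedding matching: cosine distance between unit-norm vectors
    feat_loss = (1.0 - (s_emb * t_emb).sum(dim=-1)).mean()

    # (2) relational term: match the teacher's image-to-image similarity structure
    t_sim = F.softmax(t_emb @ t_emb.t() / temperature, dim=-1)
    s_sim = F.log_softmax(s_emb @ s_emb.t() / temperature, dim=-1)
    rel_loss = F.kl_div(s_sim, t_sim, reduction="batchmean")

    return alpha * feat_loss + (1.0 - alpha) * rel_loss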

Setup

# Create a conda environment and install the Python dependencies
conda create -n clipenv python=3.10
conda activate clipenv
pip install -r requirements.txt

# Install MambaVision from source as an editable package
git clone https://github.com/NVlabs/MambaVision.git
cd MambaVision
pip install -e .
cd ..

# Download ImageNet via the provided script
bash download_imagenet.sh
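
Once the editable install finishes, a quick sanity check is to load a pretrained MambaVision backbone and push a dummy image through it. The snippet below is only a sketch: it assumes the transformers package is available and uses the publicly released nvidia/MambaVision-T-1K weights from the Hugging Face Hub at 224x224 resolution, which may differ from the backbone this repository actually trains; the Mamba kernels generally expect a CUDA device.

import torch
from transformers import AutoModel

# Load a pretrained MambaVision backbone from the Hugging Face Hub
# (the model id and 224x224 input resolution are assumptions for this check).
model = AutoModel.from_pretrained("nvidia/MambaVision-T-1K", trust_remote_code=True)
model = model.cuda().eval()

with torch.no_grad():
    dummy = torch.randn(1, 3, 224, 224, device="cuda")
    _ = model(dummy)  # forward pass only; the output structure depends on the remote model code

print("Parameters:", sum(p.numel() for p in model.parameters()))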

Run

# Launch distillation training (DataCompDR-12M configuration, per the script name)
bash run_datacompdr12m.sh

About

Paper presented in the poster session of the Undergraduate Paper Competition at the 2024 IEIE (Institute of Electronics and Information Engineers) Fall Conference.
