The prevalence of Transformer-based pre-trained language models (PLMs) has led to their wide adoption for various natural language processing tasks. However, their excessive overhead results in high latency and computational costs. Static compression methods allocate a fixed amount of computation to every sample, resulting in redundant computation, while dynamic token pruning methods selectively shorten the input sequences but cannot change the model size and rarely match the speedups of static pruning. In this paper, we propose a model acceleration approach for large language models that combines dynamic token downsampling with static pruning, optimized by an information bottleneck loss. Our model, Infor-Coef, achieves an 18x FLOPs speedup with an accuracy degradation of less than 8% compared to BERT. This work provides a promising approach to compressing and accelerating Transformer-based models for NLP tasks.
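As a rough sketch only (see the paper for the exact formulation), the training objective can be thought of as the task loss plus the two information bottleneck terms, whose weights correspond to the `NORM` and `ENTRO` hyper-parameters listed below:

$$
\mathcal{L} \;=\; \mathcal{L}_{\text{task}} \;+\; \lambda_{\text{norm}}\,\mathcal{L}_{\text{norm}} \;+\; \lambda_{\text{entropy}}\,\mathcal{L}_{\text{entropy}}
$$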
- Create a conda virtual environment and activate it:

```bash
conda create --name infor_coef --file requirements.txt
conda activate infor_coef
```
- Download the pruned CoFi models (or train them from scratch).
- Modify the training parameters in `action.sh` and run it.
Our script supports only single-GPU training. The parameters are as follows (an illustrative configuration is shown after this list):

- `TASK`: the task to train, one of `mrpc`, `sst-2`, `mnli`, `qnli`
- `sparsity`: the sparsity of the pruned model
- `model_name_or_path`: the path of the pruned model
- `CUDA`: the GPU id to use
- `NORM`: the norm-based penalty parameter of the information bottleneck loss
- `ENTRO`: the entropy regularization parameter of the information bottleneck loss
- `EPOCHs`: the number of training epochs
- `LR`: the learning rate
- `bsz`: the batch size
- `skim`: the hyper-parameter of dynamic token skimming in Transkimmer; set to 0 in our model
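For example, a hypothetical configuration of these variables in `action.sh` could look like the following (all values are illustrative; the actual script layout and defaults may differ):

```bash
# Illustrative values only; edit to match your setup.
TASK=sst-2                               # one of mrpc, sst-2, mnli, qnli
sparsity=0.95                            # sparsity of the pruned model (example value)
model_name_or_path=./pruned_models/sst2  # hypothetical path to the pruned CoFi model
CUDA=0                                   # GPU id
NORM=0.1                                 # norm-based penalty weight (example value)
ENTRO=0.1                                # entropy regularization weight (example value)
EPOCHs=10                                # number of training epochs
LR=2e-5                                  # learning rate
bsz=32                                   # batch size
skim=0                                   # Transkimmer token-skim hyper-parameter; 0 for Infor-Coef
```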
Run `eval.sh` to evaluate the pruned model on the corresponding task. The parameters are the same as in `action.sh`.
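Assuming both scripts read the same variables, a typical end-to-end run (hypothetical invocation) would be:

```bash
bash action.sh   # train the accelerated model with the settings above
bash eval.sh     # evaluate the resulting model on the same task
```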