GitHub - layer6ai-labs/DiMS: Code for the ACL'23 paper "DiMS: Distilling Multiple Steps of Iterative Non-Autoregressive Transformers for Machine Translation"

ACL'23 DiMS: Distilling Multiple Steps of Iterative Non-Autoregressive Transformers

[paper]

Authors: Sajad Norouzi, Rasa Hosseinzadeh, Felipe Pérez, Maksims Volkovs

Introduction

This repository contains a full implementation of DiMS, built on the fairseq library.

Environment

The Python code was developed and tested in the following environment:

Python 3.7.6
PyTorch 1.9.0

Experiments were run on an IBM server with 160 POWER9 CPUs, 600GB RAM and 4 Tesla V100 GPUs.

The following command needs to be run in the root of the project before using the repo:

pip install -e ./
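As a quick sanity check after installation, the sketch below compares the running interpreter and the installed PyTorch against the versions listed above. It is only an illustration: the helper names parse_version and environment_report are hypothetical and not part of this repository, and a mismatch means the setup is untested, not necessarily broken.

```python
import re
import sys

# Versions listed in this README as the tested environment.
TESTED_PYTHON = (3, 7, 6)
TESTED_TORCH = "1.9.0"


def parse_version(text):
    """Turn a string such as '1.9.0+cu111' into (1, 9, 0)."""
    parts = []
    # Drop any local build suffix after '+', then keep the leading digits
    # of the first three dot-separated pieces.
    for piece in str(text).split("+")[0].split(".")[:3]:
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def environment_report(python_version, torch_version):
    """Return a list of warnings; an empty list means both versions match."""
    warnings = []
    if tuple(python_version[:3]) != TESTED_PYTHON:
        warnings.append(
            "Python %s differs from the tested 3.7.6"
            % ".".join(str(p) for p in python_version[:3])
        )
    if torch_version is None:
        warnings.append("PyTorch is not installed")
    elif parse_version(torch_version) != parse_version(TESTED_TORCH):
        warnings.append(
            "PyTorch %s differs from the tested %s" % (torch_version, TESTED_TORCH)
        )
    return warnings


if __name__ == "__main__":
    try:
        import torch
        found_torch = torch.__version__
    except ImportError:
        found_torch = None
    report = environment_report(sys.version_info, found_torch)
    for line in report or ["Environment matches the tested versions."]:
        print(line)
```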

Dataset

For the WMT'14 En-De and WMT'16 En-Ro datasets, refer to fairseq's instructions here.

Running The Code

The script ./train_cmlmc_ende.sh can be used to train a teacher. The default configuration uses 4 GPUs and should be edited as necessary. The path to the dataset should be provided in the first line of the script. The paths for checkpoints and logging should be changed in the script via --save-dir, --tensorboard-logdir and --log-file. Note that these directories must exist before running the script.
To distill, use exp_manager.py. Example settings are provided in the ExpSetting directory. These settings files should be edited to contain the correct paths to the dataset and teacher checkpoints. Run it as python exp_manager.py cmlmc_ende 0,1,2,3, where cmlmc_ende.json is inside the ExpSetting directory.
To evaluate any model on the test set, run ./eval_teacher_wmt.sh. The arguments are as follows:

./eval_teacher_wmt.sh PATH_TO_MODEL PATH_TO_DATA NUMBER_OF_STEPS LENGTH_PREDICTOR_BEAM [--ctc]

where --ctc is optional for evaluating Imputer models.
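The steps above can be sketched as a small Python helper that creates the output directories ahead of training and assembles the distillation and evaluation commands in the documented argument order. This is a minimal sketch, not part of the repository: the function names are hypothetical, all paths and the step and beam values in the usage section are placeholders, and reading 0,1,2,3 as a comma-separated list of GPU ids is an assumption based on the example invocation.

```python
import os
import shlex


def ensure_dirs(save_dir, tensorboard_logdir, log_file):
    """Create the directories behind --save-dir, --tensorboard-logdir and --log-file."""
    # For the log file, only its parent directory has to exist.
    for path in (save_dir, tensorboard_logdir, os.path.dirname(log_file)):
        if path:
            os.makedirs(path, exist_ok=True)


def build_distill_command(setting_name, gpu_ids):
    """Build: python exp_manager.py SETTING_NAME ID,ID,...

    setting_name is the JSON file name in ExpSetting without the extension.
    Treating the second argument as GPU ids is an assumption.
    """
    return ["python", "exp_manager.py", setting_name,
            ",".join(str(g) for g in gpu_ids)]


def build_eval_command(model_path, data_path, steps, length_beam, ctc=False):
    """Build the ./eval_teacher_wmt.sh call in the documented argument order."""
    cmd = ["./eval_teacher_wmt.sh", model_path, data_path,
           str(steps), str(length_beam)]
    if ctc:
        # Optional flag, documented for evaluating Imputer models.
        cmd.append("--ctc")
    return cmd


def to_shell(cmd):
    """Render an argument list as a shell-safe string (works on Python 3.7)."""
    return " ".join(shlex.quote(arg) for arg in cmd)


if __name__ == "__main__":
    # Placeholder paths and values; replace with your own.
    ensure_dirs("checkpoints/teacher", "tb_logs/teacher", "logs/teacher/train.log")
    print(to_shell(build_distill_command("cmlmc_ende", [0, 1, 2, 3])))
    print(to_shell(build_eval_command(
        "checkpoints/teacher/model.pt", "data-bin/dataset", 10, 5)))
```

Building the commands as argument lists keeps them usable with subprocess.run without shell quoting issues, while to_shell gives a string that can be pasted into a terminal.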
