Linformer: Self-Attention with Linear Complexity (Wang et al., 2020)

This example contains code to train Linformer models as described in our paper Linformer: Self-Attention with Linear Complexity.

Training a new Linformer RoBERTa model

You can mostly follow the RoBERTa pretraining README; in your training command, set the architecture with --arch linformer_roberta_base.
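
As a rough illustration, the sketch below assembles a fairseq-train command with the architecture swapped in and prints it as a dry run rather than launching training. Only --arch linformer_roberta_base comes from this README. The data path, the hyperparameter values, and the --user-dir location are assumptions for illustration; take the real values from the RoBERTa pretraining README and your own checkout.

```shell
#!/bin/sh
# Dry-run sketch: builds the training command and prints it for inspection.

# Assumed placeholder: path to your binarized pretraining data.
DATA_DIR="${DATA_DIR:-data-bin/wikitext-103}"

# Assumption: the Linformer code is loaded as a plugin directory via --user-dir.
# Adjust or remove this if your setup already registers the architecture.
USER_DIR="${USER_DIR:-examples/linformer/linformer_src}"

# From this README: the architecture name to use.
ARCH="linformer_roberta_base"

# Hyperparameters below are illustrative placeholders, not tuned values.
CMD="fairseq-train $DATA_DIR \
  --user-dir $USER_DIR \
  --task masked_lm --criterion masked_lm \
  --arch $ARCH \
  --sample-break-mode complete --tokens-per-sample 512 \
  --optimizer adam --lr 0.0005 \
  --max-update 125000"

echo "$CMD"

# To launch training once the command looks right:
# eval "$CMD"
```

Printing the command first makes it easy to compare against the RoBERTa pretraining command and confirm that the architecture flag is the intended difference.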

Citation

If you use our work, please cite:

@article{wang2020linformer,
  title={Linformer: Self-Attention with Linear Complexity},
  author={Wang, Sinong and Li, Belinda and Khabsa, Madian and Fang, Han and Ma, Hao},
  journal={arXiv preprint arXiv:2006.04768},
  year={2020}
}
