**Update [2023/08/01]:** We have extended our prompt-based method to LLMs in scaling_sentemb.
We propose PromptBERT, a novel contrastive learning method for learning better sentence representations. We first analyze the drawbacks of the sentence embeddings produced by the original BERT and find that they stem mainly from static token embedding bias and ineffective BERT layers. We then propose the first prompt-based sentence embedding method and discuss two prompt representation methods and three prompt search methods that let BERT produce better sentence embeddings. Moreover, we propose a novel unsupervised training objective based on template denoising, which substantially narrows the performance gap between the supervised and unsupervised settings. Extensive experiments show the effectiveness of our method. Compared to SimCSE, PromptBERT achieves improvements of 2.29 and 2.58 points on BERT and RoBERTa, respectively, in the unsupervised setting.
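To make the method concrete, here is a minimal sketch of a prompt-based sentence embedding, assuming the Hugging Face `transformers` API. It fills a sentence into the paper's manual template and takes the hidden state at the `[MASK]` position as the sentence representation; it illustrates the idea only and is not the training code in this repo.

```python
# Minimal sketch: prompt-based sentence embeddings from the [MASK] position.
# Illustrative only; the run.sh scripts in this repo implement the full method.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased").eval()

# Manual template from the paper; {} is replaced by the input sentence.
TEMPLATE = 'This sentence : " {} " means [MASK] .'

def prompt_embedding(sentence: str) -> torch.Tensor:
    inputs = tokenizer(TEMPLATE.format(sentence), return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, seq_len, hidden)
    # The hidden state at the [MASK] token serves as the sentence embedding.
    mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero()
    return hidden[0, mask_pos.item()]

emb_a = prompt_embedding("A man is playing a guitar.")
emb_b = prompt_embedding("Someone plays an instrument.")
print(torch.cosine_similarity(emb_a, emb_b, dim=0).item())
```

Template denoising builds on this representation: at a high level, the embedding obtained from the template alone (with the sentence slot left empty) is subtracted as a template bias, so the contrastive objective focuses on the sentence rather than the prompt.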
## Results on STS tasks
Model | STS12 | STS13 | STS14 | STS15 | STS16 | STSb | SICK-R | Avg. |
---|---|---|---|---|---|---|---|---|
royokong/unsup-PromptBERT | 71.56±0.18 | 84.58±0.22 | 76.98±0.26 | 84.47±0.24 | 80.60±0.21 | 81.60±0.22 | 69.87±0.40 | 78.54±0.15 |
royokong/unsup-PromptRoBERTa | 73.94±0.90 | 84.74±0.36 | 77.28±0.41 | 84.99±0.25 | 81.74±0.29 | 81.88±0.37 | 69.50±0.57 | 79.15±0.25 |
royokong/sup-PromptBERT | 75.48 | 85.59 | 80.57 | 85.99 | 81.08 | 84.56 | 80.52 | 81.97 |
royokong/sup-PromptRoBERTa | 76.75 | 85.93 | 82.28 | 86.69 | 82.80 | 86.14 | 80.04 | 82.95 |
To evaluate the above models, run:
```bash
bash eval_only.sh [unsup-bert|unsup-roberta|sup-bert|sup-roberta]
```
## Install Dependencies
```bash
pip install -r requirements.txt
```
## Download Data
```bash
cd SentEval/data/downstream/
bash download_dataset.sh
cd -
cd ./data
bash download_wiki.sh
bash download_nli.sh
cd -
```
### bert-base-uncased with prompt
```bash
./run.sh bert-prompt
```
### bert-base-uncased with optiprompt
```bash
./run.sh bert-optiprompt
```
### Unsupervised
```bash
SEED=0
./run.sh unsup-roberta $SEED
./run.sh unsup-bert $SEED
```
### Supervised
```bash
./run.sh sup-roberta
./run.sh sup-bert
```
### Static token embeddings with biases removed: roberta-base, bert-base-cased and bert-base-uncased
```bash
./run.sh roberta-base-static-embedding-remove-baises
./run.sh bert-base-cased-static-embedding-remove-baises
./run.sh bert-base-uncased-static-embedding-remove-baises
```
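These runs reproduce the static token embedding baselines. As a rough sketch of the idea (assuming `transformers`; the actual bias removal in `run.sh` filters more token types than the punctuation-only filter shown here), a sentence is represented by averaging BERT's static input token embeddings after dropping biased tokens:

```python
# Rough sketch: average static (input) token embeddings, dropping biased
# tokens. Punctuation removal is a simplified stand-in for the paper's
# filtering of frequency-, case- and punctuation-biased tokens.
import string
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased").eval()
static_emb = model.get_input_embeddings().weight.detach()  # vocab x hidden

def static_avg_embedding(sentence: str, remove_biases: bool = True) -> torch.Tensor:
    ids = tokenizer(sentence, add_special_tokens=False)["input_ids"]
    if remove_biases:
        tokens = tokenizer.convert_ids_to_tokens(ids)
        ids = [i for i, t in zip(ids, tokens) if t not in string.punctuation]
    return static_emb[ids].mean(dim=0)
```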
### Calculate sentence anisotropy
```bash
EXP=[bert-base-uncased|bert-base-uncased-static-embedding|bert-base-uncased-static-embedding-remove-baises]
./run.sh calc-anisotropy-${EXP}
```
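Anisotropy is commonly measured as the average cosine similarity between embeddings of sampled sentence pairs. A minimal sketch of that measurement, assuming the sentence embeddings are already computed:

```python
# Minimal sketch: anisotropy as the mean pairwise cosine similarity between
# sentence embeddings, excluding self-pairs. Higher values = more anisotropic.
import torch
import torch.nn.functional as F

def anisotropy(embeddings: torch.Tensor) -> float:
    """embeddings: (n, hidden) matrix of sentence embeddings."""
    normed = F.normalize(embeddings, dim=1)
    sims = normed @ normed.T                       # pairwise cosine similarity
    n = sims.size(0)
    off_diag = sims.sum() - sims.diagonal().sum()  # drop the n self-pairs
    return (off_diag / (n * (n - 1))).item()
```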
Our code is based on SimCSE.