English Question Answering System

Require:

pip install transformers
conda install -c anaconda importlib-metadata

Download pre-trained model before running:

from transformers import ElectraModel, ElectraTokenizerFast
tokenizer = ElectraTokenizerFast.from_pretrained('google/electra-small-discriminator')
model = ElectraModel.from_pretrained('google/electra-small-discriminator')

Data preprocessing:

Create a folder "dataset". Move all files into it. (official SQuAD 2.0 / Quoref / NewsQA / Drop / Medhop / Wikihop) Then run scripts in preprocessing/

How to run our baseline:

Run with ELECTRA_small, with only SQuAD 2.0 training set.

python train_baseline.py -c baseline-small -d small

Run with ELECTRA_base, with all datasets except SQuAD 2.0 dev set.

python train_baseline.py -c baseline-base -d normal

How to run advanced model:

Make sure you run baseline first, so you can see a "model_parameters.pth" file

Run with cross-attention decoder with ELECTRA_small, with only SQuAD 2.0 training set, with random seed 114514:

python train.py -c cross-attention -d small -s 114514

Run with match-attention decoder with ELECTRA_small, with all datasets except SQuAD 2.0 dev set, with using regression loss:

python train.py -c match-attention -d normal -rl

Run with CNN decoder with ELECTRA_base, with only SQuAD 2.0 training set, with using dynamic weight averaging:

python train.py -c cnn-span-large -d small -dw

How to evaluate:

Move unprocessed dev-squad2.0.json into directory: /evaluate and /evaluate/processed_dataset

Run evaluate.py, and run SQuAD official evaluate script.

Name		Name	Last commit message	Last commit date
Latest commit History 106 Commits
evaluate		evaluate
model		model
preprocessing		preprocessing
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
test.py		test.py
train-standard.sh		train-standard.sh
train.py		train.py
train_baseline.py		train_baseline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

English Question Answering System

Require:

Download pre-trained model before running:

Data preprocessing:

How to run our baseline:

How to run advanced model:

How to evaluate:

About

Releases

Packages

Languages

Owen-Qin/English-Question-Answering-System

Folders and files

Latest commit

History

Repository files navigation

English Question Answering System

Require:

Download pre-trained model before running:

Data preprocessing:

How to run our baseline:

How to run advanced model:

How to evaluate:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages