Skip to content

PubMedQA: A Dataset for Biomedical Research Question Answering

License

Notifications You must be signed in to change notification settings

WangBingJian233/pubmedqa

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PubMedQA

Download

PQA-L is already in ./data/

PQA-U

PQA-A

Split the dataset

After downloading PQA-A and PQA-L as ori_pqaa.json and ori_pqau.json in the ./data/, enter the ./preprocess/ directory and split the dataset:

cd preprocess
python split_dataset.py pqaa
python split_dataset.py pqal

Please be aware that there is no offical code for splitting PQA-U.

Evaluation and submission

To evaluate your model predictions, please prepare the results in a json format where the key is PMID and value is one of "yes", "no", and "maybe". Run the following script to get the performance:

python evaluation.py PREDICTIONS_PATH

To submit a system on the Leaderboard, please send an email that contains the model predictions and a brief description of the system to Qiao Jin via [email protected].

Human performance

After splitting the PQA-L and having ./data/test_set.json, one can run the following script to get human performance:

python get_human_performance.py

Citation

If you use PubMedQA in your research, please cite our paper by:

@inproceedings{jin2019pubmedqa,
  title={PubMedQA: A Dataset for Biomedical Research Question Answering},
  author={Jin, Qiao and Dhingra, Bhuwan and Liu, Zhengping and Cohen, William and Lu, Xinghua},
  booktitle={Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)},
  pages={2567--2577},
  year={2019}
}

About

PubMedQA: A Dataset for Biomedical Research Question Answering

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%