Baselines for DSTC10 Track 2 - Task 1

In this task, we take TripPy model trained on the MultiWOZ 2.1 dataset as the official baseline.

If you want to publish experimental results with this baselines, please cite the following article:

@inproceedings{heck2020trippy,
    title = "{T}rip{P}y: A Triple Copy Strategy for Value Independent Neural Dialog State Tracking",
    author = "Heck, Michael and van Niekerk, Carel and Lubis, Nurul and Geishauser, Christian and
              Lin, Hsien-Chin and Moresi, Marco and Ga{\v{s}}i{\'c}, Milica",
    booktitle = "Proceedings of the 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue",
    month = jul,
    year = "2020",
    address = "1st virtual meeting",
    publisher = "Association for Computational Linguistics",
    pages = "35--44",
}

The remainder of this document describes how to run the baseline model on the DSTC10 validation dataset.

How to run it

Train the TripPy model on the MultiWOZ 2.1 dataset following the instructions.
Copy the models into models directory.
Install the required python packages.

$ pip3 install -r requirements.txt

Run the model on the data (change the DATA_DIR parameter in dstc10-dst.infer.pipeline.sh, if required).

$ bash ./dstc10-dst.infer.pipeline.sh

Validate the structure and contents of the tracker output.

$ cd [DSTC10 TRACK 2 - TASK 1 ROOT]
$ python3 scripts/check_results.py --dataset val --dataroot data --outfile baseline/DSTC10_DST/DST_preds.json 
Found no errors, output file is valid

Evaluate the output.

$ python3 scripts/scores.py --dataset val --dataroot data --outfile baseline/DSTC10_DST/DST_preds.json --scorefile baseline.val.score.json

Print out the scores.

$ cat baseline.val.score.json | jq
{
  "joint_goal_accuracy": 0.005341880341880342,
  "slot": {
    "accuracy": 0.7056342780026991,
    "value_prediction": {
      "prec": 0.564648419500507,
      "rec": 0.299330695197616,
      "f1": 0.39125159642401036
    },
    "none_prediction": {
      "prec": 0.7434883885085057,
      "rec": 0.9609524678405255,
      "f1": 0.8383476599808978
    }
  }
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Baselines for DSTC10 Track 2 - Task 1

How to run it

Files

README.md

Latest commit

History

README.md

File metadata and controls

Baselines for DSTC10 Track 2 - Task 1

How to run it