Skip to content

smilni/dialog_graphs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a repository with automatic discourse annotation of Daily Dialog dataset. It contains 991 dialogs with each utterance segmented into sentences. Annotation was performed with the use of ChatGPT. Currently, dd_annotation_results.tsv contains 3 kinds of annotation:

  • open goals annotation with no set classes. Prompt
  • annotation according to (Goals-Plans-Action) theory. Prompt
  • annotation according to (DuRecDial 2.0) annotation scheme. In this case, we also attempted to track and annotated goal states (with little success, though). Prompt

Other files contain analysis of the annotation results, specifically, open goals and DuRecDial-style goals.

With dd_analysis.py, you can see the goal-annotated dialogs in a user-friendly streamlit interface.

Screenshot 2023-03-24 at 18 56 55

With dd_visualisation.ipynb, you can take a look at a larger-scale dataset analysis, with all annotated dialogs presented in graphs using networkx.

Screenshot 2023-03-24 at 19 02 54

dd_processing.ipynb is a technical file for preprocessing the annotated data, you don't have to run it. The results of this preprocessing are found in dialog_data/processed.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published