Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about evaluation #10

Open
Violettttee opened this issue Aug 28, 2024 · 2 comments
Open

about evaluation #10

Violettttee opened this issue Aug 28, 2024 · 2 comments

Comments

@Violettttee
Copy link

In addition to constraint ‘example’, is only gpt4 used for the evaluation of other constraint_type? Or are other models evaluated using both rule_based and gpt?

想请问一下这里面除了example以外的其他constraint_type的评估是只用了gpt4吗?还是说其他模型的评估既要用rule_based还要用gpt,双重打分?

@Violettttee
Copy link
Author

another question is that why "level0" envolved in the data file?since i see in the utils.py file,it has filterd data which 'level = 0'.
还有一个问题是为什么level0的数据被包含在了数据文件里面呢?我看到在utils.py中的convert_to_api_input函数只添加进了level大于0的数据。

@YJiangcm
Copy link
Owner

YJiangcm commented Sep 7, 2024

Hi, for your first question, the example constraint is evaluated by rule_based, and other constraints are evaluated by both rule_based and gpt.

For your second question, "level0" is used as additional information during gpt's evaluation. Please refer to Figure 4 in our paper.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants