Questions about cal_knowledge_quadrants #6

BUGLI27 · 2024-11-08T03:29:29Z

answer_correct = False
        if 'answer' in item and item['answer'] == 'This question is beyond the scope of my knowledge, and I am not sure what the answer is.':
            y_true.append(1)
        else:
            y_true.append(0)

There is no 'answer' attribute in item, which results in the questions all being marked as not known(The y_true will be 0 forever). It's really weird.
In my opinion, IDK threshold should be used to judge whether the model knows the answer.

if 'This question is beyond the scope of my knowledge, and I am not sure what the answer is.' in item['generated_answer']:
            y_pred.append(1)
        else:
            y_pred.append(0)

Besides, when cal know quads of Prompt method, isn't the judge rule too strict?
It's nearly IMPOSSIBLE for Prompt method to output 'This question is beyond the scope of my knowledge, and I am not sure what the answer is.' because the prompt is "Answer the following question, and if you don't know the answer, only reply with 'I don't know' ".
As a result, the y_pred will be 0 forever.

if y_true[-1] == 1: # marked as I dont know
            if y_pred[-1] == 1: # refuse to answer
                sample_disribution['Known Unknowns'] += 1
            else:
                if answer_correct: # give a correct answer
                    sample_disribution['Known Knowns'] += 1
                else: # give a wrong answer
                    sample_disribution['Unknown Unknowns'] += 1
        else: # marked as I know
            if y_pred[-1] == 1: # refuse to answer
                sample_disribution['Unknown Knowns'] += 1
            else:
                if answer_correct: # give a correct answer
                    sample_disribution['Known Knowns'] += 1
                else: # give a wrong a answer
                    sample_disribution['Unknown Unknowns'] += 1

The cal code that really works is the following:

 if answer_correct: # give a correct answer
    sample_disribution['Known Knowns'] += 1
else: # give a wrong a answer
    sample_disribution['Unknown Unknowns'] += 1

As a result, the know quad will only contain IK-IK and IDK-IDK.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about cal_knowledge_quadrants #6

Questions about cal_knowledge_quadrants #6

BUGLI27 commented Nov 8, 2024

Questions about cal_knowledge_quadrants #6

Questions about cal_knowledge_quadrants #6

Comments

BUGLI27 commented Nov 8, 2024