Add `AnswerExactMatchEvaluator` #7381

silvanocerza · 2024-03-19T14:33:17Z

Related Issues

fixes Implement function to calculate Exact Match metric #6067

Proposed Changes:

Add AnswerExactMatchEvaluator. This Component calculates the Exact Match metrics given a list of questions, a list of expected answers for each question and the list of predicted answers for each question.

How did you test it?

I added unit tests.

Notes for the reviewer

N/A

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.
I documented my code
I ran pre-commit hooks and fixed any issue

julian-risch

I believe we can leave out from_dict and to_dict implementations as they have the same effect as the default implementation. Otherwise looks good to me.

julian-risch · 2024-03-19T14:40:20Z

haystack/components/evaluators/answer_exact_match.py

+    ```
+    """
+
+    def to_dict(self) -> Dict[str, Any]:


We can leave out the to_dict and from_dict implementation here as it is just using the default right?
For example, when component_to_dict is used it will automatically fall back to default_to_dict:

haystack/haystack/core/serialization.py

Line 10 in f69c3e5

def component_to_dict(obj: Any) -> Dict[str, Any]:

Right, I always forget that. Will remove them right away.

julian-risch

LGTM! 👍
We should add more test cases later. Will also make sense for consistency with test cases for other metrics. For example, we could test more than one prediction per query. Something like:

evaluator.run(
    questions=["What is the capital of Germany?", "What is the capital of France?"],
    ground_truth_answers=[["Berlin"], ["London"]],
    predicted_answers=[["Berlin", "wrong_second_answer_candidate"], ["wrong_first_answer_candidate", "London"]],
)

should result in result["result"] == 1.0

coveralls · 2024-03-19T14:56:39Z

Pull Request Test Coverage Report for Build 8345123752

Details

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.03%) to 89.238%

Totals
Change from base Build 8339327256:	0.03%
Covered Lines:	5390
Relevant Lines:	6040

💛 - Coveralls

silvanocerza added 5 commits March 19, 2024 14:56

Add AnswerExactMatchEvaluator

eb441b9

Add release notes

c5e09be

Fix linting

a9ece50

Update docstrings

9f36026

Update docstrings

9085112

silvanocerza self-assigned this Mar 19, 2024

silvanocerza requested review from a team as code owners March 19, 2024 14:33

silvanocerza requested review from dfokina and julian-risch and removed request for a team March 19, 2024 14:33

github-actions bot added topic:tests 2.x Related to Haystack v2.0 type:documentation Improvements on the docs labels Mar 19, 2024

julian-risch requested changes Mar 19, 2024

View reviewed changes

silvanocerza added 2 commits March 19, 2024 15:43

Remove to_dict and from_dict

339adb9

Fix linting

06359a5

julian-risch approved these changes Mar 19, 2024

View reviewed changes

silvanocerza merged commit 610ad6f into main Mar 19, 2024
23 checks passed

silvanocerza deleted the exact-match-evaluator branch March 19, 2024 15:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `AnswerExactMatchEvaluator` #7381

Add `AnswerExactMatchEvaluator` #7381

silvanocerza commented Mar 19, 2024

julian-risch left a comment

julian-risch Mar 19, 2024

silvanocerza Mar 19, 2024

julian-risch left a comment

coveralls commented Mar 19, 2024 •

edited

Loading

Add AnswerExactMatchEvaluator #7381

Add AnswerExactMatchEvaluator #7381

Conversation

silvanocerza commented Mar 19, 2024

Related Issues

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

julian-risch left a comment

Choose a reason for hiding this comment

julian-risch Mar 19, 2024

Choose a reason for hiding this comment

silvanocerza Mar 19, 2024

Choose a reason for hiding this comment

julian-risch left a comment

Choose a reason for hiding this comment

coveralls commented Mar 19, 2024 • edited Loading

Pull Request Test Coverage Report for Build 8345123752

Details

💛 - Coveralls

Add `AnswerExactMatchEvaluator` #7381

Add `AnswerExactMatchEvaluator` #7381

coveralls commented Mar 19, 2024 •

edited

Loading