Llm evaluation #117
base: main
Conversation
Integrating deepeval into senselab
… into llm_evaluation
Nice use of style and cleanliness; some quick points:
- Checks are still failing.
- We already have a ScriptLine data structure in utils/datastructures; is there a reason you are rewriting it rather than importing it?
- Write a file called metrics.py with an abstract base class called Metric. Define the abstract methods you want inherited by the different implementations of Metric (see https://docs.python.org/3/library/abc.html), then define the various implementations (ROUGE, etc.).
- Give some options for calculating overall_score in evaluate_conversation, perhaps a harmonic mean, etc.
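A minimal sketch of what the suggested metrics.py could look like. The class and function names (Metric, UnigramOverlap, overall_score) and the unigram-recall stand-in for a ROUGE-style metric are assumptions for illustration, not the PR's actual code:

```python
from abc import ABC, abstractmethod
from statistics import harmonic_mean


class Metric(ABC):
    """Abstract base class for evaluation metrics (per the review suggestion)."""

    @abstractmethod
    def compute(self, prediction: str, reference: str) -> float:
        """Return a score in [0, 1] comparing prediction to reference."""


class UnigramOverlap(Metric):
    """Hypothetical stand-in for a ROUGE-style metric: unigram recall."""

    def compute(self, prediction: str, reference: str) -> float:
        ref_tokens = reference.split()
        if not ref_tokens:
            return 0.0
        pred_tokens = set(prediction.split())
        overlap = sum(1 for token in ref_tokens if token in pred_tokens)
        return overlap / len(ref_tokens)


def overall_score(scores: list[float], method: str = "mean") -> float:
    """Aggregate per-metric scores; the harmonic mean penalizes low outliers."""
    if method == "harmonic":
        return harmonic_mean(scores)
    return sum(scores) / len(scores)
```

New metric implementations would then subclass Metric and override compute, and evaluate_conversation could expose the aggregation method as a parameter.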
At the time of this review, all tests were failing.
The Senselab project integrates deepeval for evaluating conversations, using an api.py script to interface with deep_eval.py, which includes a custom ROUGE metric for comprehensive evaluation. The ScriptLine class standardizes input data, and unit tests ensure accurate functionality, making Senselab a robust wrapper for deepeval and other tools.