feat: add script-based auto-evaluation with Streamlit analysis #114

error9098x · 2024-12-19T20:34:27Z

This PR introduces a script-based auto-evaluation system designed to test different versions of ORAssistant and compare them with other OpenAI or Google LLM models. The script is user-friendly and includes Streamlit-based visualization for the final results.

Signed-off-by: error9098x <[email protected]>

Signed-off-by: Jack Luar <[email protected]>

feat: add script-based auto-evaluation with Streamlit analysis

e6ee87a

Signed-off-by: error9098x <[email protected]>

error9098x requested a review from luarss December 20, 2024 03:20

fix mypy/ruff checks

2ba8451

Signed-off-by: Jack Luar <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add script-based auto-evaluation with Streamlit analysis #114

feat: add script-based auto-evaluation with Streamlit analysis #114

error9098x commented Dec 19, 2024

feat: add script-based auto-evaluation with Streamlit analysis #114

Are you sure you want to change the base?

feat: add script-based auto-evaluation with Streamlit analysis #114

Conversation

error9098x commented Dec 19, 2024