Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Eval CI should fail if it detects a regression #95

Open
pkiv opened this issue Oct 3, 2024 · 0 comments
Open

Eval CI should fail if it detects a regression #95

pkiv opened this issue Oct 3, 2024 · 0 comments
Milestone

Comments

@pkiv
Copy link
Contributor

pkiv commented Oct 3, 2024

Our github actions eval CI should not be green if the change introduces a regression to the evals.

@kamath kamath added this to Stagehand Nov 29, 2024
@kamath kamath added this to the Evaluation milestone Nov 29, 2024
@kamath kamath moved this to Todo in Stagehand Nov 29, 2024
@kamath kamath removed the status in Stagehand Nov 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

2 participants