Add act evals on stagehand.page #328

Merged
kamath merged 2 commits into v2 from anirudh/add-act-evals
Dec 23, 2024

Conversation

kamath (Contributor) commented Dec 23, 2024

why

#326 moved act from stagehand to stagehand.page and deprecated stagehand.act. This PR updates the evals to match.

what changed

Evals now point to stagehand.page.act instead of stagehand.act
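
As a rough illustration of the call-site change, here is a minimal sketch. The interfaces, the `{ action }` argument shape, and the helper names `runActEval` and `makeStubStagehand` are assumptions made for this example, not the actual eval code or the full Stagehand types.

```typescript
// Hypothetical minimal shapes for illustration only; the real Stagehand types are richer.
interface ActResult {
  success: boolean;
  message: string;
}

interface PageLike {
  act(options: { action: string }): Promise<ActResult>;
}

interface StagehandLike {
  page: PageLike;
}

// An eval task written against the page-level API.
// Before #326 the equivalent call would have been made on the stagehand object itself.
async function runActEval(
  stagehand: StagehandLike,
  action: string,
): Promise<boolean> {
  const result = await stagehand.page.act({ action });
  return result.success;
}

// Stub standing in for a real Stagehand instance so the sketch runs without a browser.
// It records each requested action and always reports success.
function makeStubStagehand(calls: string[]): StagehandLike {
  return {
    page: {
      async act({ action }) {
        calls.push(action);
        return { success: true, message: `performed: ${action}` };
      },
    },
  };
}
```

With the stub, `runActEval(makeStubStagehand([]), "click the login button")` resolves to `true`; against a real instance the result would depend on the page and the model.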

test plan

evals


changeset-bot bot commented Dec 23, 2024

⚠️ No Changeset found

Latest commit: f6e165d

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types


@kamath kamath changed the base branch from main to v2 December 23, 2024 00:55
@kamath kamath marked this pull request as ready for review December 23, 2024 00:56
@kamath kamath requested a review from seanmcguire12 December 23, 2024 00:57
seanmcguire12 (Collaborator) left a comment


Approved, pending the act evals passing.

kamath (Contributor, Author) commented Dec 23, 2024

act evals passed, merging

@kamath kamath merged commit 027c6a6 into v2 Dec 23, 2024
9 checks passed
@kamath kamath deleted the anirudh/add-act-evals branch December 23, 2024 02:41
kamath added a commit that referenced this pull request Dec 24, 2024
* Use CI on v2 branch

* branch

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* stagehand.act -> page.act (#326)

* need to actually move act to page now

* move act -> page

* fix e2e

* fix tests

* readme

* changeset

* package json and changeset

* don't fail on combo evals

* Add act evals on `stagehand.page` (#328)

* move act evals to stagehand.page

* add basic act and make act necessary in type

* move extract and observe to page (#329)

* move act evals to stagehand.page

* add basic act and make act necessary in type

* move extract and observe

* example

* changeset

* More playwright tests (#330)

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* Use CI on v2 branch

* branch

* stagehand.page tests

* dont run on BB

* prettier

* pls dont fail

* headless

---------

Co-authored-by: Anirudh Kamath <[email protected]>

* add extract evals for stagehand.page (#331)

* add extract evals for stagehand.page

* fix typing

* smh i didn't actually run extract

* add observe page evals (#332)

* change stagehand.observe to stagehand.page.observe in evals

* changeset

* Browsercontext playwright tests (#334)

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* Use CI on v2 branch

* branch

* BrowserContext tests

* file path

---------

Co-authored-by: Anirudh Kamath <[email protected]>

* changeset minor

* ci yml

---------

Co-authored-by: seanmcguire12 <[email protected]>
Co-authored-by: Sean McGuire <[email protected]>

2 participants