add extract evals for stagehand.page #331

Merged
merged 3 commits into from
Dec 23, 2024
kamath
Contributor

@kamath kamath commented Dec 23, 2024

Adds evals for extract that build on #329 and run against stagehand.page instead of stagehand.
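
For readers unfamiliar with the pattern, an extract eval that targets a page object rather than the top-level stagehand object might look roughly like the sketch below. Everything here is an assumption for illustration: `PageLike`, `ExtractOptions`, `EvalResult`, `runExtractEval`, and `stubPage` are hypothetical names, not the types or helpers used in this PR, and the real `stagehand.page.extract` signature may differ.

```typescript
// Hypothetical shapes for illustration only; these are not Stagehand's real types.
interface ExtractOptions {
  instruction: string;
}

interface PageLike<T> {
  // Stands in for an extract method exposed on the page object.
  extract(options: ExtractOptions): Promise<T>;
}

interface EvalResult<T> {
  success: boolean;
  extracted: T | null;
  error?: string;
}

// Runs one extract call against the page and compares a single field
// of the result to the expected value. Errors are reported, not thrown.
async function runExtractEval<T extends Record<string, unknown>>(
  page: PageLike<T>,
  instruction: string,
  field: keyof T,
  expected: unknown,
): Promise<EvalResult<T>> {
  try {
    const extracted = await page.extract({ instruction });
    return { success: extracted[field] === expected, extracted };
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    return { success: false, extracted: null, error: message };
  }
}

// A stub page so the sketch runs without a browser or a model call.
const stubPage: PageLike<{ title: string }> = {
  extract: async () => ({ title: "Example Domain" }),
};
```

Passing the page in as a parameter keeps the eval independent of how the browser session is created, so the same function can be exercised with a stub in tests.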

changeset-bot bot commented Dec 23, 2024

⚠️ No Changeset found

Latest commit: 95a68b8

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

@kamath kamath requested a review from seanmcguire12 December 23, 2024 06:22
@kamath kamath marked this pull request as ready for review December 23, 2024 06:27
Collaborator

@seanmcguire12 seanmcguire12 left a comment

approved pending CI

@kamath kamath merged commit 6751c8e into v2 Dec 23, 2024
9 checks passed
kamath added a commit that referenced this pull request Dec 24, 2024
* Use CI on v2 branch

* branch

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* stagehand.act -> page.act (#326)

* need to actually move to act to page now

* move act -> page

* fix e2e

* fix tests

* readme

* changeset

* package json and changeset

* don't fail on combo evals

* Add act evals on `stagehand.page` (#328)

* move act evals to stagehand.page

* add basic act and make act necessary in type

* move extract and observe to page (#329)

* move act evals to stagehand.page

* add basic act and make act necessary in type

* move extract and observe

* example

* changeset

* More playwright tests (#330)

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* Use CI on v2 branch

* branch

* stagehand.page tests

* dont run on BB

* prettier

* pls dont fail

* headless

---------

Co-authored-by: Anirudh Kamath <[email protected]>

* add extract evals for stagehand.page (#331)

* add extract evals for stagehand.page

* fix typing

* smh i didn't actually run extract

* add observe page evals (#332)

* change stagehand.observe to stagehand.page.observe in evals

* changeset

* Browsercontext playwright tests (#334)

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* Use CI on v2 branch

* branch

* BrowserContext tests

* file path

---------

Co-authored-by: Anirudh Kamath <[email protected]>

* changeset minor

* ci yml

---------

Co-authored-by: seanmcguire12 <[email protected]>
Co-authored-by: Sean McGuire <[email protected]>

2 participants