add action message #79

filip-michalsky · 2024-09-27T00:35:03Z

why

This is the second, reworked attempt.

Paul mentioned a vision where stagehand will be a tool we can pass to an LLM to perform an action on the web. In that sense, we DO want to return a success/failure signal and a reason for the failure so that the agent or developer can heal their pipelines (either manually or agentically).

what changed

We now return a message from act() once it finishes. The evals do not use this message yet, so we need to add evals that test it.
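
As a rough illustration of what a caller could do with such a message, here is a minimal sketch. The `ActResult` shape, its field names (`success`, `message`), and the `describeResult` helper are assumptions made for this example and may not match what act() actually returns in this PR.

```typescript
// Hypothetical shape of the value returned by act(); field names are assumptions.
interface ActResult {
  success: boolean;
  message: string;
}

// Hypothetical helper: format a result as a log line that a developer
// (or an agent reading tool output) can act on.
function describeResult(action: string, result: ActResult): string {
  const status = result.success ? "succeeded" : "failed";
  return `Action "${action}" ${status}: ${result.message}`;
}
```

An eval for a failed action could then assert on `success` being false and on the message being non-empty, rather than only on the final page state.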

test plan

Evals for the returned message still need to be added.

filip-michalsky · 2024-09-27T00:39:20Z

My thinking behind this updated API design:

Returning a message from act() does not break anything in the existing evals. It is simply an optional return value that a developer can use to retry actions, perhaps changing the prompt until the action gets done. It can also help with the prompt optimizer, where another LLM can change the action prompt if the previous action failed, etc.
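
The retry idea above could look roughly like the sketch below. Everything here is hypothetical: `ActResult`, the `Act` and `Rewrite` function types, and `actWithRetry` are names invented for illustration, and `rewrite` stands in for whatever changes the prompt (a developer-supplied rule or another LLM).

```typescript
// Hypothetical result shape; the real return type of act() may differ.
interface ActResult {
  success: boolean;
  message: string;
}

// Stand-ins for act() and for a prompt rewriter (assumptions for this sketch).
type Act = (action: string) => Promise<ActResult>;
type Rewrite = (action: string, failureMessage: string) => string;

// Retry an action, rewriting the prompt from the failure message each time.
async function actWithRetry(
  act: Act,
  action: string,
  rewrite: Rewrite,
  maxAttempts = 3,
): Promise<ActResult> {
  let prompt = action;
  let last: ActResult = { success: false, message: "no attempts made" };
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    last = await act(prompt);
    if (last.success) return last;
    // Feed the failure reason back into the next prompt.
    prompt = rewrite(prompt, last.message);
  }
  return last; // Still failing after maxAttempts; caller sees the last reason.
}
```

Because the message is optional to consume, existing callers that ignore the return value keep working unchanged.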

filip-michalsky · 2024-10-02T12:59:37Z

This needs to get fixed. I resolved the merge conflicts, but some banalyzer dependencies are now missing. Also, the homedepot eval is still not passing.

merge main

filip-michalsky · 2024-10-04T02:51:10Z

lib/dom/process.ts

@@ -16,7 +16,7 @@ async function processElements(chunk: number) {
  const chunkHeight = viewportHeight * chunk;
  const offsetTop = chunkHeight;

-  window.scrollTo(0, offsetTop);
+  window.scrollTo({ top: offsetTop, left: 0, behavior: 'smooth' });


I added `behavior: 'smooth'` to the scroll call. The hypothesis is that it reduces hard reloads on slow sites, but it's based on just one eval (homedepot).

lib/index.ts

add action message

f719e19

This was referenced Sep 27, 2024

[WIP - do not merge] Fm/actions not found #78

Closed

Determine experience when actions are not found #46

Closed

Filip Michalsky added 3 commits September 26, 2024 20:41

update

ef10ef7

add eval for failed action

4c2aba7

added two more evals

4159d8c

filip-michalsky requested review from pkiv and navidkpr September 29, 2024 00:30

Filip Michalsky and others added 8 commits September 28, 2024 22:55

fix homedepot

2ce2785

update homedepot

ffeb6e3

update

6657fb9

smooth scrolling

8c5269a

search bar homedepot fix

585f326

update

e589e0e

added smooth scrolling

0adfec9

Merge branch 'main' into fm/return-no-action

e52ceae

filip-michalsky added 3 commits October 2, 2024 21:37

add main

e523b44

Merge branch 'main' into fm/return-no-action

8c80a30

merge main

update playground

cd37700

filip-michalsky commented Oct 4, 2024

lib/index.ts (resolved)

navidkpr approved these changes Oct 4, 2024

pkiv merged commit 7edb817 into main Oct 4, 2024
1 check passed

filip-michalsky deleted the fm/return-no-action branch October 6, 2024 23:48
