Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sibyl-like WebAgent Tool #1891

Closed
wants to merge 24 commits into from
Closed

Sibyl-like WebAgent Tool #1891

wants to merge 24 commits into from

Conversation

fatihozturkh2o
Copy link
Contributor

@fatihozturkh2o fatihozturkh2o commented Nov 1, 2024

Still wip, doing tests...

So far:

  • New web_agent_tool added, (Started from Sibyl())

    • Disabled caching and code_execution logic coming from native Sibly code (because we are handling those in our agents)
    • Changed some of the Converters a bit
      • WikipediaConverter was not returning tabs when parsing the page content, hence, the LLM was not seeing how to visit View History for Wiki pages before. Now I also allowed them.
    • Improved/changed prompt files based on our needs
  • web_agent_tool is capable of :

    • doing complex web search with a kept-state and operations like visiting pages, doing page_up/downs, ctrl+f, downloading documents
    • handling specific web page formats like Wiki, Youtube, etc. (see them in converters)
    • For Youtube videos, realised an important issue that highly effects Youtube GAIA questions. Please see # TODO comment under YouTubeConverter. TL;DR: YouTube blocks returning video transcriptions for requests coming from non-statice IDs (prod envs, cloud, dockers etc. See more: TranscriptsDisabled But it's not disabled (works locally, fails on Cloud machine) jdepoix/youtube-transcript-api#303) This is why, I'm able to get answers for GAIA YouTube questions in my local but not in my remote machine.
  • To be able to prioritize selection of this tool, disabled our google + bing search tools.

    • We might need to move all web related tools under this WebAgent and let it select when needed (possible next steps)
  • Need to test questions with attached files

  • Need to test download_file & file paths for the web_agent tool

  • Web Agent Steps streaming needs to be double-checked (to see if it's ok on h2ogpte side)

  • Either now, or later, choose_tool_prompt needs to be improved for even better web browsing

@fatihozturkh2o fatihozturkh2o self-assigned this Nov 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants