Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide support for more than two models and provide a training guide. #23

Open
BrainSlugs83 opened this issue Jul 11, 2024 · 4 comments

Comments

@BrainSlugs83
Copy link

It looks like this only supports two models, a strong and a weak model. But there are other things to consider like if privacy is a concern, or if the question is math heavy, or if the question has a visual element, etc.

Why not have a RouteLLM that could route to several arbitrary models (including local, self-hosted, or models as a service like GPT4).

And provide some example training scripts and/or a training guide that we could use to fine tune this.

@iojw
Copy link
Collaborator

iojw commented Jul 12, 2024

Hi, thanks for raising this! That's definitely something we've been thinking about for while.

We did our initial research focusing on only 2 models as a start, but more research is required to build routers that work well with multiple models.

@jonpsy
Copy link

jonpsy commented Jul 28, 2024

Hey @iojw , would the team need some help in this regard? I could help with benchmarks and reporting results and/or write code to extend this feature. What do you say?

@bitnom
Copy link
Contributor

bitnom commented Aug 6, 2024

Hi, thanks for raising this! That's definitely something we've been thinking about for while.

We did our initial research focusing on only 2 models as a start, but more research is required to build routers that work well with multiple models.

This is the direction I want to go personally. Please share your thoughts on testing and implementation. Are you already working on it or not yet?

@villqrd
Copy link

villqrd commented Aug 7, 2024

@bitnom currently working on more than 2 models, you can see some of our results and read more about it here: https://tryplurally.com/.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants