-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
More AI docs updates #12515
More AI docs updates #12515
Conversation
Signed-off-by: Marcel Klehr <[email protected]>
…manually Signed-off-by: Marcel Klehr <[email protected]>
…ion2 Signed-off-by: Marcel Klehr <[email protected]>
This app requires underlying Large language models to support tool calling. The default model in *llm2* does *not* support tool calling. Instead we recommend: | ||
|
||
* Qwen 2.5 8B or higher | ||
* Watt Tool 8B or higher |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe we can switch the default model in llm2 in favour of one of these models.
nothing again this PR though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All good except a few small change requests.
I could build after reinstalling the dependencies.
Scaling | ||
------- | ||
|
||
It is currently not possible to scale this app, we are working on this. Based on our calculations an instance has a rough capacity of 120 image requests per hour (each user request can be for multiple images). However, this number is based on theory and we do appreciate real-world feedback on this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the testing setup could be mentioned here for a rough idea
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you elaborate?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I mean some details of the machine on which 120 images/h was achieved, the gpu and cpu perhaps.
Co-authored-by: Julien Veyssier <[email protected]> Signed-off-by: Marcel Klehr <[email protected]>
I have python 3.13 on this machine, maybe it's related to that :/ |
Signed-off-by: Marcel Klehr <[email protected]>
Signed-off-by: Marcel Klehr <[email protected]>
🖼️ Screenshots
Sorry, can't get it to build for some reason.