[Feature]: Support huggingface transformers LLM model #335
Comments
You can try the GPTCache API. A simple example:
Yes, you can use the
If you use LangChain, you can also use it like this:
@Zjq9409 If there are no other questions, I will close the issue.
Why is the last question cached?
because the |
Does it support Hugging Face conversation caching?
For conversations, it depends on how you use the LLM. If you pass all of the conversation information to GPTCache, it will work fine, like the messages of openai's
I used ChatGLM to generate chat responses and need to cache them, but got an error.
@Zjq9409 Based on the error stack, can you try running the LLM directly, like:
because I guess the error is caused by the empty text in the
It still reports the same problem.
@Zjq9409 If the same problem occurs when you run the code:
then it is probably not caused by GPTCache but by the LLM model itself.
Actually, the usage method is:
@Zjq9409 yap |
When the cache is empty, GPTCache calls the original LLM to get the answer and then saves that answer to the cache. The next time you send a similar request, the answer comes from the cache instead of the model.
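The flow described above (miss, call the model, store, then hit on a similar request) can be sketched roughly as follows. This is not GPTCache's implementation or API: `SimpleSimilarityCache`, `cached_answer`, and the 0.8 threshold are hypothetical names and values chosen for illustration, and plain string similarity from the standard library stands in for whatever embedding-based matching a real cache would use.

```python
from difflib import SequenceMatcher


class SimpleSimilarityCache:
    """Toy cache (hypothetical, not GPTCache): stores question/answer pairs
    and matches new questions by plain string similarity."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed cutoff for "similar enough"
        self.entries = []           # list of (question, answer) tuples

    def get(self, question):
        """Return the answer of the most similar cached question, or None."""
        best_answer, best_score = None, 0.0
        for cached_question, answer in self.entries:
            score = SequenceMatcher(
                None, question.lower(), cached_question.lower()
            ).ratio()
            if score > best_score:
                best_answer, best_score = answer, score
        return best_answer if best_score >= self.threshold else None

    def put(self, question, answer):
        self.entries.append((question, answer))


def cached_answer(question, llm, cache):
    """Return (answer, was_cache_hit); call `llm` only on a cache miss."""
    hit = cache.get(question)
    if hit is not None:
        return hit, True
    answer = llm(question)       # cache miss: ask the original model
    cache.put(question, answer)  # save the answer for later similar requests
    return answer, False
```

Here `llm` can be any callable that maps a prompt string to an answer string, for example a thin wrapper around a Hugging Face transformers model.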
@Zjq9409 Is your problem solved? If you want to use a Hugging Face transformers LLM model, you can use the GPTCache API. If you encounter other problems, you can open a new issue.
I also encountered this problem. How can it be solved?
@iunique Hi, you can open a new issue and describe your problem.
Is your feature request related to a problem? Please describe.
Can chat caching for Hugging Face LLM models be supported?
Describe the solution you'd like.
No response
Describe an alternate solution.
No response
Anything else? (Additional Context)
No response