This repository has been archived by the owner on Nov 13, 2024. It is now read-only.

[Feature] Any plan to support async and streaming? #233

Open
2 tasks done
hustxx opened this issue Dec 21, 2023 · 7 comments
Labels
enhancement New feature or request

Comments

@hustxx

hustxx commented Dec 21, 2023

Is this your first time submitting a feature request?

  • I have searched the existing issues, and I could not find an existing issue for this feature
  • I am requesting a straightforward extension of existing functionality

Describe the feature

We will need async and streaming to integrate Canopy into our app. Do you have a timeline for when those will be supported?

Describe alternatives you've considered

No response

Who will this benefit?

No response

Are you interested in contributing this feature?

No response

Anything else?

No response

@hustxx hustxx added the enhancement New feature or request label Dec 21, 2023
@miararoy
Contributor

Yes, we have plans to add async routes. We are now planning for 2024 Q1/Q2, so we should have better estimates soon.

As for streaming, can you elaborate a bit more? What are you trying to achieve?

@izellevy izellevy changed the title Any plan to support async and streaming? [Feature] Any plan to support async and streaming? Dec 21, 2023
@usamasaleem1

Same here; we definitely need faster upsert methods. A bulk upload is taking forever.

@miararoy
Contributor

@usamasaleem1 @hustxx
We agree. This is more complex than it looks: the async route for upsert actually involves processing, chunking, embedding, and upserting. Some of those steps are IO-bound and some are CPU-bound, which is always a challenge to combine.

Having said that, we will start working on this next week and will update this issue as we progress 🙏
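
For context, the usual pattern for combining the two looks roughly like this. This is a sketch only, not Canopy's implementation, and all helper names here are hypothetical:

import asyncio
from concurrent.futures import ProcessPoolExecutor

# Hypothetical stand-ins for the processing/chunking/embedding and upsert steps.
def chunk_and_embed(document):
    # CPU-bound work: run in a process pool so it doesn't block the event loop
    ...

async def upsert_vectors(vectors):
    # IO-bound work: awaited directly so many upserts can be in flight at once
    ...

async def async_upsert(documents):
    loop = asyncio.get_running_loop()
    with ProcessPoolExecutor() as pool:
        # Offload the CPU-bound steps to worker processes, concurrently per document.
        embedded = await asyncio.gather(
            *(loop.run_in_executor(pool, chunk_and_embed, doc) for doc in documents)
        )
    # Run the IO-bound network upserts concurrently on the event loop.
    await asyncio.gather(*(upsert_vectors(vectors) for vectors in embedded))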

@Evanrsl

Evanrsl commented Feb 2, 2024

> Yes, we have plans to add async routes. We are now planning for 2024 Q1/Q2, so we should have better estimates soon.
>
> As for streaming, can you elaborate a bit more? What are you trying to achieve?

I think a streaming chatbot feature is essential. I built one on my Anyscale inference endpoint and am now trying to move it to Canopy, but I couldn't find any documentation about it. For my Anyscale chatbot I followed these docs: https://docs.endpoints.anyscale.com/examples/openai-chat-agent/

@scottmx81
Contributor

scottmx81 commented Feb 7, 2024

@Evanrsl Canopy already supports streaming responses; it's fully built into the codebase. Canopy implements the same interface as the OpenAI chat completions API, so if you pass stream=True in the create call, with Canopy as the base URL instead of the Anyscale one (as in your link), you'll get a streaming response in exactly the same way.
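
For reference, a minimal sketch of consuming that stream with the openai Python client. The base URL, api_key placeholder, and model name below are assumptions about a typical local Canopy server setup, so adjust them to your deployment:

import openai

# Point the standard OpenAI client at the Canopy server (address assumed here).
client = openai.OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="placeholder",  # the client requires a value; assumed not checked by the server
)

stream = client.chat.completions.create(
    model="gpt-3.5-turbo",  # example only; use a model your configured LLM backend accepts
    messages=[{"role": "user", "content": "What is Canopy?"}],
    stream=True,
)

# With stream=True the response is an iterator of chunks, just like the OpenAI API.
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)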

@Evanrsl

Evanrsl commented Feb 12, 2024

@scottmx81 Thanks for the explanation. What about the model parameter? Just to make sure, is my code correct?

import openai

client = openai.OpenAI(
    base_url="http://0.0.0.0:8000/v1",  # Canopy server; the client appends /chat/completions itself
)
chat_completion = client.chat.completions.create(
    messages=[{"role": "user", "content": "test message"}],
    model=???,  # <- what should go here?
    stream=True,
)

@scottmx81
Contributor

@Evanrsl The value for model depends on which LLM backend you are using in Canopy. If you are using the default OpenAI backend, the OpenAI API reference lists the valid model values; if you are using the Cohere backend, the Cohere docs do. Canopy passes the model value through to the underlying LLM, and that LLM's API determines which models you are allowed to use.
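
So, filling in the snippet above under the assumption of the default OpenAI backend (the model name is only an example; any model your backend's API accepts will work):

import openai

client = openai.OpenAI(
    base_url="http://0.0.0.0:8000/v1",
)
chat_completion = client.chat.completions.create(
    messages=[{"role": "user", "content": "test message"}],
    model="gpt-3.5-turbo",  # example for the OpenAI backend; a Cohere backend would take a Cohere model name instead
    stream=True,
)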
