Re-use client or create new one each time? #783

datdo-msft · 2024-09-19T19:58:37Z

Hi, my question is about using the Triton client within a FastAPI server to send requests downstream to Triton. Is it recommended to create a single instance of the Triton client and re-use it for each request? Or should we create a new instance of the Triton client for each new request?

I'm asking because, for streaming requests, the gRPC InferenceServerClient only supports one active stream at a time, so a single shared client could not serve overlapping streaming requests. Please let me know, and thanks in advance!
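
For concreteness, the two options could be sketched roughly as below. This is only an illustration of the question, not a recommendation: `StubStreamingClient` is a hypothetical stand-in that mimics the one-stream-at-a-time limit (it is not the real Triton client), and the `localhost:8001` URL and the two helper function names are assumptions made up for the example.

```python
import threading


class StubStreamingClient:
    """Hypothetical stand-in for a gRPC client limited to one active stream.

    It is NOT the real Triton client; it only models the constraint
    described above so the two options can be compared.
    """

    def __init__(self, url: str) -> None:
        self.url = url
        self._stream_active = False
        self._lock = threading.Lock()

    def start_stream(self) -> None:
        # Refuse to open a second stream while one is still active.
        with self._lock:
            if self._stream_active:
                raise RuntimeError("a stream is already active on this client")
            self._stream_active = True

    def stop_stream(self) -> None:
        with self._lock:
            self._stream_active = False


def stream_with_shared_client(client: StubStreamingClient) -> str:
    """Option A: every request handler uses the same long-lived client."""
    client.start_stream()  # raises if another request is mid-stream
    try:
        return "streamed"
    finally:
        client.stop_stream()


def stream_with_new_client(url: str) -> str:
    """Option B: each request builds its own client, so streams never collide."""
    client = StubStreamingClient(url)
    client.start_stream()
    try:
        return "streamed"
    finally:
        client.stop_stream()


if __name__ == "__main__":
    shared = StubStreamingClient("localhost:8001")  # assumed address
    shared.start_stream()  # first request is still streaming
    try:
        shared.start_stream()  # a second, overlapping request on the same client
    except RuntimeError as exc:
        print("shared client:", exc)
    finally:
        shared.stop_stream()
    print("new client per request:", stream_with_new_client("localhost:8001"))
```

With the stub, option A fails as soon as two streaming requests overlap, while option B pays the cost of creating a client for every request.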
