Feature request: Provide support for concurrent HTTP requests #75

fullymiddleaged · 2024-12-28T09:45:24Z

Howdy! :)

I appreciate this could be a challenge in C++, but since onnxruntime supports multi-threading, it would be great if the HTTP/TCP server side of this app did too, creating a new thread per request. This would make better use of the CPU and improve the client experience. At the moment, when I test concurrent (overlapping) HTTP requests, the results come back serially, which sharply increases response times for each concurrent user.

E.g. if my model takes around 500 ms to respond, I am seeing average times of 2500 ms when tested with 5 concurrent users.
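Figures like these are consistent with requests being handled one at a time. As a rough model (an assumption, not a measurement of the server): if five requests arrive together and each needs 500 ms of service from a single worker, the k-th request served completes after k × 500 ms, so the last one finishes at 2500 ms and the mean completion time is 1500 ms. The helper names below are hypothetical and exist only for this illustration.

```cpp
#include <cassert>
#include <numeric>
#include <vector>

// Completion time (ms) of each of `n` requests that arrive together and
// are served strictly one after another by a single worker.
// Simplified model: ignores network and queueing overhead.
std::vector<int> serial_completion_times_ms(int n, int service_ms) {
    assert(n >= 0 && service_ms >= 0);
    std::vector<int> times;
    times.reserve(static_cast<std::size_t>(n));
    for (int k = 1; k <= n; ++k) {
        times.push_back(k * service_ms);  // k-th request waits for k-1 others
    }
    return times;
}

// Arithmetic mean of the completion times (0 for an empty input).
double mean_ms(const std::vector<int>& times) {
    if (times.empty()) return 0.0;
    return std::accumulate(times.begin(), times.end(), 0.0) / times.size();
}
```

With fully concurrent handling and enough free cores, the same model would put every request close to the single-request time of about 500 ms.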

I'm not sure whether the TCP API behaves differently and perhaps already supports concurrent requests where HTTP does not.

Let me know what you think!

kibae · 2024-12-31T08:00:45Z

Hello, @fullymiddleaged :)

The onnxruntime-server currently operates using a thread pool, at least for the TCP server. After reviewing your question, I noticed that the HTTP/HTTPS server does not seem to utilize the thread pool. I appreciate you bringing this to our attention.

I’m currently working on addressing this issue. It seems that a thread-per-client model might be a better fit for handling concurrent HTTP requests effectively. I’ll explore potential solutions and get back to you with an update soon.
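For readers unfamiliar with the thread-per-client model mentioned here, the following is a minimal sketch of the idea using only the C++ standard library. It is not the onnxruntime-server implementation: `Handler` and `serve_thread_per_client` are hypothetical names, and a plain callable stands in for the real HTTP request handling.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <thread>
#include <vector>

// Hypothetical stand-in for the work done to answer one client.
using Handler = std::function<std::string(int client_id)>;

// Runs the handler for every client on its own thread and collects the
// responses. Each thread writes only to its own slot, so no lock is needed.
std::vector<std::string> serve_thread_per_client(int n_clients,
                                                 const Handler& handler) {
    assert(n_clients >= 0);
    std::vector<std::string> responses(static_cast<std::size_t>(n_clients));
    std::vector<std::thread> threads;
    threads.reserve(responses.size());

    for (int id = 0; id < n_clients; ++id) {
        threads.emplace_back([&responses, &handler, id] {
            responses[static_cast<std::size_t>(id)] = handler(id);
        });
    }
    for (auto& t : threads) {
        t.join();  // wait for every client to be answered
    }
    return responses;
}
```

Because the handlers run in parallel, a slow request no longer delays the ones behind it. The trade-off is that the number of threads grows with the number of clients, so a real server would normally cap or otherwise bound it.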

Thank you for your patience!


kibae · 2024-12-31T13:55:53Z

Hi, @fullymiddleaged

I have resolved this issue. I usually align my releases with the ONNX Runtime release cycle. Can you wait until a version after 1.20.1 is released? If it’s urgent, I can release a 1.20.1a version.


fullymiddleaged · 2025-01-01T04:07:21Z

Hey @kibae!

Happy new year! And no worries, I'm glad I raised it and that you have resolved it. I can probably wait until the next release for this, but I'm looking forward to testing it then!

kibae added the bug label (Something isn't working) Dec 31, 2024

kibae self-assigned this Dec 31, 2024

kibae mentioned this issue Dec 31, 2024 in: Logging query - single POST shows as 3 entries in log #74 (Open)

kibae added a commit that referenced this issue Dec 31, 2024: ed952a1, feat: #75 #74 Remove thread pool and create a thread for each client (breaking change)

kibae added a commit that referenced this issue Dec 31, 2024: f33a7ee, feat: #75 #74 Remove thread pool and create a thread for each client (breaking change) (#76)

kibae linked a pull request that will close this issue, then removed the link, Dec 31, 2024: feat: #75 Allocate a separate thread pool exclusively for ONNX session execution tasks. #78 (Merged)

kibae added a commit that referenced this issue Dec 31, 2024: 90b9662, feat: #75 Allocate a separate thread pool exclusively for ONNX session execution tasks. (#78)
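The title of #78 describes a thread pool reserved for ONNX session execution. One common way to build such a pool is a fixed set of workers pulling tasks from a shared queue, sketched below. This is an illustration under stated assumptions, not the code merged in #78: `InferencePool` is a hypothetical name, and tasks return an `int` in place of real inference output.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <future>
#include <memory>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>
#include <vector>

// Fixed-size pool: client threads submit work, a bounded number of
// workers execute it, so compute-heavy tasks cannot oversubscribe the CPU.
class InferencePool {
public:
    explicit InferencePool(std::size_t n_workers) {
        assert(n_workers > 0);
        for (std::size_t i = 0; i < n_workers; ++i) {
            workers_.emplace_back([this] { run(); });
        }
    }

    ~InferencePool() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            stopping_ = true;
        }
        cv_.notify_all();
        for (auto& w : workers_) w.join();  // queued tasks are drained first
    }

    InferencePool(const InferencePool&) = delete;
    InferencePool& operator=(const InferencePool&) = delete;

    // Queue a task; the returned future yields its result when a worker ran it.
    std::future<int> submit(std::function<int()> task) {
        auto packaged =
            std::make_shared<std::packaged_task<int()>>(std::move(task));
        std::future<int> result = packaged->get_future();
        {
            std::lock_guard<std::mutex> lock(mutex_);
            tasks_.push([packaged] { (*packaged)(); });
        }
        cv_.notify_one();
        return result;
    }

private:
    void run() {
        for (;;) {
            std::function<void()> task;
            {
                std::unique_lock<std::mutex> lock(mutex_);
                cv_.wait(lock, [this] { return stopping_ || !tasks_.empty(); });
                if (tasks_.empty()) return;  // stopping and nothing left to do
                task = std::move(tasks_.front());
                tasks_.pop();
            }
            task();  // run outside the lock so workers execute in parallel
        }
    }

    std::mutex mutex_;
    std::condition_variable cv_;
    std::queue<std::function<void()>> tasks_;
    bool stopping_ = false;
    std::vector<std::thread> workers_;
};
```

In this arrangement each client thread would call `submit` and block on the future, so connection handling scales with the number of clients while the number of simultaneous model executions stays bounded by the pool size.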
