SGLang Integration + Accuracy Tests, Restructure app_tests/integration_tests #570

stbaione · 2024-11-19T17:02:59Z

Description

This PR implements integration tests for the Shortfin LLM server with the SGLang integration. The tests run llama3-8b-instruct on GPU; the model is downloaded using sharktank's hf_datasets script.

The tests serve two purposes:

1. Test that the SGLang integration works properly at a functional level.
2. Test that the accuracy of the responses from the Shortfin LLM server is consistent.
   - We have a batch of candidate questions, with expected answers.
   - We have temperature set to 1.0, so the responses should be deterministic.
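The accuracy check described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the comparison metric (`difflib.SequenceMatcher`), the `threshold` value, the `find_accuracy_failures` helper, and the `fake_generate` stand-in for a call to the server are all assumptions made for the example.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List


def similarity(expected: str, actual: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the normalized strings are identical."""
    def normalize(text: str) -> str:
        # Lowercase and collapse whitespace so trivial formatting differences are ignored.
        return " ".join(text.lower().split())

    return SequenceMatcher(None, normalize(expected), normalize(actual)).ratio()


def find_accuracy_failures(
    generate: Callable[[str], str],
    cases: Dict[str, str],
    threshold: float = 0.9,  # assumed cutoff, chosen only for illustration
) -> List[str]:
    """Return the questions whose response drifted too far from the expected answer."""
    failures = []
    for question, expected in cases.items():
        actual = generate(question)
        if similarity(expected, actual) < threshold:
            failures.append(question)
    return failures


# Hypothetical question/answer pair; the real batch lives in the test files.
CASES = {
    "Name the capital of the United States.":
        "The capital of the United States is Washington, D.C.",
}


def fake_generate(question: str) -> str:
    """Stand-in for a request to the LLM server (an assumption for this sketch)."""
    return CASES[question]


failures = find_accuracy_failures(fake_generate, CASES)
```

Because the responses are expected to be deterministic, a check like this can use a strict threshold; any question that appears in `failures` points to a change in model output rather than sampling noise.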

This test is intended to run every 4 hours, which allows us to detect degradations in Shortfin LLM output accuracy. If we do get a failure due to an accuracy degradation, only a small set of shark-ai/iree commits could be responsible.
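To illustrate why a 4-hour cadence keeps the suspect set small, here is a hedged sketch that filters a commit history down to the window between the last passing run and the first failing run. The `Commit` type, the `suspect_commits` helper, and all SHAs and timestamps are hypothetical placeholders, not real data from either repository.

```python
from datetime import datetime, timezone
from typing import List, NamedTuple


class Commit(NamedTuple):
    sha: str  # placeholder identifier, not a real commit
    repo: str
    committed_at: datetime


def suspect_commits(
    commits: List[Commit], last_pass: datetime, first_fail: datetime
) -> List[Commit]:
    """Commits that landed after the last passing run and up to the failing run."""
    return [c for c in commits if last_pass < c.committed_at <= first_fail]


# Hypothetical data: two scheduled runs four hours apart.
last_pass = datetime(2024, 11, 19, 12, 0, tzinfo=timezone.utc)
first_fail = datetime(2024, 11, 19, 16, 0, tzinfo=timezone.utc)
history = [
    Commit("aaaa001", "shark-ai", datetime(2024, 11, 19, 9, 30, tzinfo=timezone.utc)),
    Commit("bbbb002", "iree", datetime(2024, 11, 19, 13, 15, tzinfo=timezone.utc)),
    Commit("cccc003", "shark-ai", datetime(2024, 11, 19, 15, 45, tzinfo=timezone.utc)),
]
suspects = suspect_commits(history, last_pass, first_fail)
```

Only the commits inside the 4-hour window remain as candidates, so the shorter the interval between runs, the fewer commits need to be inspected.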

stbaione requested a review from renxida November 19, 2024 17:02

stbaione self-assigned this Nov 19, 2024

renxida approved these changes Nov 19, 2024

app_tests/integration_tests/llm/sglang/conftest.py

app_tests/integration_tests/llm/sglang/sglang_frontend_test.py

stbaione added 4 commits November 19, 2024 16:41

Implement sglang integration tests,

808511e

Restructure app_tests/integration_tests, Add copyright headers to files in integration_tests that were missing it

Don't pin iree-base-compiler and iree-base-runtime

8cacfd1

Fix path to sglang integration tests

5cac718

Remove PR trigger from workflow,

627af2d

Add more logging and a little cleanup in sglang_frontend_test

renxida force-pushed the slg-integration-tests branch from 6f56b59 to 627af2d November 19, 2024 21:41

stbaione merged commit ac17f86 into nod-ai:main Nov 19, 2024
4 of 5 checks passed
