Add first two LLM test guides #396
Conversation
Looks great!
The only issue is that it is not clear that two different experiments are described here. Can you add two headers, one for first-token latency and one for token-to-token (T2T) latency?
Force-pushed from 7d979cf to b44ab4e
src/c++/perf_analyzer/docs/examples/calculate_avg_first_token_latency.py: Fixed (resolved)
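For reference, this script computes the average first-token latency, i.e. the time from sending a request to the arrival of the first response. A minimal sketch of that calculation, assuming the profile export is a JSON file with per-request send and response timestamps in nanoseconds (field names such as `experiments`, `requests`, `timestamp`, and `response_timestamps` are illustrative assumptions, not a confirmed perf_analyzer schema):

```python
import json

def avg_first_token_latency(export_path: str) -> float:
    """Average time from request send to first response, in milliseconds."""
    with open(export_path) as f:
        data = json.load(f)

    latencies_ns = []
    for experiment in data["experiments"]:          # assumed top-level key
        for request in experiment["requests"]:      # assumed per-request records
            sent_ns = request["timestamp"]          # request send time (ns)
            first_ns = request["response_timestamps"][0]  # first token arrival (ns)
            latencies_ns.append(first_ns - sent_ns)

    return sum(latencies_ns) / len(latencies_ns) / 1e6  # ns -> ms

if __name__ == "__main__":
    print(f"Avg first-token latency: {avg_first_token_latency('profile_export.json'):.2f} ms")
```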
src/c++/perf_analyzer/docs/examples/calculate_avg_token_to_token_latency.py: Fixed (resolved)
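Similarly, a sketch of the token-to-token (T2T) latency calculation, averaging the gap between consecutive response timestamps within each request. The same caveat applies: the field names are assumptions, not a confirmed export schema.

```python
import json

def avg_token_to_token_latency(export_path: str) -> float:
    """Average gap between consecutive response timestamps, in milliseconds."""
    with open(export_path) as f:
        data = json.load(f)

    gaps_ns = []
    for experiment in data["experiments"]:      # assumed top-level key
        for request in experiment["requests"]:  # assumed per-request records
            stamps = request["response_timestamps"]
            # Pairwise differences between consecutive token arrivals.
            gaps_ns += [b - a for a, b in zip(stamps, stamps[1:])]

    return sum(gaps_ns) / len(gaps_ns) / 1e6  # ns -> ms
```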
Some minor comments, otherwise looks good!
src/c++/perf_analyzer/docs/examples/calculate_avg_first_token_latency.py (resolved)
src/c++/perf_analyzer/docs/examples/calculate_avg_token_to_token_latency.py (resolved)
Force-pushed from f84cc02 to 4281537
LGTM! 🚀
Comment to make sure triton-inference-server/tutorials#46 is taken into account for how this guide works.
Add guides for the prefill and generation steps of LLMs.