[V1][Metrics] Hook up IterationStats for Prometheus metrics #12478

markmc · 2025-01-27T18:20:36Z

Follow on from #12416, part of #10582

Pass IterationStats to the stat logger, and log them in both the logging and prometheus loggers.

For the logging stat logger, we need to calculate the throughput based on the number of tokens in the particular logging interval.
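As a rough illustration of the interval-based calculation described above, here is a minimal sketch. The class name `IntervalThroughputTracker`, its methods, and the explicit `now` argument are hypothetical and chosen for this example; this is not the code from the PR.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class IntervalThroughputTracker:
    """Hypothetical helper: accumulates token counts between two log calls."""

    last_log_time: float
    prompt_tokens: List[int] = field(default_factory=list)
    generation_tokens: List[int] = field(default_factory=list)

    def record(self, num_prompt_tokens: int, num_generation_tokens: int) -> None:
        # Called once per engine iteration with that iteration's token counts.
        self.prompt_tokens.append(num_prompt_tokens)
        self.generation_tokens.append(num_generation_tokens)

    def flush(self, now: float) -> Tuple[float, float]:
        # Throughput covers only the tokens seen since the previous flush.
        elapsed = now - self.last_log_time
        if elapsed <= 0:
            rates = (0.0, 0.0)
        else:
            rates = (
                sum(self.prompt_tokens) / elapsed,
                sum(self.generation_tokens) / elapsed,
            )
        # Reset so the next interval starts from zero.
        self.prompt_tokens.clear()
        self.generation_tokens.clear()
        self.last_log_time = now
        return rates
```

Passing the timestamp in explicitly keeps the sketch deterministic; a real logger would more likely read a monotonic clock itself.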

In the prometheus logger, we just need to record the prompt and generation tokens in a counter.
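By contrast with the interval-based logging above, a counter only ever accumulates, and any rate is derived later at query time. The sketch below illustrates that idea using the standard library only. `MonotonicCounter` is a stand-in for a real Prometheus counter, and `TokenCounterLogger` and the two `IterationStats` fields shown here are assumptions made for this example, not the definitions from the PR.

```python
from dataclasses import dataclass


@dataclass
class IterationStats:
    # Hypothetical stand-in carrying only the two fields this sketch needs.
    num_prompt_tokens: int = 0
    num_generation_tokens: int = 0


class MonotonicCounter:
    """Stand-in with counter semantics: the value can only increase."""

    def __init__(self) -> None:
        self.value = 0.0

    def inc(self, amount: float = 1.0) -> None:
        if amount < 0:
            raise ValueError("a counter can only be incremented by a non-negative amount")
        self.value += amount


class TokenCounterLogger:
    """Hypothetical logger: adds each iteration's token counts to running totals."""

    def __init__(self) -> None:
        self.prompt_tokens = MonotonicCounter()
        self.generation_tokens = MonotonicCounter()

    def log(self, stats: IterationStats) -> None:
        # No interval bookkeeping and no reset: totals grow for the process lifetime.
        self.prompt_tokens.inc(stats.num_prompt_tokens)
        self.generation_tokens.inc(stats.num_generation_tokens)
```

In a real deployment the stand-in would be replaced by the counter type of the Prometheus client library in use.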

Note, v0 had a vllm:tokens_total counter registered that apparently was never logged to, so I've omitted it in v1.


github-actions · 2025-01-27T18:20:49Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

- Add the ready label to the PR
- Enable auto-merge

🚀

robertgshaw2-redhat · 2025-01-28T15:08:43Z

LGTM


markmc requested review from DarkLight1337, robertgshaw2-redhat, simon-mo, WoosukKwon, njhill, ywang96, comaniac and alexm-redhat as code owners January 27, 2025 18:20

markmc changed the title ~~[V1][Metrics] Hook up IterationStats~~ [V1][Metrics] Hook up IterationStats for Prometheus metrics Jan 27, 2025

robertgshaw2-redhat approved these changes Jan 28, 2025


robertgshaw2-redhat added the ready label (ONLY add when PR is ready to merge/full CI is needed) Jan 28, 2025

robertgshaw2-redhat enabled auto-merge (squash) January 28, 2025 15:08

mgoin approved these changes Jan 28, 2025


robertgshaw2-redhat merged commit 3fd1fb6 into vllm-project:main Jan 28, 2025
62 checks passed

markmc mentioned this pull request Jan 28, 2025

[V1][Metrics] Add per-request prompt/generation_tokens histograms #12516

Merged

rasmith pushed a commit to rasmith/vllm that referenced this pull request Jan 30, 2025

[V1][Metrics] Hook up IterationStats for Prometheus metrics (vllm-pro…

6b11564

…ject#12478) Signed-off-by: Mark McLoughlin <[email protected]>

Isotr0py pushed a commit to Isotr0py/vllm that referenced this pull request Feb 2, 2025

[V1][Metrics] Hook up IterationStats for Prometheus metrics (vllm-pro…

10f7ae2

…ject#12478) Signed-off-by: Mark McLoughlin <[email protected]> Signed-off-by: Isotr0py <[email protected]>

markmc deleted the metrics-v1-prometheus-logger-2 branch February 5, 2025 11:52

NickLucche pushed a commit to NickLucche/vllm that referenced this pull request Feb 7, 2025

[V1][Metrics] Hook up IterationStats for Prometheus metrics (vllm-pro…

3f70c88

…ject#12478) Signed-off-by: Mark McLoughlin <[email protected]>
