feat: Support sending additional outputs from vLLM inference #70
base: main
Conversation
* [WIP] Add additional outputs to auto complete
* [WIP] Use individual input tensor to control per additional output
* [WIP] Parse additional output flags from request

264d387 to 9fc7d0b
* Add additional outputs test
* Update copyright
* Some test enhancement and notes

9fc7d0b to 5e605ca
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
-->

# Additional Outputs from vLLM
Excellent documentation and concise reference from top-level README!
# TODO: vLLM may return token ids identical to the previous one when
# streaming, for example:
#
# prev: None
# curr: text=' the', token_ids=array('l', [5])
#
# prev: text=' the', token_ids=array('l', [5, 1385])
# curr: text=' the term', token_ids=array('l', [5, 1385])
#
# prev: text=' the term', token_ids=array('l', [5, 1385, 44])
# curr: text=' the term', token_ids=array('l', [5, 1385, 44])
#
# prev: text=' the term', token_ids=array('l', [5, 1385, 44, 48])
# curr: text=' the term “', token_ids=array('l', [5, 1385, 44, 48])
#
# If this is no longer the case in a future release, change the assert
# to assert num_token_ids > 0.
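For context, a minimal sketch of the per-step delta this TODO guards; `prev_output` and `curr_output` are illustrative names for consecutive streamed outputs of the same sequence, not the PR's actual variables:

```python
def num_new_token_ids(prev_output, curr_output):
    # Illustrative sketch: count the tokens added in this streaming step by
    # comparing the cumulative token_ids of consecutive outputs.
    prev_len = 0 if prev_output is None else len(prev_output.token_ids)
    delta = len(curr_output.token_ids) - prev_len
    # The duplicated outputs described above make a delta of 0 possible today;
    # if a future vLLM release guarantees progress every step, tighten to > 0.
    assert delta >= 0
    return delta
```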
I'm not totally following this section. Can you show me an example output from calling `/generate_stream` on a vLLM model with `output_num_token_ids=True` and `stream=True`?
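For context, a request of that shape might look roughly like the sketch below; it assumes the model is served behind Triton's `generate_stream` endpoint, and the model name, prompt, and `max_tokens` value are placeholders:

```python
import json
import requests

# Hypothetical request shape; the flag name comes from the discussion above.
payload = {
    "text_input": "Explain what polarized light is.",
    "stream": True,
    "output_num_token_ids": True,
    "sampling_parameters": json.dumps({"max_tokens": 32}),
}
with requests.post(
    "http://localhost:8000/v2/models/vllm_model/generate_stream",
    json=payload,
    stream=True,
) as resp:
    for line in resp.iter_lines():
        if line:
            print(line.decode())  # SSE "data: {...}" chunks, one per streamed output
```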
On 0.5.3.post1:
text: ' the'
num_token_ids: 1
-----
text: ' term'
num_token_ids: 1
-----
text: ''
num_token_ids: 1
-----
text: ' “'
num_token_ids: 1
-----
text: 'p'
num_token_ids: 1
-----
text: 'olar'
num_token_ids: 1
-----
text: 'ized'
num_token_ids: 1
-----
text: ''
num_token_ids: 1
-----
text: '”'
num_token_ids: 1
-----
text: ' is'
num_token_ids: 1
-----
text: ' used'
num_token_ids: 1
-----
text: ' to'
num_token_ids: 1
-----
text: ' refer'
num_token_ids: 1
-----
text: ' to'
num_token_ids: 1
-----
text: ' a'
num_token_ids: 1
-----
text: ' polar'
num_token_ids: 1
-----
which is expected, but on 0.5.5:
text: ' the'
num_token_ids: 1
-----
text: ' term'
num_token_ids: 0
-----
text: ''
num_token_ids: 0
-----
text: ' “'
num_token_ids: 0
-----
text: 'p'
num_token_ids: 0
-----
text: 'olar'
num_token_ids: 0
-----
text: 'ized'
num_token_ids: 0
-----
text: ''
num_token_ids: 0
-----
text: '”'
num_token_ids: 0
-----
text: ' is'
num_token_ids: 0
-----
text: ' used'
num_token_ids: 0
-----
text: ' to'
num_token_ids: 0
-----
text: ' refer'
num_token_ids: 0
-----
text: ' to'
num_token_ids: 0
-----
text: ' a'
num_token_ids: 0
-----
text: ' polar'
num_token_ids: 0
It appears the previous output (the `token_ids` field) is overwritten by the current output when the engine is streaming outputs for a request.
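A hedged sketch of one way around that aliasing: snapshot what is needed from the current output instead of keeping a reference to the engine's object (`curr_output` is an illustrative name):

```python
def snapshot_token_ids(curr_output):
    # Illustrative workaround: copy the cumulative token ids out of the engine's
    # output object, so a later in-place update by the engine cannot rewrite the
    # "previous" value used for the next per-step comparison.
    return list(curr_output.token_ids)
```

Tracking just the previous length as a plain integer would work equally well if only a count is needed.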
The issue appears to be fixed in a later release, i.e. 0.6.3.post1:
text: ' the'
num_token_ids: 1
-----
text: ' term'
num_token_ids: 1
-----
text: ''
num_token_ids: 1
-----
text: ' “'
num_token_ids: 1
-----
text: 'p'
num_token_ids: 1
-----
text: 'olar'
num_token_ids: 1
-----
text: 'ized'
num_token_ids: 1
-----
text: ''
num_token_ids: 1
-----
text: '”'
num_token_ids: 1
-----
text: ' is'
num_token_ids: 1
-----
text: ' used'
num_token_ids: 1
-----
text: ' to'
num_token_ids: 1
-----
text: ' refer'
num_token_ids: 1
-----
text: ' to'
num_token_ids: 1
-----
text: ' a'
num_token_ids: 1
-----
text: ' polar'
num_token_ids: 1
I think `num_token_ids` could probably be better named to reflect that it counts the output tokens. OpenAI APIs return information about both input (prompt/context) and output (decode, generation) tokens - so we should leave room for that with clear naming, even if we only implement the output tokens in this PR.
How about `num_output_tokens`, and in the future, if requested, `num_input_tokens`?
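For illustration, declaring such an output during auto-complete could look roughly like the sketch below; the data type and dims are assumptions, not necessarily what the PR uses:

```python
def auto_complete_config(auto_complete_model_config):
    # Hypothetical sketch: declare the optional count output during auto-complete.
    # TYPE_UINT32 and dims [-1] are assumptions for illustration only.
    auto_complete_model_config.add_output(
        {"name": "num_output_tokens", "data_type": "TYPE_UINT32", "dims": [-1]}
    )
    return auto_complete_model_config
```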
The original ask is for "number of tokens", but I agree that we could return `token_ids` instead: from a feature perspective, it keeps the "return_*" space less crowded if returning `token_ids` is requested in the future in addition to `num_token_ids`.
Sorry, I edited my responses above. While returning `token_ids` is probably more general, for long sequences it could actually be more costly than returning a single number if all the user wants is the count (for non-streaming), and it would require some client-side postprocessing just to get the count, so I removed that part for now. I'm a little torn on it.
We can probably just stick to the original ask of returning token counts, and consider either verbose logging (for debugging) or additional outputs for the actual token_ids if requested later?
Return token ids instead of number of token ids
The model.py in the CI pipeline has been replaced with the version from this commit.
Yes, we can just add `token_ids` if requested later. Will update the name `num_token_ids` to `num_output_tokens`.
Rename num_token_ids to num_output_tokens
The model.py in the CI pipeline has been replaced with the version from this commit.
docs/additional_outputs.md (Outdated)
[here](https://github.com/vllm-project/vllm/blob/v0.6.3.post1/vllm/outputs.py#L26)
for more details.

To enable, set `output_finish_reason` input tensor to `True`. The reason will be
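As a usage illustration only, setting such a BOOL switch from a Python client could look like the following; the surrounding client setup is omitted and the variable names are made up:

```python
import numpy as np
import tritonclient.grpc as grpcclient

# Hypothetical client-side sketch: request the finish reason by sending the
# optional BOOL input tensor alongside the usual text_input/stream inputs.
finish_reason_flag = grpcclient.InferInput("output_finish_reason", [1], "BOOL")
finish_reason_flag.set_data_from_numpy(np.array([True], dtype=bool))
```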
FYI - in the TRT-LLM backend, the input tensors that enable optional outputs follow the "return_XXX" naming convention. I don't think we have to align with that naming, though; I'd prefer to use whatever vLLM users are used to.
Since this is our first attempt at sending optional outputs from vLLM, we can name the input tensor switches "return_*". @rmccorm4 what do you think?
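If the switches do end up named "return_*", reading them in the Python backend could look roughly like this hedged sketch (the helper and tensor names are illustrative, not the PR's actual code):

```python
import triton_python_backend_utils as pb_utils

def get_bool_flag(request, name, default=False):
    # Hypothetical helper: read an optional BOOL input tensor used as a
    # per-request switch, e.g. "return_finish_reason" or "return_num_output_tokens".
    tensor = pb_utils.get_input_tensor_by_name(request, name)
    if tensor is None:
        return default
    return bool(tensor.as_numpy()[0])
```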
The model.py in the CI pipeline has been replaced with the version from this commit.
I'm in favor of being as similar across both backends as possible - unless there's a clear framework-specific idiom like Kris mentioned (e.g. something vLLM users are already comfortable with).
What does the PR do?
Add support for sending additional outputs from vLLM. In this step, the following three outputs are added:
Checklist
<commit_type>: <Title>
Commit Type:
Check the conventional commit type box here and add the label to the GitHub PR.
Related PRs:
N/A
Where should the reviewer start?
N/A
Test plan:
A new test is added with this PR that covers all combinations of the three additional output switches and verifies that the outputs are valid for each combination; see the sketch below.
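A sketch of what "all combinations" could mean in such a test, assuming illustrative switch names and hypothetical helpers; this is not the PR's actual test code:

```python
import itertools

# Illustrative switch names; the PR's actual flags may differ.
FLAGS = ["return_finish_reason", "return_num_output_tokens"]  # plus any other added output

def run_all_combinations(send_request, verify_outputs):
    # send_request and verify_outputs are hypothetical helpers supplied by the test.
    for combo in itertools.product([False, True], repeat=len(FLAGS)):
        request_flags = dict(zip(FLAGS, combo))
        response = send_request(**request_flags)
        # Each optional output should be present and valid exactly when requested.
        verify_outputs(response, request_flags)
```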
Caveats:
N/A
Background
Outputs supported by vLLM in addition to text output: https://github.com/vllm-project/vllm/blob/v0.6.3.post1/vllm/outputs.py#L14-L40
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
N/A