Add instructions for running vLLM backend #8
Conversation
```
mkdir -p /opt/tritonserver/backends/vllm
wget -P /opt/tritonserver/backends/vllm https://raw.githubusercontent.com/triton-inference-server/vllm_backend/main/src/model.py
```
Not an action item here, just some random food for thought that could be nice for both users and developers. If we standardize on a certain python-based-backend git repository structure, we could do something like:

git clone https://github.com/triton-inference-server/vllm_backend.git /opt/tritonserver/backends

- Single command.
- Developers could iterate on the backend directly in the git repo and just reload Triton, without copying files/builds around (better developer experience).
- More support for multi-file implementations. The `wget` is nice, but it won't scale past a single file. For example, imagine `model.py` implements `TritonPythonModel` but imports `implementation.py`, which holds all the gory details for certain features.

Just some random Tuesday ideas in my head. Core would just need to be updated to also look for `src/model.py` (or whatever standard we set) instead of just `model.py`.
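As a purely illustrative sketch of what that clone-based flow might look like (the target path, the `src/` convention, and `implementation.py` are assumptions for illustration, not an agreed standard):

```
# Hypothetical flow if core learned to look for src/model.py inside a cloned
# backend repo (directory names below are assumptions, not the current layout):
git clone https://github.com/triton-inference-server/vllm_backend.git /opt/tritonserver/backends/vllm
ls /opt/tritonserver/backends/vllm/src
#   model.py           <- implements TritonPythonModel
#   implementation.py  <- hypothetical helper module that model.py imports
```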
This will not work with git clone, since the required `model.py` is in a sub-directory of `vllm_backend`, plus `clone` will pull in the tests as well.

We can discuss the best solution at some point.
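For illustration only, one workaround under those constraints might be to clone into a scratch location and copy just the entry point; the paths below are illustrative, not what the README prescribes:

```
# Sketch only: clone somewhere temporary, then copy just the backend entry
# point into Triton's backends directory (paths are illustrative).
git clone --depth 1 https://github.com/triton-inference-server/vllm_backend.git /tmp/vllm_backend
mkdir -p /opt/tritonserver/backends/vllm
cp /tmp/vllm_backend/src/model.py /opt/tritonserver/backends/vllm/model.py
rm -rf /tmp/vllm_backend
```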
For ease of development, I think your earlier idea of symlinks makes more sense.
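As a rough sketch of that symlink idea (the checkout location and layout are assumptions, not a settled convention):

```
# Development sketch: keep a normal checkout and symlink the entry point into
# the backends directory, so edits in the repo are picked up when Triton is
# reloaded. The checkout path here is arbitrary.
git clone https://github.com/triton-inference-server/vllm_backend.git "$HOME/vllm_backend"
mkdir -p /opt/tritonserver/backends/vllm
ln -sf "$HOME/vllm_backend/src/model.py" /opt/tritonserver/backends/vllm/model.py
```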
> this will not work with git clone, since required model.py is in a sub-directory of vllm_backend, plus clone will clone tests as well.

I know it won't work as-is and would require minor changes. Not necessarily asking for this feature at this time, just food for thought.
We have a separate goal of improving the Python backend developer experience (more for things like debugging, ipdb, etc.) somewhere in the pipeline, so this came to mind as a tangential idea.
I see. By any chance, do you know which ticket this is tracked in? If you don't remember, no worries.
LGTM besides a minor suggestion. Great work @dyastremsky!
Amazing work on this, @dyastremsky!
Co-authored-by: Neelay Shah <[email protected]>
Co-authored-by: Olga Andreeva <[email protected]>
Co-authored-by: Ryan McCormick <[email protected]>
Co-authored-by: Tanmay Verma <[email protected]>
Draft documentation to allow users to quickly use the vLLM backend to run their models.