Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[draft] feat: Add KV aware routing operator. Update llm example for KV aware example #107

Open
wants to merge 7 commits into
base: main
Choose a base branch
from

Conversation

GuanLuo
Copy link

@GuanLuo GuanLuo commented Feb 5, 2025

What does the PR do?

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging ref.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

  • build
  • ci
  • docs
  • feat
  • fix
  • perf
  • refactor
  • revert
  • style
  • test

Related PRs:

Where should the reviewer start?

Test plan:

  • CI Pipeline ID:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@GuanLuo GuanLuo changed the title feat: Add KV aware routing operator. Update llm example for KV aware example [draft] feat: Add KV aware routing operator. Update llm example for KV aware example Feb 5, 2025
Comment on lines +146 to +158
# [FIXME] build Rust library and bindings once the code is ready,
# for now, the pre-built artifacts are used
# cargo build --release
# RUN git clone --single-branch --branch gluo/pyrouter ${GITLAB_DIR} nova-distributed
# RUN pip install maturin[patchelf]
# RUN cd nova-distributed/python-wheels/nova-distributed && maturin build
# KV publisher C library
# RUN cd nova-distributed/rust/libnova_llm/ && cargo build --release
RUN pip install /workspace/triton-distributed/binding_package/wheels/nova_distributed*.whl

# 'etcd' is runtime dependency
RUN wget https://github.com/etcd-io/etcd/releases/download/v3.5.18/etcd-v3.5.18-linux-amd64.tar.gz && tar -xzf etcd-v3.5.18-linux-amd64.tar.gz
RUN cp ./etcd-v3.5.18-linux-amd64/etcd* /usr/local/bin/.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment on lines +137 to +144
# [FIXME] change naming to 'triton-distributed'
# Install Rust and build nova-distributed wheel
RUN curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y
RUN . "$HOME/.cargo/env"

# Rust 'triton-distributed' library dependency
# [WIP] 'protobuf-compiler' safe? We already have a version of it somewhere
RUN apt-get update && apt-get install -y libssl-dev pkg-config libhwloc-dev hwloc libudev-dev clang libclang-dev protobuf-compiler
Copy link
Contributor

@rmccorm4 rmccorm4 Feb 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI this cargo setup in container is already in latest main branch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants