[vllm] Validate speculative model config #2415

xyang16 · 2024-10-02T22:53:17Z

Description

Brief description of what this PR is about

If this change is a backward incompatible change, why must this change be made?
Interesting edge cases to note here

davidthomas426 · 2024-10-03T00:09:11Z

engines/python/setup/djl_python/properties_manager/vllm_rb_properties.py

+    def validate_speculative_model(self):
+        if self.speculative_model is not None and not self.use_v2_block_manager:
+            raise ValueError(
+                "Speculative decoding requires usage of the V2 block manager. Enable it with option.use_v2_block_manager=true."


That's actually not true. [EDIT: I was wrong! Check next comment for details]

Ok, just double-checked. It actually is true. I was confused by the inconsistency between these two pieces of code in vllm v0.6.2:

https://github.com/vllm-project/vllm/blob/f58d4fccc9b270838be438f5f0db71bea156a56d/vllm/engine/arg_utils.py#L964-L968 - sets the variable with a warning log if not set and it's required

https://github.com/vllm-project/vllm/blob/7193774b1ff8603ad5bf4598e5efba0d9a39b436/vllm/config.py#L1203-L1206 - throws error if not set and it's required

[vllm] Validate speculative model config

2d804c8

xyang16 requested review from zachgk and a team as code owners October 2, 2024 22:53

sindhuvahinis approved these changes Oct 2, 2024

View reviewed changes

xyang16 merged commit 45454f7 into deepjavalibrary:master Oct 2, 2024
9 checks passed

davidthomas426 reviewed Oct 3, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[vllm] Validate speculative model config #2415

[vllm] Validate speculative model config #2415

xyang16 commented Oct 2, 2024

davidthomas426 Oct 3, 2024 •

edited

Loading

davidthomas426 Oct 3, 2024 •

edited

Loading

[vllm] Validate speculative model config #2415

[vllm] Validate speculative model config #2415

Conversation

xyang16 commented Oct 2, 2024

Description

davidthomas426 Oct 3, 2024 • edited Loading

Choose a reason for hiding this comment

davidthomas426 Oct 3, 2024 • edited Loading

Choose a reason for hiding this comment

davidthomas426 Oct 3, 2024 •

edited

Loading

davidthomas426 Oct 3, 2024 •

edited

Loading