Update v1_user_guide.md

vllm-project · Feb 27, 2025 · ff71ca6 · ff71ca6
1 parent eb6fd2b
commit ff71ca6
Showing 1 changed file with 3 additions and 2 deletions.
diff --git a/docs/source/getting_started/v1_user_guide.md b/docs/source/getting_started/v1_user_guide.md
@@ -20,10 +20,9 @@ Previous blog post [vLLM V1: A Major Upgrade to vLLM's Core Architecture](https:
 - logits_processors
 - beam_search
 
-#### Deprecated KV Cache
+#### Deprecated KV Cache features
 - KV Cache swapping
 - KV Cache offloading
-- FP8 KV Cache
 
 ## Unsupported features
 
@@ -37,6 +36,8 @@ Previous blog post [vLLM V1: A Major Upgrade to vLLM's Core Architecture](https:
 - **Quantization**: For V1, when the CUDA graph is enabled, it defaults to the 
   piecewise CUDA graph introduced in this [PR](https://github.com/vllm-project/vllm/pull/10058) ; consequently, FP8 and other quantizations are not supported. 
 
+- **FP8 KV Cache**: FP8 KV Cache is not yet supported in V1.
+
 ## Unsupported models
 
 All model with `SupportsV0Only` tag in the model definition is not supported by V1.