Skip to content

[FEAT] Improved PagedAttention FP8 (faster kvcache dequant v1)#346

Closed
tjtanaa wants to merge 2 commits intoROCm:llama_fp8_12062024from EmbeddedLLM:paged-attn-fp8

Commits

Commits on Dec 20, 2024