[Feature] Faster Custom Paged Attention kernels #385

Draft
tjtanaa wants to merge 18 commits into ROCm:llama_fp8_12062024 from EmbeddedLLM:pg_attn_to_llama_fp8

Commits

Commits on Nov 13, 2024

Commits on Dec 4, 2024

Commits on Dec 9, 2024

Commits on Dec 24, 2024

Commits on Dec 26, 2024

Commits on Jan 14, 2025

Commits on Jan 15, 2025

Commits on Jan 16, 2025

Commits on Jan 17, 2025

Commits on Jan 20, 2025

Commits on Jan 22, 2025

Commits on Jan 23, 2025