Skip to content

v1.21.0: SD3, Flux, MiniCPM, NanoLlava, VLM Quantization, XPU, PagedAttention

Latest
Compare
Choose a tag to compare
@IlyasMoutawwakil IlyasMoutawwakil released this 06 Dec 12:53
· 36 commits to main since this release

What's Changed

OpenVINO

Diffusers

VLMs Modeling

NNCF

IPEX

  • Unified XPU/CPU modeling with custom PagedAttention cache for LLMs by @sywangyi in #1009

INC

New Contributors

Full Changelog: v1.20.0...v1.21.0