[Model] Initial weight absorption impl for Deepseek-v2/v3 #1987

Annotations

6 warnings

Windows

succeeded Jan 16, 2025 in 11m 50s