Pinned Loading
Repositories
Showing 10 of 38 repositories
- GPTQModel Public Forked from ModelCloud/GPTQModel
Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
AXERA-TECH/GPTQModel’s past year of commit activity - ax-samples Public
Samples code for world class Artificial Intelligence SoCs for computer vision applications.
AXERA-TECH/ax-samples’s past year of commit activity - SmolVLM-256M-Instruct.axera Public Forked from techshoww/SmolVLM-256M-Instruct.axera
Demo for SmolVLM-256M-Instruct on AXERA 650N
AXERA-TECH/SmolVLM-256M-Instruct.axera’s past year of commit activity - ax-llm-SmolVLM-256M Public Forked from techshoww/ax-llm
Explore LLM model deployment based on AXera's AI chips
AXERA-TECH/ax-llm-SmolVLM-256M’s past year of commit activity