Popular repositories
- LookaheadDecoding Public
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
- Consistency_LLM Public
[ICML 2024] CLLMs: Consistency Large Language Models
Repositories
Showing 10 of 19 repositories
- hao-ai-lab.github.io Public
- Awesome-Video-Attention Public
A curated list of recent papers on efficient video attention for video diffusion models, covering techniques such as sparsification, quantization, and caching.
- ComfyUI Public (forked from comfyanonymous/ComfyUI)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
- vllm Public (forked from vllm-project/vllm)
A high-throughput, memory-efficient inference and serving engine for LLMs.
- sglang Public (forked from sgl-project/sglang)
SGLang is a fast serving framework for large language models and vision language models.