We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
3508938
LMoE - HF inference (better quality, slower), vllm inference (much faster, much lower quality for some adapters)