vLLMvLLM/Recipes
BrowseDocsGitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
Jina AI
Meituan LongCat
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
NVIDIA
OpenAI
InternVL (OpenGVLab)
PaddlePaddle
Poolside
Qwen
Qwen3.6-27BQwen3.6-35B-A3BQwen3.5-0.8BQwen3.5-2BQwen3.5-4BQwen3.5-9BQwen3.5-122B-A10BQwen3.5-27BQwen3.5-35B-A3BQwen3.5-397B-A17BQwen3-ASR-1.7BQwen3Guard-Gen-8BQwen3-VL-235B-A22B-InstructQwen3-Next-80B-A3B-InstructQwen-ImageQwen3-Coder-480B-A35B-InstructQwen3-235B-A22B-Instruct-2507Qwen2.5-VL-72B-Instruct
Stability AI
StepFun
Tencent Hunyuan
Wan AI
Xiaomi MiMo
GLM (Z-AI)

Qwen

Qwen·18 recipesHuggingFace

Multimodal

13
Qwen3.6-27B
27B
dense
BF1665GFP833G
v0.17.0+→
Qwen3.6-35B-A3B
35B / 3B
moe
BF1684GFP842G
v0.17.0+→
Qwen3.5-0.8B
0.8B
dense
BF162G
v0.17.0+→
Qwen3.5-2B
2B
dense
BF165G
v0.17.0+→
Qwen3.5-4B
4B
dense
BF1610G
v0.17.0+→
Qwen3.5-9B
9B
dense
BF1622G
v0.17.0+→
Qwen3.5-122B-A10B
122B / 10B
moe
BF16293GFP8147GINT474G
v0.17.0+→
Qwen3.5-27B
27B
dense
BF1665GFP833GINT417G
v0.17.0+→
Qwen3.5-35B-A3B
35B / 3B
moe
BF1684GFP842GINT421G
v0.17.0+→
Qwen3.5-397B-A17B
397B / 17B
moe
BF16953GNVFP4238GINT4239G
v0.17.0+→
Qwen3-ASR-1.7B
2.3B
dense
BF164G
v0.12.0+→
Qwen3-VL-235B-A22B-Instruct
235B / 22B
moe
BF16564GFP8282GNVFP4141G
v0.11.0+→
Qwen2.5-VL-72B-Instruct
72B
dense
BF16173GINT443G
v0.7.0+→

Text

4
Qwen3Guard-Gen-8B
8B
dense
BF1619GBF1610GBF164G
v0.10.0+→
Qwen3-Next-80B-A3B-Instruct
80B / 3B
moe
BF16192GFP896GNVFP448G
v0.10.0+→
Qwen3-Coder-480B-A35B-Instruct
480B / 35B
moe
BF161152GFP8576GNVFP4288G
v0.10.0+→
Qwen3-235B-A22B-Instruct-2507
235B / 22B
moe
BF16564GFP8240GNVFP4141G
v0.10.0+→

Omni

1
Qwen-Image
20B
dense
BF1648GFP824GINT824G
v0.18.0+→
GitHubRequest a recipeDocumentationSupported Models & HardwareInstall vLLMJSON API