vLLMvLLM/Recipes
BrowseDocsGitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
Jina AI
Meituan LongCat
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
Kimi-K2.6Kimi-K2.5Kimi-K2-ThinkingKimi-Linear-48B-A3B-InstructKimi-K2-Instruct
NVIDIA
OpenAI
InternVL (OpenGVLab)
PaddlePaddle
Poolside
Qwen
Stability AI
StepFun
Tencent Hunyuan
Wan AI
Xiaomi MiMo
GLM (Z-AI)

Moonshot AI

moonshotai·5 recipesHuggingFace

Multimodal

2
Kimi-K2.6
1T / 32B
moe
INT4714G
v0.19.1+→
Kimi-K2.5
1T / 32B
moe
INT4714GNVFP4600G
v0.19.1+→

Text

3
Kimi-K2-Thinking
1T / 32B
moe
INT4600GNVFP4600G
v0.12.0+→
Kimi-Linear-48B-A3B-Instruct
48B / 3B
moe
BF16115G
v0.11.2+→
Kimi-K2-Instruct
1T / 32B
moe
FP81200G
v0.12.0+→
GitHubRequest a recipeDocumentationSupported Models & HardwareInstall vLLMJSON API