Hugging Face
Use the hf provider for Hugging Face Inference Providers.
Specify models as hf.<model_name>[:provider]. If the provider suffix is omitted, Hugging Face routes the request automatically. Request options can be appended as query parameters, for example ?reasoning=on.
fast-agent --model kimi
fast-agent --model kimi26instant
fast-agent --model hf.openai/gpt-oss-120b
fast-agent --model hf.moonshotai/kimi-k2-instruct-0905:groq
fast-agent --model "hf.moonshotai/Kimi-K2.6:novita?reasoning=on"
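To make the structure of these model strings concrete, the following is a minimal Python sketch that splits one into its model name, optional provider suffix, and optional query parameters. It is an illustration only, not fast-agent's own parser; the names `HfModelSpec` and `parse_hf_model` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional
from urllib.parse import parse_qsl


@dataclass
class HfModelSpec:
    """Hypothetical container for the parts of an hf.* model string."""
    model: str                       # e.g. "moonshotai/Kimi-K2.6"
    provider: Optional[str] = None   # e.g. "novita"; None means auto-routing
    params: Dict[str, str] = field(default_factory=dict)


def parse_hf_model(spec: str) -> HfModelSpec:
    """Split 'hf.<model_name>[:provider][?key=value&...]' into its parts."""
    if not spec.startswith("hf."):
        raise ValueError(f"not an hf model string: {spec!r}")
    # Separate the query string first, then the provider suffix.
    body, _, query = spec[len("hf."):].partition("?")
    model, _, provider = body.partition(":")
    return HfModelSpec(model, provider or None, dict(parse_qsl(query)))


if __name__ == "__main__":
    print(parse_hf_model("hf.moonshotai/Kimi-K2.6:novita?reasoning=on"))
    print(parse_hf_model("hf.openai/gpt-oss-120b"))
```

When the provider field comes back as `None`, the string carries no suffix and the request would be auto-routed by Hugging Face.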
Curated aliases such as kimi, deepseek-hf, glm, and minimax bundle a provider choice and request defaults that have been tested with fast-agent features such as structured outputs and tool use. Capabilities can still vary by backing provider.
Kimi instant mode
Kimi models that support instant mode can disable reasoning with the instant query parameter:
fast-agent --model "hf.moonshotai/Kimi-K2.5?instant=on" # thinking disabled
fast-agent --model "hf.moonshotai/Kimi-K2.5?instant=off" # thinking enabled
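Note that the flag is inverted relative to "thinking": instant=on turns reasoning off. A small Python sketch of that mapping, using a hypothetical helper name (`kimi_model_string`) that is not part of fast-agent:

```python
def kimi_model_string(model: str, thinking: bool) -> str:
    """Build an hf.* model string with the instant query parameter.

    instant=on  -> thinking disabled
    instant=off -> thinking enabled
    """
    instant = "off" if thinking else "on"
    return f"hf.{model}?instant={instant}"


if __name__ == "__main__":
    print(kimi_model_string("moonshotai/Kimi-K2.5", thinking=False))
    print(kimi_model_string("moonshotai/Kimi-K2.5", thinking=True))
```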
Hugging Face MCP authentication
HF_TOKEN is automatically applied when connecting to Hugging Face MCP servers:
- hf.co/huggingface.co uses Authorization: Bearer {HF_TOKEN}
- *.hf.space uses both Authorization: Bearer {HF_TOKEN} and X-HF-Authorization: Bearer {HF_TOKEN}
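The host-to-header rules above could be sketched in Python roughly as follows. This is an assumption-laden illustration rather than fast-agent's implementation: the function name `hf_auth_headers` is hypothetical, and matching hf.co and huggingface.co by exact hostname (rather than including subdomains) is a guess.

```python
from typing import Dict
from urllib.parse import urlparse


def hf_auth_headers(url: str, token: str) -> Dict[str, str]:
    """Return the headers that would carry HF_TOKEN for a given server URL."""
    host = (urlparse(url).hostname or "").lower()
    bearer = f"Bearer {token}"
    # Assumption: exact hostname match for the main Hugging Face domains.
    if host in ("hf.co", "huggingface.co"):
        return {"Authorization": bearer}
    # Any host under hf.space receives both headers.
    if host.endswith(".hf.space"):
        return {"Authorization": bearer, "X-HF-Authorization": bearer}
    # Other hosts get no Hugging Face credentials.
    return {}


if __name__ == "__main__":
    print(hf_auth_headers("https://example.hf.space/mcp", "hf_xxx"))
```

Returning an empty dict for unrelated hosts keeps the token from being sent to servers outside the two patterns listed.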
Model aliases
| Model Alias | Maps to |
|---|---|
| deepseek-ai/deepseek-v4-pro | deepseek-ai/deepseek-v4-pro |
| deepseek-hf | hf.deepseek-ai/DeepSeek-V4-Pro:together |
| deepseek32 | hf.deepseek-ai/DeepSeek-V3.2:fireworks-ai |
| deepseek4-hf | hf.deepseek-ai/DeepSeek-V4-Pro:together |
| deepseek4pro-hf | hf.deepseek-ai/DeepSeek-V4-Pro:together |
| deepseekv4pro-hf | hf.deepseek-ai/DeepSeek-V4-Pro:together |
| glm | hf.zai-org/GLM-5.1:together |
| glm47 | hf.zai-org/GLM-4.7:cerebras |
| glm5 | hf.zai-org/GLM-5:novita |
| glm51 | hf.zai-org/GLM-5.1:together |
| gpt-oss | hf.openai/gpt-oss-120b:cerebras |
| gpt-oss-20b | hf.openai/gpt-oss-20b |
| kimi | hf.moonshotai/Kimi-K2.6:novita?temperature=1.0&top_p=0.95&reasoning=on |
| kimi-2.5 | hf.moonshotai/Kimi-K2.5:novita?temperature=1.0&top_p=0.95&reasoning=on |
| kimi-2.6 | hf.moonshotai/Kimi-K2.6:novita?temperature=1.0&top_p=0.95&reasoning=on |
| kimi25 | hf.moonshotai/Kimi-K2.5:novita?temperature=1.0&top_p=0.95&reasoning=on |
| kimi25instant | hf.moonshotai/Kimi-K2.5:novita?temperature=0.6&top_p=0.95&reasoning=off |
| kimi26 | hf.moonshotai/Kimi-K2.6:novita?temperature=1.0&top_p=0.95&reasoning=on |
| kimi26instant | hf.moonshotai/Kimi-K2.6:novita?temperature=0.6&top_p=0.95&reasoning=off |
| kimithink | hf.moonshotai/Kimi-K2.6:novita?temperature=1.0&top_p=0.95&reasoning=on |
| minimax | hf.MiniMaxAI/MiniMax-M2.7:fireworks-ai?temperature=1.0&top_p=0.95&top_k=40 |
| minimax2.5 | hf.MiniMaxAI/MiniMax-M2.5:novita?temperature=1.0&top_p=0.95&top_k=40 |
| minimax21 | hf.MiniMaxAI/MiniMax-M2.1:novita |
| minimax25 | hf.MiniMaxAI/MiniMax-M2.5:fireworks-ai?temperature=1.0&top_p=0.95&top_k=40 |
| minimax27 | hf.MiniMaxAI/MiniMax-M2.7:fireworks-ai?temperature=1.0&top_p=0.95&top_k=40 |
| moonshotai/kimi-k2 | moonshotai/kimi-k2 |
| moonshotai/kimi-k2-instruct-0905 | moonshotai/kimi-k2-instruct-0905 |
| moonshotai/kimi-k2-thinking | moonshotai/kimi-k2-thinking |
| moonshotai/kimi-k2.5 | moonshotai/kimi-k2.5 |
| moonshotai/kimi-k2.6 | moonshotai/kimi-k2.6 |
| qwen/qwen3.5-397b-a17b | qwen/qwen3.5-397b-a17b |
| qwen35 | hf.Qwen/Qwen3.5-397B-A17B:novita?temperature=0.6&top_p=0.95&top_k=20&min_p=0.0&presence_penalty=0.0&repetition_penalty=1.0&reasoning=on |
| qwen35instruct | hf.Qwen/Qwen3.5-397B-A17B:novita?temperature=0.7&top_p=0.8&top_k=20&min_p=0.0&presence_penalty=1.5&repetition_penalty=1.0&reasoning=off |
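As a rough illustration of how such an alias table behaves, the Python sketch below expands an alias to its full model string and reads out the request defaults encoded in the query string. Only a few rows from the table are copied in; the names `ALIASES`, `resolve_model`, and `alias_defaults` are hypothetical and do not describe fast-agent's internals.

```python
from typing import Dict
from urllib.parse import parse_qsl

# A small subset of the alias table above, copied verbatim.
ALIASES: Dict[str, str] = {
    "kimi": "hf.moonshotai/Kimi-K2.6:novita?temperature=1.0&top_p=0.95&reasoning=on",
    "kimi26instant": "hf.moonshotai/Kimi-K2.6:novita?temperature=0.6&top_p=0.95&reasoning=off",
    "gpt-oss": "hf.openai/gpt-oss-120b:cerebras",
    "glm": "hf.zai-org/GLM-5.1:together",
}


def resolve_model(name: str) -> str:
    """Expand a known alias; pass any other string through unchanged."""
    return ALIASES.get(name, name)


def alias_defaults(name: str) -> Dict[str, str]:
    """Return the request defaults carried in the resolved query string."""
    _, _, query = resolve_model(name).partition("?")
    return dict(parse_qsl(query))


if __name__ == "__main__":
    print(resolve_model("gpt-oss"))
    print(alias_defaults("kimi26instant"))
```

Comparing `alias_defaults("kimi")` with `alias_defaults("kimi26instant")` shows that the instant alias differs only in temperature and the reasoning flag.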