Supported Models - Dedalus Docs

Dedalus routes to every major LLM provider. Models are addressed with the format provider/model-name, and you switch providers by changing a string.

Programmatic discovery: fetch the live list with GET /v1/models. Each model object includes capabilities (vision, tools, thinking, streaming) and provider_info (status, upstream API).

Providers

OpenAI

openai/gpt-5.2

Anthropic

anthropic/claude-opus-4-6

Google

google/gemini-3-pro-preview

xAI

xai/grok-4-1-fast-reasoning

DeepSeek

deepseek/deepseek-reasoner

Mistral

mistral/mistral-large-latest

Groq

groq/llama-3.3-70b-versatile

Cerebras

cerebras/llama-3.3-70b

Moonshot

moonshot/kimi-k2.5

Perplexity

perplexity/sonar-pro

Fireworks

fireworks/llama-v3p1-405b

Cohere

cohere/command-r-plus

Model catalog

OpenAI

Chat

openai/gpt-5.2
openai/gpt-5.1
openai/gpt-5
openai/gpt-5-mini
openai/gpt-5-nano
openai/gpt-5-chat-latest
openai/gpt-4.1
openai/gpt-4.1-mini
openai/gpt-4.1-nano
openai/gpt-4o
openai/gpt-4o-2024-05-13
openai/chatgpt-4o-latest
openai/gpt-4-turbo
openai/gpt-4
openai/gpt-3.5-turbo

Reasoning

openai/o1
openai/o3
openai/o3-mini
openai/o4-mini

Image generation

openai/dall-e-3

Audio transcription

openai/whisper-1

Embeddings

Model	Price
`openai/text-embedding-3-large`	$0.13 / 1M tokens
`openai/text-embedding-3-small`	$0.02 / 1M tokens
`openai/text-embedding-ada-002`	$0.10 / 1M tokens

Anthropic

Claude 4.6

anthropic/claude-opus-4-6

Claude 4.5

anthropic/claude-opus-4-5
anthropic/claude-sonnet-4-5-20250929
anthropic/claude-haiku-4-5-20251001

Claude 4

anthropic/claude-opus-4-1-20250805
anthropic/claude-opus-4-20250514

Claude 3.7

anthropic/claude-3-7-sonnet-20250219

Claude 3.5

anthropic/claude-3-5-haiku-20241022

Claude 3

anthropic/claude-3-haiku-20240307

Google

Gemini 3

google/gemini-3-pro-preview
google/gemini-3-flash-preview

Gemini 2.5

google/gemini-2.5-pro
google/gemini-2.5-flash
google/gemini-2.5-flash-lite

Gemini 2.0

google/gemini-2.0-flash
google/gemini-2.0-flash-exp
google/gemini-2.0-flash-001
google/gemini-2.0-flash-lite

Embeddings

google/text-embedding-004

xAI

Grok 4

xai/grok-4-1-fast-reasoning
xai/grok-4-1-fast-non-reasoning
xai/grok-4-fast-reasoning
xai/grok-4-fast-non-reasoning
xai/grok-code-fast-1
xai/grok-4-0709

Grok 3

xai/grok-3
xai/grok-3-mini

Grok 2

xai/grok-2-vision-1212

DeepSeek

deepseek/deepseek-chat
deepseek/deepseek-reasoner
deepseek/deepseek-coder

Mistral

mistral/mistral-large-latest
mistral/mistral-medium-latest
mistral/mistral-small-latest
mistral/codestral-2508
mistral/open-mistral-nemo-2407
mistral/pixtral-12b

Groq

Lightning-fast inference for open-source models.

groq/llama-3.1-8b-instant
groq/llama-3.3-70b-versatile
groq/openai/gpt-oss-120b
groq/openai/gpt-oss-20b
groq/whisper-large-v3
groq/whisper-large-v3-turbo

Cerebras

Ultra-fast inference on custom silicon.

Production

cerebras/llama3.1-8b
cerebras/llama-3.3-70b
cerebras/gpt-oss-120b
cerebras/qwen-3-32b

Preview

cerebras/qwen-3-235b-a22b-instruct-2507
cerebras/zai-glm-4.7

Moonshot

moonshot/kimi-k2.5
moonshot/kimi-k2-0905-preview
moonshot/kimi-k2-0711-preview
moonshot/kimi-k2-turbo-preview
moonshot/kimi-k2-thinking
moonshot/kimi-k2-thinking-turbo

Recommendations by use case

Tool calling and function use

anthropic/claude-opus-4-6 - strongest tool calling with structured outputs
anthropic/claude-sonnet-4-5-20250929 - fast, reliable tool use
openai/gpt-5.2 - native function calling with structured responses
openai/gpt-4o - reliable for production tool workflows
deepseek/deepseek-chat - multi-step reasoning with tools

Coding

openai/gpt-5-codex - purpose-built for code generation
deepseek/deepseek-coder - strong code-focused model
anthropic/claude-opus-4-6 - excellent code understanding
xai/grok-code-fast-1 - fast code-focused inference
mistral/codestral-2508 - open-source coding model

Reasoning

openai/o3 - deep reasoning for complex problems
openai/o1 - multi-step chain-of-thought
anthropic/claude-opus-4-6 - advanced reasoning capabilities
deepseek/deepseek-reasoner - specialized reasoning model
xai/grok-4-1-fast-reasoning - optimized for reasoning tasks

Speed and throughput

anthropic/claude-haiku-4-5-20251001 - fast Claude at lower cost
google/gemini-2.5-flash - optimized for throughput
openai/gpt-5-mini - lightweight, fast
openai/gpt-5-nano - ultra-fast for simple tasks
groq/llama-3.3-70b-versatile - Groq-accelerated open-source

Long context

google/gemini-3-pro-preview - 1M+ token context
google/gemini-2.5-pro - extended context with strong reasoning
anthropic/claude-opus-4-6 - long-context analysis
anthropic/claude-sonnet-4-5-20250929 - fast long-context

Vision and multimodal

openai/gpt-5.2 - strong multimodal capabilities
anthropic/claude-opus-4-6 - advanced vision understanding
google/gemini-3-pro-preview - vision plus long context
xai/grok-2-vision-1212 - multimodal with Grok
openai/gpt-4o - reliable vision for production

Most providers ship multiple tiers (mini / standard / pro / opus). Start on the smaller tier, scale up only when the benchmarks justify it.

​Providers

OpenAI

Anthropic

Google

xAI

DeepSeek

Mistral

Groq

Cerebras

Moonshot

Perplexity

Fireworks

Cohere

​Model catalog

​OpenAI

​Chat

​Reasoning

​Image generation

​Audio transcription

​Embeddings

​Anthropic

​Claude 4.6

​Claude 4.5

​Claude 4

​Claude 3.7

​Claude 3.5

​Claude 3

​Google

​Gemini 3

​Gemini 2.5

​Gemini 2.0

​Embeddings

​xAI

​Grok 4

​Grok 3

​Grok 2

​DeepSeek

​Mistral

​Groq

​Cerebras

​Production

​Preview

​Moonshot

​Recommendations by use case

​Tool calling and function use

​Coding

​Reasoning

​Speed and throughput

​Long context

​Vision and multimodal

Providers

Model catalog

OpenAI

Chat

Reasoning

Image generation

Audio transcription

Embeddings

Anthropic

Claude 4.6

Claude 4.5

Claude 4

Claude 3.7

Claude 3.5

Claude 3

Google

Gemini 3

Gemini 2.5

Gemini 2.0

Embeddings

xAI

Grok 4

Grok 3

Grok 2

DeepSeek

Mistral

Groq

Cerebras

Production

Preview

Moonshot

Recommendations by use case

Tool calling and function use

Coding

Reasoning

Speed and throughput

Long context

Vision and multimodal