> ## Documentation Index
> Fetch the complete documentation index at: https://docs.dedaluslabs.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Supported Models

> Every model from every major provider, reachable through one OpenAI-compatible endpoint.

Dedalus routes to every major LLM provider. Models are addressed with the format `provider/model-name`, and you switch providers by changing a string.

<Info>
  **Programmatic discovery:** fetch the live list with [`GET /v1/models`](/api/list-models). Each model object includes `capabilities` (vision, tools, thinking, streaming) and `provider_info` (status, upstream API).
</Info>

## Providers

<CardGroup cols={3}>
  <Card title="OpenAI" icon="bot">
    `openai/gpt-5.2`
  </Card>

  <Card title="Anthropic" icon="brain">
    `anthropic/claude-opus-4-6`
  </Card>

  <Card title="Google" icon="sparkles">
    `google/gemini-3-pro-preview`
  </Card>

  <Card title="xAI" icon="x">
    `xai/grok-4-1-fast-reasoning`
  </Card>

  <Card title="DeepSeek" icon="search">
    `deepseek/deepseek-reasoner`
  </Card>

  <Card title="Mistral" icon="wind">
    `mistral/mistral-large-latest`
  </Card>

  <Card title="Groq" icon="zap">
    `groq/llama-3.3-70b-versatile`
  </Card>

  <Card title="Cerebras" icon="cpu">
    `cerebras/llama-3.3-70b`
  </Card>

  <Card title="Moonshot" icon="moon">
    `moonshot/kimi-k2.5`
  </Card>

  <Card title="Perplexity" icon="circle-help">
    `perplexity/sonar-pro`
  </Card>

  <Card title="Fireworks" icon="flame">
    `fireworks/llama-v3p1-405b`
  </Card>

  <Card title="Cohere" icon="message-square">
    `cohere/command-r-plus`
  </Card>
</CardGroup>

## Model catalog

### OpenAI

#### Chat

* `openai/gpt-5.2`
* `openai/gpt-5.1`
* `openai/gpt-5`
* `openai/gpt-5-mini`
* `openai/gpt-5-nano`
* `openai/gpt-5-chat-latest`
* `openai/gpt-4.1`
* `openai/gpt-4.1-mini`
* `openai/gpt-4.1-nano`
* `openai/gpt-4o`
* `openai/gpt-4o-2024-05-13`
* `openai/chatgpt-4o-latest`
* `openai/gpt-4-turbo`
* `openai/gpt-4`
* `openai/gpt-3.5-turbo`

#### Reasoning

* `openai/o1`
* `openai/o3`
* `openai/o3-mini`
* `openai/o4-mini`

#### Image generation

* `openai/dall-e-3`

#### Audio transcription

* `openai/whisper-1`

#### Embeddings

| Model                           | Price              |
| ------------------------------- | ------------------ |
| `openai/text-embedding-3-large` | \$0.13 / 1M tokens |
| `openai/text-embedding-3-small` | \$0.02 / 1M tokens |
| `openai/text-embedding-ada-002` | \$0.10 / 1M tokens |

### Anthropic

#### Claude 4.6

* `anthropic/claude-opus-4-6`

#### Claude 4.5

* `anthropic/claude-opus-4-5`
* `anthropic/claude-sonnet-4-5-20250929`
* `anthropic/claude-haiku-4-5-20251001`

#### Claude 4

* `anthropic/claude-opus-4-1-20250805`
* `anthropic/claude-opus-4-20250514`

#### Claude 3.7

* `anthropic/claude-3-7-sonnet-20250219`

#### Claude 3.5

* `anthropic/claude-3-5-haiku-20241022`

#### Claude 3

* `anthropic/claude-3-haiku-20240307`

### Google

#### Gemini 3

* `google/gemini-3-pro-preview`
* `google/gemini-3-flash-preview`

#### Gemini 2.5

* `google/gemini-2.5-pro`
* `google/gemini-2.5-flash`
* `google/gemini-2.5-flash-lite`

#### Gemini 2.0

* `google/gemini-2.0-flash`
* `google/gemini-2.0-flash-exp`
* `google/gemini-2.0-flash-001`
* `google/gemini-2.0-flash-lite`

#### Embeddings

* `google/text-embedding-004`

### xAI

#### Grok 4

* `xai/grok-4-1-fast-reasoning`
* `xai/grok-4-1-fast-non-reasoning`
* `xai/grok-4-fast-reasoning`
* `xai/grok-4-fast-non-reasoning`
* `xai/grok-code-fast-1`
* `xai/grok-4-0709`

#### Grok 3

* `xai/grok-3`
* `xai/grok-3-mini`

#### Grok 2

* `xai/grok-2-vision-1212`

### DeepSeek

* `deepseek/deepseek-chat`
* `deepseek/deepseek-reasoner`
* `deepseek/deepseek-coder`

### Mistral

* `mistral/mistral-large-latest`
* `mistral/mistral-medium-latest`
* `mistral/mistral-small-latest`
* `mistral/codestral-2508`
* `mistral/open-mistral-nemo-2407`
* `mistral/pixtral-12b`

### Groq

Lightning-fast inference for open-source models.

* `groq/llama-3.1-8b-instant`
* `groq/llama-3.3-70b-versatile`
* `groq/openai/gpt-oss-120b`
* `groq/openai/gpt-oss-20b`
* `groq/whisper-large-v3`
* `groq/whisper-large-v3-turbo`

### Cerebras

Ultra-fast inference on custom silicon.

#### Production

* `cerebras/llama3.1-8b`
* `cerebras/llama-3.3-70b`
* `cerebras/gpt-oss-120b`
* `cerebras/qwen-3-32b`

#### Preview

* `cerebras/qwen-3-235b-a22b-instruct-2507`
* `cerebras/zai-glm-4.7`

### Moonshot

* `moonshot/kimi-k2.5`
* `moonshot/kimi-k2-0905-preview`
* `moonshot/kimi-k2-0711-preview`
* `moonshot/kimi-k2-turbo-preview`
* `moonshot/kimi-k2-thinking`
* `moonshot/kimi-k2-thinking-turbo`

## Recommendations by use case

### Tool calling and function use

* `anthropic/claude-opus-4-6` - strongest tool calling with structured outputs
* `anthropic/claude-sonnet-4-5-20250929` - fast, reliable tool use
* `openai/gpt-5.2` - native function calling with structured responses
* `openai/gpt-4o` - reliable for production tool workflows
* `deepseek/deepseek-chat` - multi-step reasoning with tools

### Coding

* `openai/gpt-5-codex` - purpose-built for code generation
* `deepseek/deepseek-coder` - strong code-focused model
* `anthropic/claude-opus-4-6` - excellent code understanding
* `xai/grok-code-fast-1` - fast code-focused inference
* `mistral/codestral-2508` - open-source coding model

### Reasoning

* `openai/o3` - deep reasoning for complex problems
* `openai/o1` - multi-step chain-of-thought
* `anthropic/claude-opus-4-6` - advanced reasoning capabilities
* `deepseek/deepseek-reasoner` - specialized reasoning model
* `xai/grok-4-1-fast-reasoning` - optimized for reasoning tasks

### Speed and throughput

* `anthropic/claude-haiku-4-5-20251001` - fast Claude at lower cost
* `google/gemini-2.5-flash` - optimized for throughput
* `openai/gpt-5-mini` - lightweight, fast
* `openai/gpt-5-nano` - ultra-fast for simple tasks
* `groq/llama-3.3-70b-versatile` - Groq-accelerated open-source

### Long context

* `google/gemini-3-pro-preview` - 1M+ token context
* `google/gemini-2.5-pro` - extended context with strong reasoning
* `anthropic/claude-opus-4-6` - long-context analysis
* `anthropic/claude-sonnet-4-5-20250929` - fast long-context

### Vision and multimodal

* `openai/gpt-5.2` - strong multimodal capabilities
* `anthropic/claude-opus-4-6` - advanced vision understanding
* `google/gemini-3-pro-preview` - vision plus long context
* `xai/grok-2-vision-1212` - multimodal with Grok
* `openai/gpt-4o` - reliable vision for production

<Tip>
  Most providers ship multiple tiers (mini / standard / pro / opus). Start on the smaller tier, scale up only when the benchmarks justify it.
</Tip>
