Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.dedaluslabs.ai/llms.txt

Use this file to discover all available pages before exploring further.

Dedalus routes to every major LLM provider. Models are addressed with the format provider/model-name, and you switch providers by changing a string.
Programmatic discovery: fetch the live list with GET /v1/models. Each model object includes capabilities (vision, tools, thinking, streaming) and provider_info (status, upstream API).

Providers

OpenAI

openai/gpt-5.2

Anthropic

anthropic/claude-opus-4-6

Google

google/gemini-3-pro-preview

xAI

xai/grok-4-1-fast-reasoning

DeepSeek

deepseek/deepseek-reasoner

Mistral

mistral/mistral-large-latest

Groq

groq/llama-3.3-70b-versatile

Cerebras

cerebras/llama-3.3-70b

Moonshot

moonshot/kimi-k2.5

Perplexity

perplexity/sonar-pro

Fireworks

fireworks/llama-v3p1-405b

Cohere

cohere/command-r-plus

Model catalog

OpenAI

Chat

  • openai/gpt-5.2
  • openai/gpt-5.1
  • openai/gpt-5
  • openai/gpt-5-mini
  • openai/gpt-5-nano
  • openai/gpt-5-chat-latest
  • openai/gpt-4.1
  • openai/gpt-4.1-mini
  • openai/gpt-4.1-nano
  • openai/gpt-4o
  • openai/gpt-4o-2024-05-13
  • openai/chatgpt-4o-latest
  • openai/gpt-4-turbo
  • openai/gpt-4
  • openai/gpt-3.5-turbo

Reasoning

  • openai/o1
  • openai/o3
  • openai/o3-mini
  • openai/o4-mini

Image generation

  • openai/dall-e-3

Audio transcription

  • openai/whisper-1

Embeddings

ModelPrice
openai/text-embedding-3-large$0.13 / 1M tokens
openai/text-embedding-3-small$0.02 / 1M tokens
openai/text-embedding-ada-002$0.10 / 1M tokens

Anthropic

Claude 4.6

  • anthropic/claude-opus-4-6

Claude 4.5

  • anthropic/claude-opus-4-5
  • anthropic/claude-sonnet-4-5-20250929
  • anthropic/claude-haiku-4-5-20251001

Claude 4

  • anthropic/claude-opus-4-1-20250805
  • anthropic/claude-opus-4-20250514

Claude 3.7

  • anthropic/claude-3-7-sonnet-20250219

Claude 3.5

  • anthropic/claude-3-5-haiku-20241022

Claude 3

  • anthropic/claude-3-haiku-20240307

Google

Gemini 3

  • google/gemini-3-pro-preview
  • google/gemini-3-flash-preview

Gemini 2.5

  • google/gemini-2.5-pro
  • google/gemini-2.5-flash
  • google/gemini-2.5-flash-lite

Gemini 2.0

  • google/gemini-2.0-flash
  • google/gemini-2.0-flash-exp
  • google/gemini-2.0-flash-001
  • google/gemini-2.0-flash-lite

Embeddings

  • google/text-embedding-004

xAI

Grok 4

  • xai/grok-4-1-fast-reasoning
  • xai/grok-4-1-fast-non-reasoning
  • xai/grok-4-fast-reasoning
  • xai/grok-4-fast-non-reasoning
  • xai/grok-code-fast-1
  • xai/grok-4-0709

Grok 3

  • xai/grok-3
  • xai/grok-3-mini

Grok 2

  • xai/grok-2-vision-1212

DeepSeek

  • deepseek/deepseek-chat
  • deepseek/deepseek-reasoner
  • deepseek/deepseek-coder

Mistral

  • mistral/mistral-large-latest
  • mistral/mistral-medium-latest
  • mistral/mistral-small-latest
  • mistral/codestral-2508
  • mistral/open-mistral-nemo-2407
  • mistral/pixtral-12b

Groq

Lightning-fast inference for open-source models.
  • groq/llama-3.1-8b-instant
  • groq/llama-3.3-70b-versatile
  • groq/openai/gpt-oss-120b
  • groq/openai/gpt-oss-20b
  • groq/whisper-large-v3
  • groq/whisper-large-v3-turbo

Cerebras

Ultra-fast inference on custom silicon.

Production

  • cerebras/llama3.1-8b
  • cerebras/llama-3.3-70b
  • cerebras/gpt-oss-120b
  • cerebras/qwen-3-32b

Preview

  • cerebras/qwen-3-235b-a22b-instruct-2507
  • cerebras/zai-glm-4.7

Moonshot

  • moonshot/kimi-k2.5
  • moonshot/kimi-k2-0905-preview
  • moonshot/kimi-k2-0711-preview
  • moonshot/kimi-k2-turbo-preview
  • moonshot/kimi-k2-thinking
  • moonshot/kimi-k2-thinking-turbo

Recommendations by use case

Tool calling and function use

  • anthropic/claude-opus-4-6 - strongest tool calling with structured outputs
  • anthropic/claude-sonnet-4-5-20250929 - fast, reliable tool use
  • openai/gpt-5.2 - native function calling with structured responses
  • openai/gpt-4o - reliable for production tool workflows
  • deepseek/deepseek-chat - multi-step reasoning with tools

Coding

  • openai/gpt-5-codex - purpose-built for code generation
  • deepseek/deepseek-coder - strong code-focused model
  • anthropic/claude-opus-4-6 - excellent code understanding
  • xai/grok-code-fast-1 - fast code-focused inference
  • mistral/codestral-2508 - open-source coding model

Reasoning

  • openai/o3 - deep reasoning for complex problems
  • openai/o1 - multi-step chain-of-thought
  • anthropic/claude-opus-4-6 - advanced reasoning capabilities
  • deepseek/deepseek-reasoner - specialized reasoning model
  • xai/grok-4-1-fast-reasoning - optimized for reasoning tasks

Speed and throughput

  • anthropic/claude-haiku-4-5-20251001 - fast Claude at lower cost
  • google/gemini-2.5-flash - optimized for throughput
  • openai/gpt-5-mini - lightweight, fast
  • openai/gpt-5-nano - ultra-fast for simple tasks
  • groq/llama-3.3-70b-versatile - Groq-accelerated open-source

Long context

  • google/gemini-3-pro-preview - 1M+ token context
  • google/gemini-2.5-pro - extended context with strong reasoning
  • anthropic/claude-opus-4-6 - long-context analysis
  • anthropic/claude-sonnet-4-5-20250929 - fast long-context

Vision and multimodal

  • openai/gpt-5.2 - strong multimodal capabilities
  • anthropic/claude-opus-4-6 - advanced vision understanding
  • google/gemini-3-pro-preview - vision plus long context
  • xai/grok-2-vision-1212 - multimodal with Grok
  • openai/gpt-4o - reliable vision for production
Most providers ship multiple tiers (mini / standard / pro / opus). Start on the smaller tier, scale up only when the benchmarks justify it.
Last modified on May 2, 2026