Model Providers - Dedalus Docs

OpenAI

OPENAI_API_KEY

Anthropic

ANTHROPIC_API_KEY

Google Gemini

GOOGLE_API_KEY

Fireworks AI

FIREWORKS_API_KEY

xAI

XAI_API_KEY

Perplexity

PERPLEXITY_API_KEY

DeepSeek

DEEPSEEK_API_KEY

Groq

GROQ_API_KEY

Cohere

COHERE_API_KEY

Cerebras

CEREBRAS_API_KEY

Mistral

MISTRAL_API_KEY

Moonshot

MOONSHOT_API_KEY

Model Recommendations by Use Case

Choosing the right model depends on your specific requirements. Here’s a guide to help you select the best provider and model for your needs:

Tool Calling & Function Use

Best for: Building agents and applications that need to call external tools or functions

anthropic/claude-opus-4-5 - Excellent tool calling reliability with structured outputs
anthropic/claude-sonnet-4-5-20250929 - Strong tool use with fast performance
openai/gpt-5 - Native function calling support with structured responses
openai/gpt-4o - Reliable tool calling for production applications
deepseek/deepseek-chat - Advanced tool use with multi-step reasoning

Coding & Development

Best for: Code generation, debugging, and technical implementations

deepseek/deepseek-coder - Purpose-built for coding tasks
openai/gpt-5-codex - Specialized for code generation and completion
anthropic/claude-opus-4-5 - Strong code understanding and generation
anthropic/claude-sonnet-4-5-20250929 - Excellent coding with faster responses
xai/grok-code-fast-1 - Fast code-focused model

Reasoning & Complex Problem Solving

Best for: Mathematical reasoning, logical analysis, and complex decision-making

anthropic/claude-opus-4-5 - Advanced reasoning capabilities
openai/o3 - Deep reasoning for complex problems
openai/o1 - Strong multi-step reasoning
deepseek/deepseek-reasoner - Specialized reasoning model
xai/grok-4-fast-reasoning - Optimized for reasoning tasks

Speed & Efficiency

Best for: High-throughput applications requiring fast responses

anthropic/claude-haiku-4-5-20251001 - Fast performance at lower cost
google/gemini-2.5-flash - Optimized for throughput and low latency
openai/gpt-5-mini - Lightweight, fast model
openai/gpt-5-nano - Ultra-fast for simple tasks
xai/grok-4-fast-non-reasoning - Quick responses without extended reasoning

Long Context Tasks

Best for: Processing large documents, codebases, or extended conversations

google/gemini-2.5-pro - Up to 1M+ token context window
google/gemini-2.0-flash - Large context with fast performance
anthropic/claude-opus-4-5 - Extended context for complex analysis
anthropic/claude-sonnet-4-5-20250929 - Strong long-context capabilities
openai/gpt-4-32k - Extended 32K context window

Vision & Multimodal

Best for: Image understanding, document analysis, and visual tasks

openai/gpt-4o - Strong vision capabilities with chat
anthropic/claude-opus-4-5 - Advanced multimodal understanding
anthropic/claude-sonnet-4-5-20250929 - Multimodal with fast performance
google/gemini-2.5-pro - Advanced vision and multimodal processing
xai/grok-2-vision-1212 - Multimodal understanding

Many providers offer multiple model tiers (e.g., mini, standard, pro, opus) that balance cost, speed, and capability. Start with smaller models for testing and scale up based on your performance requirements.

Supported Models

Programmatic Discovery: Use GET /v1/models to list all hundreds models with capabilities (vision, tools, thinking, streaming) and routing metadata. Perfect for building model selectors or auto-populating dropdowns in tools like n8n.

OpenAI

Chat Models

openai/gpt-5.2
openai/gpt-5.1
openai/gpt-5
openai/gpt-5-mini
openai/gpt-5-nano
openai/gpt-5-chat-latest
openai/gpt-4.1
openai/gpt-4.1-mini
openai/gpt-4.1-nano
openai/gpt-4o
openai/gpt-4o-2024-05-13
openai/gpt-5.2
openai/gpt-4o-search-preview
openai/gpt-4o-mini-search-preview
openai/chatgpt-4o-latest
openai/gpt-4-turbo
openai/gpt-4-turbo-2024-04-09
openai/gpt-4
openai/gpt-4-0125-preview
openai/gpt-4-1106-preview
openai/gpt-4-0613
openai/gpt-3.5-turbo
openai/gpt-3.5-turbo-0125
openai/gpt-3.5-turbo-1106

Reasoning Models

openai/o1
openai/o3
openai/o3-mini
openai/o4-mini

Image Generation

openai/dall-e-3

Audio Transcription

openai/whisper-1

Embedding Models

Model	Price
`openai/text-embedding-3-large`	$0.13 / 1M tokens
`openai/text-embedding-3-small`	$0.02 / 1M tokens
`openai/text-embedding-ada-002`	$0.10 / 1M tokens

Anthropic (Claude)

Claude 4.6 Series

anthropic/claude-opus-4-6

Claude 4.5 Series

anthropic/claude-opus-4-5
anthropic/claude-haiku-4-5-20251001
anthropic/claude-sonnet-4-5-20250929

Claude 4 Series

anthropic/claude-opus-4-1-20250805
anthropic/claude-opus-4-20250514
anthropic/claude-opus-4-5

Claude 3.7 Series

anthropic/claude-3-7-sonnet-20250219

Claude 3.5 Series

anthropic/claude-3-5-haiku-20241022

Claude 3 Series

anthropic/claude-3-haiku-20240307

Google (Gemini)

Gemini 3 Series

google/gemini-3-pro-preview
google/gemini-3-flash-preview

Gemini 2.5 Series

google/gemini-2.5-pro
google/gemini-2.5-flash
google/gemini-2.5-flash-lite

Gemini 2.0 Series

google/gemini-2.0-flash
google/gemini-2.0-flash-exp
google/gemini-2.0-flash-001
google/gemini-2.0-flash-lite

Embedding Models

google/text-embedding-004

xAI (Grok)

Grok 4 Series

xai/grok-4-1-fast-reasoning
xai/grok-4-1-fast-non-reasoning
xai/grok-4-fast-reasoning
xai/grok-4-fast-non-reasoning
xai/grok-code-fast-1
xai/grok-4-0709

Grok 3 Series

xai/grok-3
xai/grok-3-mini

Grok 2 Series

xai/grok-2-vision-1212

DeepSeek

deepseek/deepseek-chat
deepseek/deepseek-reasoner
deepseek/deepseek-coder

Mistral

mistral/mistral-large-latest
mistral/mistral-medium-latest
mistral/mistral-small-latest
mistral/codestral-2508
mistral/open-mistral-nemo-2407
mistral/pixtral-12b

Groq

Lightning-fast inference for open source models.

groq/llama-3.1-8b-instant
groq/llama-3.3-70b-versatile
groq/openai/gpt-oss-120b
groq/openai/gpt-oss-20b
groq/whisper-large-v3
groq/whisper-large-v3-turbo

Cerebras

Ultra-fast inference on custom silicon.

Production Models

cerebras/llama3.1-8b
cerebras/llama-3.3-70b
cerebras/gpt-oss-120b
cerebras/qwen-3-32b

Preview Models

cerebras/qwen-3-235b-a22b-instruct-2507
cerebras/zai-glm-4.7

Moonshot (Kimi)

Advanced reasoning and extended context from Moonshot AI.

moonshot/kimi-k2.5
moonshot/kimi-k2-0905-preview
moonshot/kimi-k2-0711-preview
moonshot/kimi-k2-turbo-preview
moonshot/kimi-k2-thinking
moonshot/kimi-k2-thinking-turbo

OpenAI

Anthropic

Google Gemini

Fireworks AI

xAI

Perplexity

DeepSeek

Groq

Cohere

Cerebras

Mistral

Moonshot

​Model Recommendations by Use Case

​Tool Calling & Function Use

​Coding & Development

​Reasoning & Complex Problem Solving

​Speed & Efficiency

​Long Context Tasks

​Vision & Multimodal

​Supported Models

​OpenAI

​Chat Models

​Reasoning Models

​Image Generation

​Audio Transcription

​Embedding Models

​Anthropic (Claude)

​Claude 4.6 Series

​Claude 4.5 Series

​Claude 4 Series

​Claude 3.7 Series

​Claude 3.5 Series

​Claude 3 Series

​Google (Gemini)

​Gemini 3 Series

​Gemini 2.5 Series

​Gemini 2.0 Series

​Embedding Models

​xAI (Grok)

​Grok 4 Series

​Grok 3 Series

​Grok 2 Series

​DeepSeek

​Mistral

​Groq

​Cerebras

​Production Models

​Preview Models

​Moonshot (Kimi)

Model Recommendations by Use Case

Tool Calling & Function Use

Coding & Development

Reasoning & Complex Problem Solving

Speed & Efficiency

Long Context Tasks

Vision & Multimodal

Supported Models

OpenAI

Chat Models

Reasoning Models

Image Generation

Audio Transcription

Embedding Models

Anthropic (Claude)

Claude 4.6 Series

Claude 4.5 Series

Claude 4 Series

Claude 3.7 Series

Claude 3.5 Series

Claude 3 Series

Google (Gemini)

Gemini 3 Series

Gemini 2.5 Series

Gemini 2.0 Series

Embedding Models

xAI (Grok)

Grok 4 Series

Grok 3 Series

Grok 2 Series

DeepSeek

Mistral

Groq

Cerebras

Production Models

Preview Models

Moonshot (Kimi)