Dedalus routes to every major LLM provider. Models are addressed with the formatDocumentation Index
Fetch the complete documentation index at: https://docs.dedaluslabs.ai/llms.txt
Use this file to discover all available pages before exploring further.
provider/model-name, and you switch providers by changing a string.
Programmatic discovery: fetch the live list with
GET /v1/models. Each model object includes capabilities (vision, tools, thinking, streaming) and provider_info (status, upstream API).Providers
OpenAI
openai/gpt-5.2Anthropic
anthropic/claude-opus-4-6google/gemini-3-pro-previewxAI
xai/grok-4-1-fast-reasoningDeepSeek
deepseek/deepseek-reasonerMistral
mistral/mistral-large-latestGroq
groq/llama-3.3-70b-versatileCerebras
cerebras/llama-3.3-70bMoonshot
moonshot/kimi-k2.5Perplexity
perplexity/sonar-proFireworks
fireworks/llama-v3p1-405bCohere
cohere/command-r-plusModel catalog
OpenAI
Chat
openai/gpt-5.2openai/gpt-5.1openai/gpt-5openai/gpt-5-miniopenai/gpt-5-nanoopenai/gpt-5-chat-latestopenai/gpt-4.1openai/gpt-4.1-miniopenai/gpt-4.1-nanoopenai/gpt-4oopenai/gpt-4o-2024-05-13openai/chatgpt-4o-latestopenai/gpt-4-turboopenai/gpt-4openai/gpt-3.5-turbo
Reasoning
openai/o1openai/o3openai/o3-miniopenai/o4-mini
Image generation
openai/dall-e-3
Audio transcription
openai/whisper-1
Embeddings
| Model | Price |
|---|---|
openai/text-embedding-3-large | $0.13 / 1M tokens |
openai/text-embedding-3-small | $0.02 / 1M tokens |
openai/text-embedding-ada-002 | $0.10 / 1M tokens |
Anthropic
Claude 4.6
anthropic/claude-opus-4-6
Claude 4.5
anthropic/claude-opus-4-5anthropic/claude-sonnet-4-5-20250929anthropic/claude-haiku-4-5-20251001
Claude 4
anthropic/claude-opus-4-1-20250805anthropic/claude-opus-4-20250514
Claude 3.7
anthropic/claude-3-7-sonnet-20250219
Claude 3.5
anthropic/claude-3-5-haiku-20241022
Claude 3
anthropic/claude-3-haiku-20240307
Gemini 3
google/gemini-3-pro-previewgoogle/gemini-3-flash-preview
Gemini 2.5
google/gemini-2.5-progoogle/gemini-2.5-flashgoogle/gemini-2.5-flash-lite
Gemini 2.0
google/gemini-2.0-flashgoogle/gemini-2.0-flash-expgoogle/gemini-2.0-flash-001google/gemini-2.0-flash-lite
Embeddings
google/text-embedding-004
xAI
Grok 4
xai/grok-4-1-fast-reasoningxai/grok-4-1-fast-non-reasoningxai/grok-4-fast-reasoningxai/grok-4-fast-non-reasoningxai/grok-code-fast-1xai/grok-4-0709
Grok 3
xai/grok-3xai/grok-3-mini
Grok 2
xai/grok-2-vision-1212
DeepSeek
deepseek/deepseek-chatdeepseek/deepseek-reasonerdeepseek/deepseek-coder
Mistral
mistral/mistral-large-latestmistral/mistral-medium-latestmistral/mistral-small-latestmistral/codestral-2508mistral/open-mistral-nemo-2407mistral/pixtral-12b
Groq
Lightning-fast inference for open-source models.groq/llama-3.1-8b-instantgroq/llama-3.3-70b-versatilegroq/openai/gpt-oss-120bgroq/openai/gpt-oss-20bgroq/whisper-large-v3groq/whisper-large-v3-turbo
Cerebras
Ultra-fast inference on custom silicon.Production
cerebras/llama3.1-8bcerebras/llama-3.3-70bcerebras/gpt-oss-120bcerebras/qwen-3-32b
Preview
cerebras/qwen-3-235b-a22b-instruct-2507cerebras/zai-glm-4.7
Moonshot
moonshot/kimi-k2.5moonshot/kimi-k2-0905-previewmoonshot/kimi-k2-0711-previewmoonshot/kimi-k2-turbo-previewmoonshot/kimi-k2-thinkingmoonshot/kimi-k2-thinking-turbo
Recommendations by use case
Tool calling and function use
anthropic/claude-opus-4-6- strongest tool calling with structured outputsanthropic/claude-sonnet-4-5-20250929- fast, reliable tool useopenai/gpt-5.2- native function calling with structured responsesopenai/gpt-4o- reliable for production tool workflowsdeepseek/deepseek-chat- multi-step reasoning with tools
Coding
openai/gpt-5-codex- purpose-built for code generationdeepseek/deepseek-coder- strong code-focused modelanthropic/claude-opus-4-6- excellent code understandingxai/grok-code-fast-1- fast code-focused inferencemistral/codestral-2508- open-source coding model
Reasoning
openai/o3- deep reasoning for complex problemsopenai/o1- multi-step chain-of-thoughtanthropic/claude-opus-4-6- advanced reasoning capabilitiesdeepseek/deepseek-reasoner- specialized reasoning modelxai/grok-4-1-fast-reasoning- optimized for reasoning tasks
Speed and throughput
anthropic/claude-haiku-4-5-20251001- fast Claude at lower costgoogle/gemini-2.5-flash- optimized for throughputopenai/gpt-5-mini- lightweight, fastopenai/gpt-5-nano- ultra-fast for simple tasksgroq/llama-3.3-70b-versatile- Groq-accelerated open-source
Long context
google/gemini-3-pro-preview- 1M+ token contextgoogle/gemini-2.5-pro- extended context with strong reasoninganthropic/claude-opus-4-6- long-context analysisanthropic/claude-sonnet-4-5-20250929- fast long-context
Vision and multimodal
openai/gpt-5.2- strong multimodal capabilitiesanthropic/claude-opus-4-6- advanced vision understandinggoogle/gemini-3-pro-preview- vision plus long contextxai/grok-2-vision-1212- multimodal with Grokopenai/gpt-4o- reliable vision for production
