Model catalog

53 models available across 19 providers. One API, consistent format.

Orpheus v1

Canopy Labs

audio Active

Directorial TTS with bracketed emotional cues. 100 chars/sec.

Price $0.022 per 1K chars
orpheus-v1

Eleven Multilingual v3

ElevenLabs

audio Active

Expressive TTS with emotional control. Sighs, whispers, cues.

Price $0.12 per 1K chars
eleven-multilingual-v3

GPT Realtime

OpenAI

audio Active

Native bidirectional audio streaming. Sub-200ms latency.

Price $32 / $64 per 1M
gpt-realtime

Whisper Large v3 Turbo

OpenAI

audio Active

228x speed transcription on Groq. Ultra-low cost STT.

Price $0.04 per hour
whisper-large-v3-turbo

Qwen 3 235B

Alibaba

chat Active

Large MoE (22B active) for complex multilingual tasks.

Context 131,072
Price $0.2 / $0.6 per 1M
qwen3-235b-a22b

Qwen 3 32B

Alibaba

chat Active

Dual-mode thinking/non-thinking. 662 TPS on Groq hardware.

Context 131,072
Price $0.29 / $0.59 per 1M
qwen3-32b

Claude Haiku 4.5

Anthropic

chat Active

Ultra-fast responses at low cost. Ideal for high-throughput.

Context 200,000
Price $1 / $5 per 1M
claude-haiku-4-5

Claude Opus 4.6

Anthropic

chat Active

Most powerful Claude. Extended thinking, 1M beta context window.

Context 200,000
Price $5 / $25 per 1M
claude-opus-4-6

Claude Sonnet 4.6

Anthropic

chat Active

Balanced performance and speed for enterprise workloads.

Context 200,000
Price $3 / $15 per 1M
claude-sonnet-4-6

DeepSeek V3.1

DeepSeek

chat Active

671B MoE (37B active). Extreme efficiency with sparse attention.

Context 128,000
Price $0.6 / $1.7 per 1M
deepseek-v3.1

Gemini 2.5 Flash Lite

Google

chat Active

Fastest Gemini variant at near-zero cost.

Context 2,000,000
Price $0.1 / $0.4 per 1M
gemini-2.5-flash-lite

Gemini 2.5 Pro

Google

chat Active

2M context with native Google Search grounding.

Context 2,000,000
Price $1.25 / $10 per 1M
gemini-2.5-pro

Gemini 3.1 Pro

Google

chat Active

Latest Gemini with advanced vibe coding and multimodality.

Context 2,000,000
Price $2 / $12 per 1M
gemini-3.1-pro-preview

Llama 4 Maverick

Meta

chat Active

400B MoE (17B active). Native multimodal, 562 TPS on Groq.

Context 131,072
Price $0.2 / $0.6 per 1M
llama-4-maverick-17b-128e

Llama 4 Scout

Meta

chat Active

109B MoE (17B active). Lean multimodal, near 600 TPS.

Context 131,072
Price $0.11 / $0.34 per 1M
llama-4-scout-17b-16e

Mistral Large 3

Mistral AI

chat Active

Enterprise-grade with 256K context. EU data sovereignty.

Context 256,000
Price $2 / $6 per 1M
mistral-large-latest

Mistral Small 3

Mistral AI

chat Active

Efficient model for routine tasks and high volume.

Context 128,000
Price $0.2 / $0.6 per 1M
mistral-small-3

Kimi K2

Moonshot AI

chat Active

1T MoE (32B active). Excels at frontend dev and tool calling.

Context 262,144
Price $1 / $3 per 1M
kimi-k2-instruct

GPT-4.1

OpenAI

chat Active

Reliable general-purpose model with function calling and vision.

Context 128,000
Price $2 / $8 per 1M
gpt-4.1

GPT-5 Mini

OpenAI

chat Active

Cost-efficient model for high-volume production tasks.

Context 128,000
Price $0.25 / $2 per 1M
gpt-5-mini

GPT-5 Nano

OpenAI

chat Active

Ultra-low cost for triage, extraction, and metadata tasks.

Context 128,000
Price $0.05 / $0.4 per 1M
gpt-5-nano

GPT-5.2

OpenAI

chat Active

Latest OpenAI flagship. 400K context with prompt caching support.

Context 400,000
Price $1.75 / $14 per 1M
gpt-5.2

GPT-OSS 120B

OpenAI

chat Active

Open-weight MoE (5.1B active). Optimized for agentic workflows.

Context 131,072
Price $0.15 / $0.6 per 1M
gpt-oss-120b

GPT-OSS 20B

OpenAI

chat Active

Compact 20B model. Over 1,000 TPS on Groq LPU hardware.

Context 131,072
Price $0.075 / $0.3 per 1M
gpt-oss-20b

Grok 4

xAI

chat Active

2M context with fast reasoning and competitive output pricing.

Context 2,000,000
Price $3 / $12 per 1M
grok-4

Qwen 3 Coder 480B

Alibaba

code Active

Massive 480B coding MoE (35B active). Top benchmark scores.

Context 131,072
Price $2 / $2 per 1M
qwen3-coder-480b-a35b

Codestral

Mistral AI

code Active

Specialized for code generation, completion, and refactoring.

Context 256,000
Price $0.3 / $0.9 per 1M
codestral

Devstral 2

Mistral AI

code Active

Open model tailored for code agents and automation.

Context 128,000
Price $0.2 / $0.6 per 1M
devstral-2

Grok Code Fast

xAI

code Active

Optimized for fast code generation and debugging.

Context 131,072
Price $0.5 / $0.5 per 1M
grok-code-fast-1

Embed English v3

Cohere

embedding Active

1024-dim English embeddings. Native image embedding support.

Context 512
Price $0.1 per 1M tokens
embed-english-v3

Jina Embeddings v5

Jina AI

embedding Active

Task-specific LoRA adapters. 1024 dims, truncatable to 32.

Context 32,768
Price $0.045 per 1M tokens
jina-embeddings-v5

Text Embedding 3 Large

OpenAI

embedding Active

3072-dimensional high-accuracy embeddings.

Context 8,191
Price $0.13 per 1M tokens
text-embedding-3-large

Text Embedding 3 Small

OpenAI

embedding Active

Cost-efficient 1536-dimensional embeddings.

Context 8,191
Price $0.02 per 1M tokens
text-embedding-3-small

Voyage 4 Large

Voyage AI

embedding Active

First MoE embedding model. 1024 dims, Matryoshka support.

Context 32,000
Price $0.12 per 1M tokens
voyage-4-large

Voyage 4 Lite

Voyage AI

embedding Active

Cost-optimized embeddings with flexible dimensions.

Context 32,000
Price $0.02 per 1M tokens
voyage-4-lite

FLUX.1 Pro

Black Forest Labs

image Active

Professional quality generation with high detail.

Price $0.05 per MP
flux-1-pro

FLUX.1 Schnell

Black Forest Labs

image Active

Ultra-fast 4-step generation at minimal cost.

Price $0.003 per MP
flux-1-schnell

FLUX.2 Max

Black Forest Labs

image Active

Premium high-fidelity synthesis. 50 diffusion steps.

Price $0.07 per MP
flux-2-max

Ideogram 3.0

Ideogram

image Active

Best-in-class text rendering accuracy in generated images.

Price $0.06 per image
ideogram-3.0

Kling 2.1 Image

Kling AI

image Active

Text-to-image and multi-image style transfers.

Price $0.014 per image
kling-v2-1-image

GPT Image 1.5

OpenAI

image Active

Tokenized image generation. DALL-E successor with precise prompting.

Context 4,096
Price $5 / $10 per 1M
gpt-image-1.5

Stable Diffusion 3.5 Flash

Stability AI

image Active

Fast distilled generation. 2.5 credits per generation.

Price $0.025 per image
sd3.5-flash

Stable Diffusion 3.5 Large

Stability AI

image Active

High-fidelity diffusion model. 6.5 credits per generation.

Price $0.065 per image
sd3.5-large

Stable Diffusion 3.5 Medium

Stability AI

image Active

Balanced quality and speed. 3.5 credits per generation.

Price $0.035 per image
sd3.5-medium

DeepSeek R1

DeepSeek

reasoning Active

Reasoning specialist with 23K internal thinking tokens. AIME SOTA.

Context 128,000
Price $0.55 / $2.19 per 1M
deepseek-r1

O3

OpenAI

reasoning Active

Advanced reasoning for complex multi-step deduction.

Context 200,000
Price $2 / $8 per 1M
o3

O3 Pro

OpenAI

reasoning Active

Premium reasoning with highest accuracy. Math and crypto focus.

Context 200,000
Price $20 / $80 per 1M
o3-pro

O4 Mini

OpenAI

reasoning Active

Cost-effective reasoning model for everyday logic tasks.

Context 128,000
Price $1.1 / $4.4 per 1M
o4-mini

Veo 3.0

Google

video Active

Advanced video generation architecture via Runway API.

Price $0.4 per second
veo-3.0

Kling 2.1 Pro

Kling AI

video Active

Cinematic realism with complex camera motions. 1080p.

Price $0.05 per second
kling-2.1-pro

Sora 2

OpenAI

video Active

State-of-the-art physics simulation. 720p with synced audio.

Price $0.1 per second
sora-2

Sora 2 Pro

OpenAI

video Active

Premium 1080p video generation with native audio.

Price $0.5 per second
sora-2-pro

Gen-4 Turbo

Runway

video Active

Fast video generation. 5 credits per second.

Price $0.05 per second
gen-4-turbo