Model catalog

21 models available across 9 providers. One API, consistent format.

Qwen 3 235B

Alibaba

chat Active

Large MoE (22B active) for complex multilingual tasks.

Context 131,072
Price $0.2 / $0.6 per 1M
qwen3-235b-a22b

Qwen 3 32B

Alibaba

chat Active

Dual-mode thinking/non-thinking. 662 TPS on Groq hardware.

Context 131,072
Price $0.29 / $0.59 per 1M
qwen3-32b

Claude Haiku 4.5

Anthropic

chat Active

Ultra-fast responses at low cost. Ideal for high-throughput.

Context 200,000
Price $1 / $5 per 1M
claude-haiku-4-5

Claude Opus 4.6

Anthropic

chat Active

Most powerful Claude. Extended thinking, 1M beta context window.

Context 200,000
Price $5 / $25 per 1M
claude-opus-4-6

Claude Sonnet 4.6

Anthropic

chat Active

Balanced performance and speed for enterprise workloads.

Context 200,000
Price $3 / $15 per 1M
claude-sonnet-4-6

DeepSeek V3.1

DeepSeek

chat Active

671B MoE (37B active). Extreme efficiency with sparse attention.

Context 128,000
Price $0.6 / $1.7 per 1M
deepseek-v3.1

Gemini 2.5 Flash Lite

Google

chat Active

Fastest Gemini variant at near-zero cost.

Context 2,000,000
Price $0.1 / $0.4 per 1M
gemini-2.5-flash-lite

Gemini 2.5 Pro

Google

chat Active

2M context with native Google Search grounding.

Context 2,000,000
Price $1.25 / $10 per 1M
gemini-2.5-pro

Gemini 3.1 Pro

Google

chat Active

Latest Gemini with advanced vibe coding and multimodality.

Context 2,000,000
Price $2 / $12 per 1M
gemini-3.1-pro-preview

Llama 4 Maverick

Meta

chat Active

400B MoE (17B active). Native multimodal, 562 TPS on Groq.

Context 131,072
Price $0.2 / $0.6 per 1M
llama-4-maverick-17b-128e

Llama 4 Scout

Meta

chat Active

109B MoE (17B active). Lean multimodal, near 600 TPS.

Context 131,072
Price $0.11 / $0.34 per 1M
llama-4-scout-17b-16e

Mistral Large 3

Mistral AI

chat Active

Enterprise-grade with 256K context. EU data sovereignty.

Context 256,000
Price $2 / $6 per 1M
mistral-large-latest

Mistral Small 3

Mistral AI

chat Active

Efficient model for routine tasks and high volume.

Context 128,000
Price $0.2 / $0.6 per 1M
mistral-small-3

Kimi K2

Moonshot AI

chat Active

1T MoE (32B active). Excels at frontend dev and tool calling.

Context 262,144
Price $1 / $3 per 1M
kimi-k2-instruct

GPT-4.1

OpenAI

chat Active

Reliable general-purpose model with function calling and vision.

Context 128,000
Price $2 / $8 per 1M
gpt-4.1

GPT-5 Mini

OpenAI

chat Active

Cost-efficient model for high-volume production tasks.

Context 128,000
Price $0.25 / $2 per 1M
gpt-5-mini

GPT-5 Nano

OpenAI

chat Active

Ultra-low cost for triage, extraction, and metadata tasks.

Context 128,000
Price $0.05 / $0.4 per 1M
gpt-5-nano

GPT-5.2

OpenAI

chat Active

Latest OpenAI flagship. 400K context with prompt caching support.

Context 400,000
Price $1.75 / $14 per 1M
gpt-5.2

GPT-OSS 120B

OpenAI

chat Active

Open-weight MoE (5.1B active). Optimized for agentic workflows.

Context 131,072
Price $0.15 / $0.6 per 1M
gpt-oss-120b

GPT-OSS 20B

OpenAI

chat Active

Compact 20B model. Over 1,000 TPS on Groq LPU hardware.

Context 131,072
Price $0.075 / $0.3 per 1M
gpt-oss-20b

Grok 4

xAI

chat Active

2M context with fast reasoning and competitive output pricing.

Context 2,000,000
Price $3 / $12 per 1M
grok-4