Models — ArcaneAPI

Qwen 3 235B

Alibaba

chat Active

Large MoE (22B active) for complex multilingual tasks.

Context 131,072

Price $0.2 / $0.6 per 1M

qwen3-235b-a22b

Qwen 3 32B

Alibaba

chat Active

Dual-mode thinking/non-thinking. 662 TPS on Groq hardware.

Context 131,072

Price $0.29 / $0.59 per 1M

qwen3-32b

Claude Haiku 4.5

Anthropic

chat Active

Ultra-fast responses at low cost. Ideal for high-throughput.

Context 200,000

Price $1 / $5 per 1M

claude-haiku-4-5

Claude Opus 4.6

Anthropic

chat Active

Most powerful Claude. Extended thinking, 1M beta context window.

Context 200,000

Price $5 / $25 per 1M

claude-opus-4-6

Claude Sonnet 4.6

Anthropic

chat Active

Balanced performance and speed for enterprise workloads.

Context 200,000

Price $3 / $15 per 1M

claude-sonnet-4-6

DeepSeek V3.1

DeepSeek

chat Active

671B MoE (37B active). Extreme efficiency with sparse attention.

Context 128,000

Price $0.6 / $1.7 per 1M

deepseek-v3.1

Gemini 2.5 Flash Lite

Google

chat Active

Fastest Gemini variant at near-zero cost.

Context 2,000,000

Price $0.1 / $0.4 per 1M

gemini-2.5-flash-lite

Gemini 2.5 Pro

Google

chat Active

2M context with native Google Search grounding.

Context 2,000,000

Price $1.25 / $10 per 1M

gemini-2.5-pro

Gemini 3.1 Pro

Google

chat Active

Latest Gemini with advanced vibe coding and multimodality.

Context 2,000,000

Price $2 / $12 per 1M

gemini-3.1-pro-preview

Llama 4 Maverick

Llama 4 Scout

Mistral Large 3

Mistral AI

chat Active

Enterprise-grade with 256K context. EU data sovereignty.

Context 256,000

Price $2 / $6 per 1M

mistral-large-latest

Mistral Small 3

Mistral AI

chat Active

Efficient model for routine tasks and high volume.

Context 128,000

Price $0.2 / $0.6 per 1M

mistral-small-3

Kimi K2

Moonshot AI

chat Active

1T MoE (32B active). Excels at frontend dev and tool calling.

Context 262,144

Price $1 / $3 per 1M

kimi-k2-instruct

GPT-4.1

OpenAI

chat Active

Reliable general-purpose model with function calling and vision.

Context 128,000

Price $2 / $8 per 1M

gpt-4.1

GPT-5 Mini

OpenAI

chat Active

Cost-efficient model for high-volume production tasks.

Context 128,000

Price $0.25 / $2 per 1M

gpt-5-mini

GPT-5 Nano

OpenAI

chat Active

Ultra-low cost for triage, extraction, and metadata tasks.

Context 128,000

Price $0.05 / $0.4 per 1M

gpt-5-nano

GPT-5.2

OpenAI

chat Active

Latest OpenAI flagship. 400K context with prompt caching support.

Context 400,000

Price $1.75 / $14 per 1M

gpt-5.2

GPT-OSS 120B

OpenAI

chat Active

Open-weight MoE (5.1B active). Optimized for agentic workflows.

Context 131,072

Price $0.15 / $0.6 per 1M

gpt-oss-120b

GPT-OSS 20B

OpenAI

chat Active

Compact 20B model. Over 1,000 TPS on Groq LPU hardware.

Context 131,072

Price $0.075 / $0.3 per 1M

gpt-oss-20b

Grok 4

xAI

chat Active

2M context with fast reasoning and competitive output pricing.

Context 2,000,000

Price $3 / $12 per 1M

grok-4

Model catalog

Qwen 3 235B

Qwen 3 32B

Claude Haiku 4.5

Claude Opus 4.6

Claude Sonnet 4.6

DeepSeek V3.1

Gemini 2.5 Flash Lite

Gemini 2.5 Pro

Gemini 3.1 Pro

Llama 4 Maverick

Llama 4 Scout

Mistral Large 3

Mistral Small 3

Kimi K2

GPT-4.1

GPT-5 Mini

GPT-5 Nano

GPT-5.2

GPT-OSS 120B

GPT-OSS 20B

Grok 4