Models — ArcaneAPI

Orpheus v1

Canopy Labs

audio Active

Directorial TTS with bracketed emotional cues. 100 chars/sec.

Price $0.022 per 1K chars

orpheus-v1

Eleven Multilingual v3

ElevenLabs

audio Active

Expressive TTS with emotional control. Sighs, whispers, cues.

Price $0.12 per 1K chars

eleven-multilingual-v3

GPT Realtime

OpenAI

audio Active

Native bidirectional audio streaming. Sub-200ms latency.

Price $32 / $64 per 1M

gpt-realtime

Whisper Large v3 Turbo

OpenAI

audio Active

228x speed transcription on Groq. Ultra-low cost STT.

Price $0.04 per hour

whisper-large-v3-turbo

Qwen 3 235B

Alibaba

chat Active

Large MoE (22B active) for complex multilingual tasks.

Context 131,072

Price $0.2 / $0.6 per 1M

qwen3-235b-a22b

Qwen 3 32B

Alibaba

chat Active

Dual-mode thinking/non-thinking. 662 TPS on Groq hardware.

Context 131,072

Price $0.29 / $0.59 per 1M

qwen3-32b

Claude Haiku 4.5

Anthropic

chat Active

Ultra-fast responses at low cost. Ideal for high-throughput.

Context 200,000

Price $1 / $5 per 1M

claude-haiku-4-5

Claude Opus 4.6

Anthropic

chat Active

Most powerful Claude. Extended thinking, 1M beta context window.

Context 200,000

Price $5 / $25 per 1M

claude-opus-4-6

Claude Sonnet 4.6

Anthropic

chat Active

Balanced performance and speed for enterprise workloads.

Context 200,000

Price $3 / $15 per 1M

claude-sonnet-4-6

DeepSeek V3.1

DeepSeek

chat Active

671B MoE (37B active). Extreme efficiency with sparse attention.

Context 128,000

Price $0.6 / $1.7 per 1M

deepseek-v3.1

Gemini 2.5 Flash Lite

Google

chat Active

Fastest Gemini variant at near-zero cost.

Context 2,000,000

Price $0.1 / $0.4 per 1M

gemini-2.5-flash-lite

Gemini 2.5 Pro

Google

chat Active

2M context with native Google Search grounding.

Context 2,000,000

Price $1.25 / $10 per 1M

gemini-2.5-pro

Gemini 3.1 Pro

Google

chat Active

Latest Gemini with advanced vibe coding and multimodality.

Context 2,000,000

Price $2 / $12 per 1M

gemini-3.1-pro-preview

Llama 4 Maverick

Llama 4 Scout

Mistral Large 3

Mistral AI

chat Active

Enterprise-grade with 256K context. EU data sovereignty.

Context 256,000

Price $2 / $6 per 1M

mistral-large-latest

Mistral Small 3

Mistral AI

chat Active

Efficient model for routine tasks and high volume.

Context 128,000

Price $0.2 / $0.6 per 1M

mistral-small-3

Kimi K2

Moonshot AI

chat Active

1T MoE (32B active). Excels at frontend dev and tool calling.

Context 262,144

Price $1 / $3 per 1M

kimi-k2-instruct

GPT-4.1

OpenAI

chat Active

Reliable general-purpose model with function calling and vision.

Context 128,000

Price $2 / $8 per 1M

gpt-4.1

GPT-5 Mini

OpenAI

chat Active

Cost-efficient model for high-volume production tasks.

Context 128,000

Price $0.25 / $2 per 1M

gpt-5-mini

GPT-5 Nano

OpenAI

chat Active

Ultra-low cost for triage, extraction, and metadata tasks.

Context 128,000

Price $0.05 / $0.4 per 1M

gpt-5-nano

GPT-5.2

OpenAI

chat Active

Latest OpenAI flagship. 400K context with prompt caching support.

Context 400,000

Price $1.75 / $14 per 1M

gpt-5.2

GPT-OSS 120B

OpenAI

chat Active

Open-weight MoE (5.1B active). Optimized for agentic workflows.

Context 131,072

Price $0.15 / $0.6 per 1M

gpt-oss-120b

GPT-OSS 20B

OpenAI

chat Active

Compact 20B model. Over 1,000 TPS on Groq LPU hardware.

Context 131,072

Price $0.075 / $0.3 per 1M

gpt-oss-20b

Grok 4

xAI

chat Active

2M context with fast reasoning and competitive output pricing.

Context 2,000,000

Price $3 / $12 per 1M

grok-4

Qwen 3 Coder 480B

Alibaba

code Active

Massive 480B coding MoE (35B active). Top benchmark scores.

Context 131,072

Price $2 / $2 per 1M

qwen3-coder-480b-a35b

Codestral

Mistral AI

code Active

Specialized for code generation, completion, and refactoring.

Context 256,000

Price $0.3 / $0.9 per 1M

codestral

Devstral 2

Mistral AI

code Active

Open model tailored for code agents and automation.

Context 128,000

Price $0.2 / $0.6 per 1M

devstral-2

Grok Code Fast

xAI

code Active

Optimized for fast code generation and debugging.

Context 131,072

Price $0.5 / $0.5 per 1M

grok-code-fast-1

Embed English v3

Cohere

embedding Active

1024-dim English embeddings. Native image embedding support.

Context 512

Price $0.1 per 1M tokens

embed-english-v3

Jina Embeddings v5

Jina AI

embedding Active

Task-specific LoRA adapters. 1024 dims, truncatable to 32.

Context 32,768

Price $0.045 per 1M tokens

jina-embeddings-v5

Text Embedding 3 Large

OpenAI

embedding Active

3072-dimensional high-accuracy embeddings.

Context 8,191

Price $0.13 per 1M tokens

text-embedding-3-large

Text Embedding 3 Small

OpenAI

embedding Active

Cost-efficient 1536-dimensional embeddings.

Context 8,191

Price $0.02 per 1M tokens

text-embedding-3-small

Voyage 4 Large

Voyage AI

embedding Active

First MoE embedding model. 1024 dims, Matryoshka support.

Context 32,000

Price $0.12 per 1M tokens

voyage-4-large

Voyage 4 Lite

Voyage AI

embedding Active

Cost-optimized embeddings with flexible dimensions.

Context 32,000

Price $0.02 per 1M tokens

voyage-4-lite

FLUX.1 Pro

Black Forest Labs

image Active

Professional quality generation with high detail.

Price $0.05 per MP

flux-1-pro

FLUX.1 Schnell

Black Forest Labs

image Active

Ultra-fast 4-step generation at minimal cost.

Price $0.003 per MP

flux-1-schnell

FLUX.2 Max

Black Forest Labs

image Active

Premium high-fidelity synthesis. 50 diffusion steps.

Price $0.07 per MP

flux-2-max

Ideogram 3.0

Ideogram

image Active

Best-in-class text rendering accuracy in generated images.

Price $0.06 per image

ideogram-3.0

Kling 2.1 Image

Kling AI

image Active

Text-to-image and multi-image style transfers.

Price $0.014 per image

kling-v2-1-image

GPT Image 1.5

OpenAI

image Active

Tokenized image generation. DALL-E successor with precise prompting.

Context 4,096

Price $5 / $10 per 1M

gpt-image-1.5

Stable Diffusion 3.5 Flash

Stability AI

image Active

Fast distilled generation. 2.5 credits per generation.

Price $0.025 per image

sd3.5-flash

Stable Diffusion 3.5 Large

Stability AI

image Active

High-fidelity diffusion model. 6.5 credits per generation.

Price $0.065 per image

sd3.5-large

Stable Diffusion 3.5 Medium

Stability AI

image Active

Balanced quality and speed. 3.5 credits per generation.

Price $0.035 per image

sd3.5-medium

DeepSeek R1

DeepSeek

reasoning Active

Reasoning specialist with 23K internal thinking tokens. AIME SOTA.

Context 128,000

Price $0.55 / $2.19 per 1M

deepseek-r1

O3

OpenAI

reasoning Active

Advanced reasoning for complex multi-step deduction.

Context 200,000

Price $2 / $8 per 1M

o3

O3 Pro

OpenAI

reasoning Active

Premium reasoning with highest accuracy. Math and crypto focus.

Context 200,000

Price $20 / $80 per 1M

o3-pro

O4 Mini

OpenAI

reasoning Active

Cost-effective reasoning model for everyday logic tasks.

Context 128,000

Price $1.1 / $4.4 per 1M

o4-mini

Veo 3.0

Google

video Active

Advanced video generation architecture via Runway API.

Price $0.4 per second

veo-3.0

Kling 2.1 Pro

Kling AI

video Active

Cinematic realism with complex camera motions. 1080p.

Price $0.05 per second

kling-2.1-pro

Sora 2

OpenAI

video Active

State-of-the-art physics simulation. 720p with synced audio.

Price $0.1 per second

sora-2

Sora 2 Pro

OpenAI

video Active

Premium 1080p video generation with native audio.

Price $0.5 per second

sora-2-pro

Gen-4 Turbo

Runway

video Active

Fast video generation. 5 credits per second.

Price $0.05 per second

gen-4-turbo

Model catalog

Orpheus v1

Eleven Multilingual v3

GPT Realtime

Whisper Large v3 Turbo

Qwen 3 235B

Qwen 3 32B

Claude Haiku 4.5

Claude Opus 4.6

Claude Sonnet 4.6

DeepSeek V3.1

Gemini 2.5 Flash Lite

Gemini 2.5 Pro

Gemini 3.1 Pro

Llama 4 Maverick

Llama 4 Scout

Mistral Large 3

Mistral Small 3

Kimi K2

GPT-4.1

GPT-5 Mini

GPT-5 Nano

GPT-5.2

GPT-OSS 120B

GPT-OSS 20B

Grok 4

Qwen 3 Coder 480B

Codestral

Devstral 2

Grok Code Fast

Embed English v3

Jina Embeddings v5

Text Embedding 3 Large

Text Embedding 3 Small

Voyage 4 Large

Voyage 4 Lite

FLUX.1 Pro

FLUX.1 Schnell

FLUX.2 Max

Ideogram 3.0

Kling 2.1 Image

GPT Image 1.5

Stable Diffusion 3.5 Flash

Stable Diffusion 3.5 Large

Stable Diffusion 3.5 Medium

DeepSeek R1

O3

O3 Pro

O4 Mini

Veo 3.0

Kling 2.1 Pro

Sora 2

Sora 2 Pro

Gen-4 Turbo