Cerebras pricing · auto-updated daily

Cerebras API pricing — June 12, 2026

Both Cerebras text models, with token prices (USD per 1M tokens), context window and max output, refreshed daily from the official pricing source. Bring your Cerebras key to flo2 and you pay these exact prices, with zero markup, plus fallback, racing and per-request cost accounting on top.

Cheapest output: $0.75/M (gpt-oss-120b)
Cheapest input: $0.35/M (gpt-oss-120b)
Biggest context: 131K tokens (zai-glm-4.7)
Models: 2, tracked daily

Model         Context  Max out  In $/M  Cached in  Out $/M  Reasoning
zai-glm-4.7   131K     40K      $2.25   -          $2.75    -
gpt-oss-120b  131K     40K      $0.35   -          $0.75    -

Prices in USD per 1,000,000 tokens, fetched 2026-06-12 from the official Cerebras pricing source. Verify before large commitments.
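
Because prices are quoted per 1,000,000 tokens, the cost of a single request is the token count times the price, divided by one million, summed over input and output. A minimal sketch of that arithmetic, using the two rows from the table above; the function name `request_cost` and the example token counts are illustrative assumptions, and cached-input or reasoning pricing is not modeled because the table lists no values for them:

```python
from decimal import Decimal

# Prices in USD per 1,000,000 tokens, copied from the table (fetched 2026-06-12).
PRICES_PER_M = {
    "zai-glm-4.7": {"in": Decimal("2.25"), "out": Decimal("2.75")},
    "gpt-oss-120b": {"in": Decimal("0.35"), "out": Decimal("0.75")},
}

TOKENS_PER_M = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request at the listed per-1M-token prices.

    Hypothetical helper for illustration; not part of any Cerebras or flo2 API.
    """
    price = PRICES_PER_M[model]
    total = price["in"] * input_tokens + price["out"] * output_tokens
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example workload (assumed): 10,000 input tokens and 2,000 output tokens.
    for model in PRICES_PER_M:
        cost = request_cost(model, input_tokens=10_000, output_tokens=2_000)
        print(f"{model}: ${cost}")
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding drift when many requests are summed.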


Use Cerebras through one key — zero markup.
flo2 routes every call to the cheapest, fastest model that clears your bar, with fallback, racing and true cost accounting. Free during Beta.
© 2026 flo2.com, the zero-markup LLM gateway & router.