All 8 Groq text models with token prices ($ per 1M), context window and max output, refreshed daily from the official pricing source. Bring your Groq key to flo2 and you pay these exact prices — zero markup — with fallback, racing and per-request cost accounting on top.
| Model | Context | Max out | In $/M | Cached in | Out $/M | Reasoning |
|---|---|---|---|---|---|---|
| moonshotai/kimi-k2-instruct-0905 | — | — | $1 | $0.5 | $3 | — |
| llama-3.3-70b-versatile | 128K | — | $0.59 | — | $0.79 | — |
| openai/gpt-oss-120b | 128K | — | $0.15 | $0.075 | $0.6 | ✓ |
| qwen/qwen3-32b | 131K | — | $0.29 | — | $0.59 | ✓ |
| meta-llama/llama-4-scout-17b-16e-instruct | 128K | — | $0.11 | — | $0.34 | — |
| openai/gpt-oss-20b | 128K | — | $0.075 | $0.0375 | $0.3 | ✓ |
| openai/gpt-oss-safeguard-20b | — | — | $0.075 | — | $0.3 | ✓ |
| llama-3.1-8b-instant | 128K | — | $0.05 | — | $0.08 | — |
Prices in USD per 1,000,000 tokens, fetched 2026-06-12 from the official Groq pricing source. Verify before large commitments. Click any column header to sort.
More providers: OpenAI · Anthropic · Google Gemini · xAI Grok · Cerebras · Mistral · DeepInfra · OpenRouter · NVIDIA NIM · or the full cross-provider comparison.