Question 1

Is Groq or Together AI cheaper?

Accepted Answer

On the cheapest model from each, Groq wins on raw output price: Llama 3.1 8B Instant at $0.08 per 1M output tokens versus $0.12. But "cheaper per token" rarely means "cheaper for your app". The model that needs fewer retries, shorter prompts, or less reasoning can cost less in practice even at a higher sticker rate.

Question 2

How should I choose between Groq and Together AI on cost?

Accepted Answer

Don't choose on sticker price alone. The real comparison is total cost for your workload: tokens per request, request volume, caching, and how often each model gets the answer right the first time. Model your actual usage, or scan your code with PrePrice to get the per-user cost for each.

Model	In / 1M	Out / 1M
GPT OSS 20B	$0.075	$0.3
GPT OSS Safeguard 20B	$0.075	$0.3
GPT OSS 120B	$0.15	$0.6
Llama 4 Scout (17Bx16E)	$0.11	$0.34
Qwen3 32B	$0.29	$0.59
Llama 3.3 70B Versatile	$0.59	$0.79
Llama 3.1 8B Instant	$0.05	$0.08
Kimi K2	—	$3

Model	In / 1M	Out / 1M
Llama 3.3 70B	$3.5	$3.5
DeepSeek-R1-0528	$0.18	$0.88
DeepSeek V4 Pro	$0.6	$4.4
DeepSeek-V3.1	$0.6	$1.7
Qwen3.5-397B-A17B	$0.6	$3.6
Qwen3 235B A22B FP8 Throughput	$0.2	$0.6
Qwen3.6-Plus	$0.5	$3
Kimi K2.6	$1.2	$4.5
Kimi K2.5	$0.5	$2.8
MiniMax M2.7	$0.3	$1.2
GLM-5.1	$1.4	$4.4
gpt-oss-120B	$0.15	$0.6
gpt-oss-20B	$0.05	$0.2
LFM2 24B A2B	$0.03	$0.12
Gemma 4 31B	$0.2	$0.5
Gemma 3n E4B Instruct	$0.06	$0.12
Qwen3.5 9B	$0.1	$0.15
GLM-5	$1	$3.2
Qwen3-Coder-Next	$0.5	$1.2
MiniMax M2.5	$0.3	$1.2

Groq vs Together AI: which is cheaper?

Groq

Together AI

Groq or Together AI: which is cheaper for YOUR app?

Other comparisons