Name: Groq API
Brand: Groq
Price: 0.3 USD

Question 1

What is the cheapest Groq model?

Accepted Answer

By output-token price, Llama 3.1 8B Instant is currently the cheapest Groq model, at $0.05 per 1M input tokens and $0.08 per 1M output tokens. The cheapest model per token is not always the cheapest per finished task, because a weaker model can need more retries or longer prompts.
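To make the per-token versus per-task distinction concrete, here is a minimal sketch. The rates come from the pricing table on this page, but the token counts and success rates are invented for illustration, and the function name `cost_per_finished_task` is hypothetical. It assumes each attempt succeeds independently, so the expected number of attempts is 1 / success rate.

```python
def cost_per_finished_task(input_tokens, output_tokens,
                           input_rate, output_rate, success_rate):
    """Expected USD cost to obtain one successful result.

    Rates are USD per 1M tokens. Assumes each attempt succeeds
    independently with probability success_rate, so the expected
    number of attempts is 1 / success_rate.
    """
    cost_per_attempt = (input_tokens * input_rate
                        + output_tokens * output_rate) / 1_000_000
    return cost_per_attempt / success_rate


# Rates are from the table on this page. Token counts and success rates
# are ASSUMED values, not measurements.
small = cost_per_finished_task(3_000, 400, 0.05, 0.08, success_rate=0.60)   # Llama 3.1 8B Instant
large = cost_per_finished_task(1_000, 400, 0.075, 0.30, success_rate=0.95)  # GPT OSS 20B

print(f"Llama 3.1 8B Instant: ${small:.6f} per finished task")
print(f"GPT OSS 20B:          ${large:.6f} per finished task")
```

Under these assumed numbers the model with the lower per-token rate ends up costing more per finished task. With your own measured prompt lengths and success rates the comparison may go the other way.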

Question 2

How is Groq billed?

Accepted Answer

Groq is pay-as-you-go and billed per token: you pay for input tokens (your prompt and context) and output tokens (the response) separately, priced per million tokens. There is no monthly base fee for the API itself.
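As a rough sketch of that arithmetic, the cost of one request is input tokens times the input rate plus output tokens times the output rate, each divided by one million. The rates below are the Llama 3.1 8B Instant figures from this page. The function name `request_cost` and the example token counts are assumptions for illustration.

```python
from decimal import Decimal

# Llama 3.1 8B Instant rates from this page, in USD per 1M tokens.
INPUT_PER_M = Decimal("0.05")
OUTPUT_PER_M = Decimal("0.08")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int,
                 input_rate: Decimal = INPUT_PER_M,
                 output_rate: Decimal = OUTPUT_PER_M) -> Decimal:
    """Return the USD cost of one request, pricing input and output separately."""
    input_cost = Decimal(input_tokens) * input_rate / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * output_rate / TOKENS_PER_M
    return input_cost + output_cost


# Hypothetical request: a 2,000-token prompt and a 500-token response.
cost = request_cost(2_000, 500)
print(f"${cost} per request")
```

`Decimal` is used here so that very small per-request amounts do not pick up floating-point rounding noise when summed over many requests.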

Question 3

Why is my Groq bill higher than expected?

Accepted Answer

The per-token rate is only one of the four numbers that set your bill. The others are how many tokens each call uses, how often calls are made, and across how many users. Long prompts, retries, multi-step agents, and uncached repeated context all increase the token volume that the rate is applied to. PrePrice models that usage from your code so the number is real, not a guess.
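The way those four numbers compound can be sketched as a simple projection. This is an illustration only, not how PrePrice computes its estimate. The function name `monthly_cost`, the `retry_rate` parameter, and every usage figure below are assumptions; only the two rates come from this page.

```python
def monthly_cost(input_tokens, output_tokens, input_rate, output_rate,
                 calls_per_user_per_day, users, retry_rate=0.0, days=30):
    """Project a monthly bill in USD.

    Rates are USD per 1M tokens. retry_rate is the ASSUMED fraction of
    calls that are sent one extra time, so 0.25 means 25% more calls.
    """
    cost_per_call = (input_tokens * input_rate
                     + output_tokens * output_rate) / 1_000_000
    total_calls = calls_per_user_per_day * users * days * (1 + retry_rate)
    return cost_per_call * total_calls


# Llama 3.1 8B Instant rates from this page; all usage numbers are made up.
baseline = monthly_cost(2_000, 500, 0.05, 0.08,
                        calls_per_user_per_day=20, users=1_000)
with_retries = monthly_cost(2_000, 500, 0.05, 0.08,
                            calls_per_user_per_day=20, users=1_000,
                            retry_rate=0.25)

print(f"baseline:     ${baseline:.2f} per month")
print(f"with retries: ${with_retries:.2f} per month")
```

A request that costs a fraction of a cent still adds up once it is multiplied by calls, users, and days, and a 25% retry rate raises the projected total by the same 25%.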

Question 4

Where do these Groq prices come from?

Accepted Answer

From Groq's official pricing page (groq.com), verified June 14, 2026, and re-checked on a schedule. Pricing changes often, so confirm the live rate before you commit.

Model	Input (USD / 1M tokens)	Output (USD / 1M tokens)	Cache read (USD / 1M tokens)
GPT OSS 20B	$0.075	$0.30	—
GPT OSS Safeguard 20B	$0.075	$0.30	—
GPT OSS 120B	$0.15	$0.60	—
Llama 4 Scout (17Bx16E)	$0.11	$0.34	—
Qwen3 32B	$0.29	$0.59	—
Llama 3.3 70B Versatile	$0.59	$0.79	—
Llama 3.1 8B Instant	$0.05	$0.08	—
Kimi K2	—	$3.00	$0.50
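If you want to work with these figures programmatically, one possible sketch is to hold the table as records and rank models by output price. The `Price` type and the `cheapest_by_output` helper are hypothetical names. The values are copied from the table above, with `None` standing in for cells the table leaves blank.

```python
from typing import NamedTuple, Optional


class Price(NamedTuple):
    model: str
    input_per_m: Optional[float]       # USD per 1M input tokens
    output_per_m: Optional[float]      # USD per 1M output tokens
    cache_read_per_m: Optional[float]  # USD per 1M cached input tokens


# Copied from the table on this page; None marks a blank cell.
PRICES = [
    Price("GPT OSS 20B", 0.075, 0.30, None),
    Price("GPT OSS Safeguard 20B", 0.075, 0.30, None),
    Price("GPT OSS 120B", 0.15, 0.60, None),
    Price("Llama 4 Scout (17Bx16E)", 0.11, 0.34, None),
    Price("Qwen3 32B", 0.29, 0.59, None),
    Price("Llama 3.3 70B Versatile", 0.59, 0.79, None),
    Price("Llama 3.1 8B Instant", 0.05, 0.08, None),
    Price("Kimi K2", None, 3.00, 0.50),
]


def cheapest_by_output(prices):
    """Return the entry with the lowest listed output price, skipping blanks."""
    priced = [p for p in prices if p.output_per_m is not None]
    return min(priced, key=lambda p: p.output_per_m)


# Rank every model that has a listed output price, cheapest first.
ranked = sorted((p for p in PRICES if p.output_per_m is not None),
                key=lambda p: p.output_per_m)
for p in ranked:
    print(f"{p.model}: ${p.output_per_m:.2f} per 1M output tokens")

print("Cheapest by output price:", cheapest_by_output(PRICES).model)
```

Because prices change often, treat the hard-coded list as a snapshot and refresh it from the live pricing page before relying on the ranking.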
