AI / LLM · cost

How much does Cohere cost?

Cohere bills per token, split into input (your prompt) and output (the model's reply), priced per million tokens. Across the 6 models tracked here, output runs from Command R+ 04-2024 at $0.6 per 1M output tokens up to Command R+ 08-2024 at $10. Input is cheaper, and cached input cheaper still. The table below has every model's exact rates, verified June 14, 2026.

Last verified June 14, 2026Official Cohere pricing

Cohere pricing breakdown

Per-1M-token pricing by model.
ModelInput / 1MOutput / 1M
Command R+ 08-2024$2.5$10
Command R+ 04-2024$0.15$0.6
Command R 03-2024$0.5$1.5
Command$1$2
Command-light$0.3$0.6
Aya Expanse$0.5$1.5

Current page focuses on enterprise/Model Vault pricing. Per-token prices for legacy models are in the FAQ. Command R+ 08-2024 and Command R 03-2024 prices are listed in the FAQ. Rerank pricing is per search (query + up to 100 docs). Pay-as-you-go production API keys billed monthly or at $250 outstanding balance.

Source: cohere.com · Catalog 2026-06-14.2. Confirm the live rate before you commit.

What does Cohere cost inside your app?

The rates above are per unit. Your bill is those rates times how hard your code leans on Cohere, plus everything around it. PrePrice scans your project, finds where you call Cohere, and computes your real cost per user and what to charge.

Find your real cost — free

Cohere cost questions

What is the cheapest Cohere model?

By output-token price, Command R+ 04-2024 is currently the cheapest Cohere model at $0.6 per 1M output tokens and $0.15 per 1M input. The cheapest model per token is not always the cheapest per finished task, since a weaker model can need more retries or longer prompts.

How is Cohere billed?

Cohere is pay-as-you-go and billed per token: you pay for input tokens (your prompt and context) and output tokens (the response) separately, priced per million tokens. There is no monthly base fee for the API itself.

Why is my Cohere bill higher than expected?

The per-unit rate is only one of the four numbers that set your bill. The others are how much your app uses Cohere, how often, and across how many users. Long prompts, retries, multi-step agents, and uncached repeated context all multiply the rate. PrePrice models that usage from your code so the number is real, not a guess.

Where do these Cohere prices come from?

From Cohere's official pricing page (cohere.com), verified June 14, 2026 and re-checked on a schedule. Pricing changes often, so confirm the live rate before you commit.

Compare Cohere with other providers

See the full AI Cost Index or estimate your monthly bill.