Comparing each provider's cheapest model, DeepSeek wins on raw output price: deepseek-v4-flash costs $0.28 per 1M output tokens versus $1.50 for Gemini 3.1 Flash-Lite. But "cheaper per token" rarely means "cheaper for your app". A model that needs fewer retries, shorter prompts, or less reasoning can cost less in practice even at a higher sticker rate.
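To make the retry point concrete, here is a minimal sketch that compares output spend per usable result rather than per token. The two output prices come from this page; the token counts and attempt counts are hypothetical numbers chosen only for illustration, and the function name is ours, not part of any tool.

```python
def output_cost_per_success(out_price_per_1m: float,
                            out_tokens_per_attempt: int,
                            attempts_per_success: float) -> float:
    """USD spent on output tokens to get one usable result."""
    per_attempt = out_tokens_per_attempt * out_price_per_1m / 1_000_000
    return per_attempt * attempts_per_success


# Output prices from this page (USD per 1M output tokens).
DEEPSEEK_V4_FLASH_OUT = 0.28
GEMINI_31_FLASH_LITE_OUT = 1.50

# Hypothetical workload: the cheaper model emits longer answers and needs
# two attempts on average; the pricier one is terse and succeeds first time.
cheap = output_cost_per_success(DEEPSEEK_V4_FLASH_OUT, 3_000, 2.0)
pricier = output_cost_per_success(GEMINI_31_FLASH_LITE_OUT, 800, 1.0)

print(f"deepseek-v4-flash:     ${cheap:.5f} per usable result")
print(f"Gemini 3.1 Flash-Lite: ${pricier:.5f} per usable result")
```

Under these made-up numbers the model with the higher sticker rate comes out cheaper per usable result. With different token counts or retry rates the ranking flips, which is why the inputs have to come from your own traffic.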
Last verified June 14, 2026

| Model | Input (USD / 1M tokens) | Output (USD / 1M tokens) |
|---|---|---|
| Gemini 3.5 Flash | $1.5 | $9 |
| Gemini 2.5 Pro (<=200k) | $1.25 | $10 |
| Gemini 2.5 Pro (>200k) | $2.5 | $15 |
| Gemini 2.5 Flash | $0.3 (text/image/video) | $2.5 |
| Gemini 3.1 Flash-Lite | $0.25 (text/image/video) | $1.5 |
| Gemini 3.1 Pro Preview (<=200k) | $2 | $12 (incl. thinking) |
| Gemini 3.1 Pro Preview (>200k) | $4 | $18 (incl. thinking) |
| Gemini 3 Flash Preview | $0.5 (text/image/video) | $3 |
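The tiered rows in the table can be turned into a per-request cost with a small lookup. This sketch uses the Gemini 2.5 Pro rates listed above and assumes the 200k threshold refers to prompt size and that the selected tier applies to the whole request; check the provider's pricing page before relying on that. The function and constant names are illustrative.

```python
TIER_LIMIT = 200_000  # prompt-token threshold from the table (assumed to be prompt size)

# (input USD / 1M tokens, output USD / 1M tokens) for Gemini 2.5 Pro, from the table.
RATES = {
    "up_to_200k": (1.25, 10.0),
    "over_200k": (2.50, 15.0),
}


def gemini_25_pro_cost(prompt_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request under the assumed tier rule."""
    tier = "up_to_200k" if prompt_tokens <= TIER_LIMIT else "over_200k"
    in_rate, out_rate = RATES[tier]
    return (prompt_tokens * in_rate + output_tokens * out_rate) / 1_000_000


# Same output size, different prompt sizes: crossing the threshold
# raises both the input and the output rate.
print(f"${gemini_25_pro_cost(150_000, 2_000):.4f}")
print(f"${gemini_25_pro_cost(250_000, 2_000):.4f}")
```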
Per-token rates alone can't tell you which model is cheaper for your app. The winner depends on how your code uses each model. PrePrice scans your project and computes the real per-user cost either way, plus what to charge so you clear your margin.
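As a rough illustration of the arithmetic behind "what to charge", the sketch below converts a per-user cost into a price that hits a target gross margin, using the standard relation price = cost / (1 - margin). This is not PrePrice's method, which this page does not describe; the request volume, per-request cost, and margin are hypothetical.

```python
def price_for_margin(cost_per_user: float, target_margin: float) -> float:
    """Price at which (price - cost) / price equals target_margin."""
    if not 0 <= target_margin < 1:
        raise ValueError("target_margin must be in [0, 1)")
    return cost_per_user / (1 - target_margin)


# Hypothetical inputs: 200 requests per user per month at $0.0012 each.
monthly_cost_per_user = 200 * 0.0012
price = price_for_margin(monthly_cost_per_user, 0.80)

print(f"cost per user:  ${monthly_cost_per_user:.2f} / month")
print(f"price for 80%:  ${price:.2f} / month")
```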