Google pricing
Gemini 3.0 Flash Cost Calculator
Gemini 3.0 Flash balances speed and capability with a 1M-token context window. It is cheaper than 2.5 Flash and posts stronger benchmark results.
Input
$0.25
per 1M tokens
Output
$2.00
per 1M tokens
Context window
1000K
tokens
Released
2026-03
Cutoff 2026-01
≈ Estimated tokenizer · $0.25 in · $2.00 out (per 1M tokens)
Total cost per call (1000 input + 500 output tokens): $0.001250
Input: $0.000250
Output: $0.001000
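The per-call figures above follow from the per-1M prices by simple proportion. Here is a minimal Python sketch, assuming the 1000 input and 500 output tokens used elsewhere on this page. The function name `call_cost` is illustrative and not part of any Google SDK.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from this page.
INPUT_PRICE_PER_M = Decimal("0.25")
OUTPUT_PRICE_PER_M = Decimal("2.00")
TOKENS_PER_M = Decimal(1_000_000)


def call_cost(input_tokens: int, output_tokens: int) -> dict[str, Decimal]:
    """Return the input, output, and total cost of one call in USD."""
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# Example call size assumed from this page: 1000 input + 500 output tokens.
cost = call_cost(1000, 500)
print(f"Input ${cost['input']:.6f}  Output ${cost['output']:.6f}  Total ${cost['total']:.6f}")
```

`Decimal` is used here only to keep the small dollar amounts exact; plain floats would also do for a rough estimate.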
Cost comparison
| Mode | Cost per call | Savings |
|---|---|---|
| Standard | $0.001250 | baseline |
| With caching | $0.001156 | 8% |
| With batch | $0.000625 | 50% |
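This comparison can be reproduced from the cached and batch rates listed in the pricing breakdown on this page. The sketch below is illustrative only: the helper names are hypothetical, and the 50% cached share is an assumption inferred from the $0.001156 figure, since the page does not state what share of the input it treats as cached.

```python
# USD per 1M tokens, from the pricing breakdown on this page.
PRICES = {
    "input": 0.25,
    "output": 2.00,
    "cached_input": 0.0625,
    "batch_input": 0.125,
    "batch_output": 1.00,
}
M = 1_000_000


def standard_cost(inp: int, out: int) -> float:
    return (inp * PRICES["input"] + out * PRICES["output"]) / M


def cached_cost(inp: int, out: int, cached_fraction: float) -> float:
    """Bill cached_fraction of the input at the cached rate, the rest at the standard rate."""
    cached = inp * cached_fraction
    fresh = inp - cached
    return (cached * PRICES["cached_input"] + fresh * PRICES["input"]
            + out * PRICES["output"]) / M


def batch_cost(inp: int, out: int) -> float:
    return (inp * PRICES["batch_input"] + out * PRICES["batch_output"]) / M


def savings_pct(base: float, discounted: float) -> float:
    return 100 * (1 - discounted / base)


base = standard_cost(1000, 500)
# ASSUMPTION: half of the input tokens are served from the cache.
with_cache = cached_cost(1000, 500, cached_fraction=0.5)
with_batch = batch_cost(1000, 500)

print(f"Standard     ${base:.6f}")
print(f"With caching ${with_cache:.6f}  (saves {savings_pct(base, with_cache):.1f}%)")
print(f"With batch   ${with_batch:.6f}  (saves {savings_pct(base, with_batch):.1f}%)")
```

Under that assumption caching saves 7.5%, which matches the page's rounded 8%. If every input token were cached, the same call would cost $0.0010625.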
Detailed pricing
Gemini 3.0 Flash pricing breakdown
All pricing dimensions including caching and batch discounts.
| Type | Price (per 1M tokens) |
|---|---|
| Input | $0.2500 |
| Output | $2.0000 |
| Cached input | $0.0625 |
| Batch input | $0.1250 |
| Batch output | $1.0000 |
Last verified 2026-05-01 · Google official pricing
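The discounts implied by this table can be derived directly from the listed rates. Here is a short sketch; the dictionary keys are illustrative labels chosen for this example, not API field names.

```python
# USD per 1M tokens, copied from the table above.
rates = {
    "input": 0.25,
    "output": 2.00,
    "cached_input": 0.0625,
    "batch_input": 0.125,
    "batch_output": 1.00,
}


def discount(discounted: float, standard: float) -> float:
    """Fractional discount of a rate relative to the standard rate."""
    return 1 - discounted / standard


discounts = {
    "cached input": discount(rates["cached_input"], rates["input"]),
    "batch input": discount(rates["batch_input"], rates["input"]),
    "batch output": discount(rates["batch_output"], rates["output"]),
}

for label, value in discounts.items():
    print(f"{label}: {value:.0%} off the standard rate")
```

By these rates, cached input is 75% cheaper than standard input, and batch pricing halves both the input and output rates.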
How it compares
Gemini 3.0 Flash vs alternatives
Single-call cost (1000 input + 500 output tokens), ranked from cheapest.
| Model | Provider | Per call |
|---|---|---|
| GPT-5 mini | OpenAI | $0.000600 |
| DeepSeek V4 | DeepSeek | $0.000900 |
| Gemini 3.0 Flash (this page) | Google | $0.001250 |
| o4-mini | OpenAI | $0.002700 |
| Claude Haiku 4.5 | Anthropic | $0.003500 |
| Mistral Large 3 | Mistral | $0.006250 |
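The per-call figures in this table follow from each model's per-1M prices, which are listed under "Other models you might consider" on this page. The sketch below recomputes and ranks them; `per_call` is a hypothetical helper, and the prices are taken from this page rather than from any vendor API.

```python
# (input, output) prices in USD per 1M tokens, as listed on this page.
MODELS = {
    "Gemini 3.0 Flash": (0.25, 2.00),
    "GPT-5 mini": (0.20, 0.80),
    "DeepSeek V4": (0.30, 1.20),
    "o4-mini": (0.90, 3.60),
    "Claude Haiku 4.5": (1.00, 5.00),
    "Mistral Large 3": (2.50, 7.50),
}


def per_call(prices: tuple[float, float], input_tokens: int = 1000,
             output_tokens: int = 500) -> float:
    """Cost in USD of one call at the given per-1M prices."""
    in_price, out_price = prices
    return (in_price * input_tokens + out_price * output_tokens) / 1_000_000


# Sort model names by their single-call cost, cheapest first.
ranked = sorted(MODELS, key=lambda name: per_call(MODELS[name]))

for name in ranked:
    print(f"{name:<18} ${per_call(MODELS[name]):.6f}")
```

Changing `input_tokens` and `output_tokens` can reorder the ranking, because the models differ in their input-to-output price ratio.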
Recommended use
When to choose Gemini 3.0 Flash
Gemini 3.0 Flash shines for general-purpose tasks, image and document understanding, high-throughput and low-latency workloads, and audio understanding and generation. Token counts on this page are estimates and may be off by roughly 10-20%.
✓ Context window of 1000K tokens handles entire codebases or book-length documents.
✓ Prompt caching available: cached input is billed at $0.0625 per 1M tokens, 75% below the standard input rate, which helps with repeated system prompts.
✓ Batch API support for non-realtime workloads at ~50% discount.
✓ Tool / function calling supported.
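Because cost is linear in token count, the ~10-20% token-estimate margin mentioned above carries over directly to the cost estimate. This rough sketch assumes the margin applies equally to input and output counts, since the page does not say how it is distributed. `cost_range` is a hypothetical helper.

```python
INPUT_PRICE = 0.25   # USD per 1M input tokens (this page)
OUTPUT_PRICE = 2.00  # USD per 1M output tokens (this page)


def cost_range(input_tokens: int, output_tokens: int,
               margin: float) -> tuple[float, float, float]:
    """Return (low, estimate, high) cost in USD for a symmetric token-count margin."""
    estimate = (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000
    # ASSUMPTION: the same relative margin applies to input and output tokens.
    return estimate * (1 - margin), estimate, estimate * (1 + margin)


for margin in (0.10, 0.20):
    low, est, high = cost_range(1000, 500, margin)
    print(f"margin {margin:.0%}: ${low:.6f} to ${high:.6f} (estimate ${est:.6f})")
```

For the 1000 input + 500 output example, a 20% margin puts the true cost between about $0.0010 and $0.0015 per call.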
FAQ
Frequently asked questions
Gemini 3.0 Flash costs $0.25 per 1M input tokens and $2.00 per 1M output tokens. A typical chat call (1000 input + 500 output tokens) costs approximately $0.0013. Use the calculator above to estimate your specific use case.
Related
Other models you might consider
OpenAI
GPT-5 mini
OpenAI's affordable everyday workhorse
$0.20 in · $0.80 out
DeepSeek
DeepSeek V4
DeepSeek's open MoE flagship — top open-weight performer
$0.30 in · $1.20 out
OpenAI
o4-mini
OpenAI's reasoning model — cheaper than o3
$0.90 in · $3.60 out
Anthropic
Claude Haiku 4.5
Anthropic's fast affordable workhorse
$1.00 in · $5.00 out
Mistral
Mistral Large 3
Mistral's flagship multilingual open-weight model
$2.50 in · $7.50 out