Google Gemini API Pricing 2026

Complete pricing for Gemini 2.5 Pro, Flash, and 1.5 models โ€” with free tier details, real cost scenarios, and comparison to OpenAI and Claude.

๐Ÿ“Š
Monitored by PricePulse. Google has aggressively cut Gemini API prices and expanded free tier limits throughout 2024โ€“2026. We track every change automatically. Last verified: May 2026.

Gemini API Free Tier (Google AI Studio)

Google offers a generous free tier for the Gemini API through Google AI Studio. No credit card required to start.

Model Free Requests/Day Free RPM Limit Free TPM Limit
Gemini 2.5 Flash Popular 1,500/day 10 RPM 1M TPM
Gemini 2.5 Pro Most Capable 50/day 2 RPM 32K TPM
Gemini 1.5 Flash 1,500/day 15 RPM 1M TPM
Gemini 1.5 Pro 50/day 2 RPM 32K TPM
Gemini 1.0 Pro Unlimited 15 RPM 32K TPM
Prototyping tip: For most side projects and early-stage apps under 1,500 requests/day, Gemini 2.5 Flash is completely free. The 1M TPM limit is unusually generous โ€” you can process massive documents without hitting rate limits.

Gemini API Paid Pricing

Once you exceed free tier limits or need higher rate limits, billing is per million tokens. Enable billing in Google AI Studio or use Vertex AI.

Model Input (per MTok) Output (per MTok) Context Window
Gemini 2.5 Flash Popular
Text/image/audio/video
$0.075 $0.30 1M tokens
Gemini 2.5 Flash (Thinking)
Complex reasoning mode
$0.075 $3.50 (thinking tokens) 1M tokens
Gemini 2.5 Pro Most Capable
โ‰ค200K context
$1.25 $10.00 1M tokens
Gemini 2.5 Pro
>200K context
$2.50 $15.00 1M tokens
Gemini 1.5 Flash
โ‰ค128K context
$0.075 $0.30 1M tokens
Gemini 1.5 Pro
โ‰ค128K context
$1.25 $5.00 2M tokens
Gemini Embedding 004 $0.00 N/A 2K tokens/request
Context caching: For applications that repeatedly send the same large system prompt or documents, Gemini Context Caching reduces costs significantly. Cached tokens cost $0.01875/MTok (Flash) or $0.31/MTok (Pro) โ€” 4x cheaper than regular input pricing. Storage: $1.00/MTok/hour (Flash) or $4.50/MTok/hour (Pro).

Gemini API Cost Calculator

Model
Requests per month
Avg input tokens per request
Avg output tokens per request
Estimated monthly API cost $7.50

Real Cost Scenarios

Startup Chatbot โ€” 100K conversations/month (Gemini 2.5 Flash)
100K conversations ร— 600 input tokens avg 60M input tokens
100K conversations ร— 250 output tokens avg 25M output tokens
Flash: $0.075 input + $0.30 output per MTok โ€”
Total monthly API cost ~$12.00
Long Document Analysis โ€” 10K legal/research docs/month (Gemini 2.5 Pro)
10K docs ร— 50,000 tokens avg (long docs) 500M input tokens
10K docs ร— 2,000 output tokens (analysis) 20M output tokens
Pro >200K tier: $2.50 input + $15.00 output โ€”
Total monthly API cost ~$1,550
Early-Stage App (Free Tier) โ€” Under 1,500 req/day
44,000 requests/month (โ‰ˆ1,400/day) Within free limit
Any token amount within rate limits Free
Total monthly API cost $0.00
RAG Application โ€” 200K queries/month with caching (Flash)
System prompt: 5,000 tokens (cached for all requests) Cached input: $0.01875/MTok
200K queries ร— 5,000 cached tokens = 1B cached tokens $18.75
200K queries ร— 300 non-cached input + 400 output tokens $4.50 + $24.00
Total monthly API cost ~$47.25

Gemini vs OpenAI vs Claude API Pricing

Provider / Model Input (per MTok) Output (per MTok) Context
Gemini 2.5 Flash (Google) $0.075 $0.30 1M tokens
GPT-4o mini (OpenAI) $0.15 $0.60 128K
Claude 3.5 Haiku (Anthropic) $0.80 $4.00 200K
Gemini 2.5 Pro (Google) $1.25 $10.00 1M tokens
GPT-4o (OpenAI) $2.50 $10.00 128K
Claude 3.5 Sonnet (Anthropic) $3.00 $15.00 200K
o3 (OpenAI) $10.00 $40.00 200K
Claude 3 Opus (Anthropic) $15.00 $75.00 200K

Gemini Flash is the cheapest option for most tasks โ€” 2x cheaper than GPT-4o mini and 10x cheaper than Claude Haiku. For a full feature and benchmark comparison, see OpenAI vs Claude API pricing.

When to Choose Gemini API

Use Case Recommendation Reason
Side project / prototype Gemini 2.5 Flash (Free) 1,500 req/day free โ€” zero cost to ship v1
Cost-sensitive production app Gemini 2.5 Flash (Paid) $0.075/MTok is best price among major providers
Long document processing Gemini 2.5 Pro 1M token context window handles entire books/codebases
Multimodal (image + video + text) Gemini 2.5 Flash Native multimodal at same price as text-only tasks
Google Cloud / GCP integration Vertex AI (Gemini) Single billing, VPC, enterprise SLA, IAM integration
Enterprise compliance (EU data residency) Vertex AI + region selection Vertex AI supports regional data residency; AI Studio doesn't

Google Gemini API Price History

Date Change Impact
Dec 2023 Gemini Pro launches (API) Free in preview; first public Gemini API access
May 2024 Gemini 1.5 Flash launches $0.35/MTok input โ€” 10x cheaper than Gemini 1.5 Pro
Jul 2024 Gemini 1.5 Flash price cut Cut from $0.35 to $0.075/MTok input โ€” 79% reduction
Sep 2024 Free tier expanded 1,500 req/day free (was 60); 1M TPM free added
Feb 2025 Gemini 2.0 Flash launches Same price as 1.5 Flash, significantly better quality
Mar 2025 Gemini 2.5 Pro launches $1.25/MTok input โ€” top benchmark performance, competitive price
Apr 2025 Gemini 2.5 Flash launches Replaces 2.0 Flash; same price, 2.5 quality with thinking mode
Ongoing Context caching added Cached tokens 4x cheaper โ€” major savings for repeated prompts

Get Alerted When Google Changes Gemini API Prices

Google cut Gemini Flash prices by 79% in a single announcement. Set up instant alerts so you always know when to renegotiate or switch models.

Set Up Price Alerts โ€” Free Free API Access

Related Reading

Related Pricing Pages