Complete pricing for GPT-4o, o3, o1, and GPT-3.5 Turbo β with real cost scenarios for apps, chatbots, coding assistants, and AI agents.
OpenAI charges per million tokens (MTok). Input tokens (your prompt) and output tokens (the response) are priced separately. Output tokens are typically 3-4x more expensive.
| Model | Input (per MTok) | Output (per MTok) | Context | Best For |
|---|---|---|---|---|
| GPT-4o Popular | $2.50 | $10.00 | 128K | Most production AI features, vision, complex tasks |
| GPT-4o mini Budget | $0.15 | $0.60 | 128K | High-volume tasks, classification, simple Q&A |
| o3 Reasoning | $10.00 | $40.00 | 200K | Complex math, coding, scientific reasoning |
| o3-mini | $1.10 | $4.40 | 200K | Faster reasoning, coding tasks, cost-efficient |
| o1 | $15.00 | $60.00 | 200K | Deep reasoning, research-grade analysis |
| o1-mini | $3.00 | $12.00 | 128K | STEM reasoning, coding at lower cost |
| GPT-4 Turbo | $10.00 | $30.00 | 128K | Legacy workflows still on GPT-4 |
| GPT-3.5 Turbo Legacy | $0.50 | $1.50 | 16K | Simple tasks, legacy integrations |
The OpenAI Batch API processes requests asynchronously with results in under 24 hours at half the standard price. Ideal for large-scale processing jobs, dataset analysis, embeddings generation, and classification tasks.
| Model | Batch Input (per MTok) | Batch Output (per MTok) | Savings vs Standard |
|---|---|---|---|
| GPT-4o (Batch) | $1.25 | $5.00 | 50% off |
| GPT-4o mini (Batch) | $0.075 | $0.30 | 50% off |
| o3-mini (Batch) | $0.55 | $2.20 | 50% off |
| GPT-3.5 Turbo (Batch) | $0.25 | $0.75 | 50% off |
| Model / Feature | Price | Notes |
|---|---|---|
| text-embedding-3-small | $0.02/MTok | Best for search, RAG, semantic similarity |
| text-embedding-3-large | $0.13/MTok | Higher accuracy, larger dimensions |
| text-embedding-ada-002 | $0.10/MTok | Legacy model, use 3-small instead |
| Whisper (Speech-to-Text) | $0.006/min | Audio transcription and translation |
| TTS (Text-to-Speech) | $15/MTok (chars) | 6 voices, HD option at $30/MTok |
| DALLΒ·E 3 | $0.04β$0.12/image | Varies by quality (standard/HD) and size |
| Use Case | Recommended Model | Why |
|---|---|---|
| High-volume chatbot, Q&A, extraction | GPT-4o mini | 17x cheaper than GPT-4o, handles most tasks well |
| Complex reasoning, multi-step agents | GPT-4o | Best capability-to-cost ratio for demanding tasks |
| Math, science, competitive coding | o3-mini | Purpose-built for reasoning, much cheaper than o3 |
| Research-grade analysis, PhD-level tasks | o3 | Highest capability, justify cost for precision-critical work |
| Large-scale offline batch processing | GPT-4o mini (Batch) | 50% cheaper with async API β best for ETL, classification |
| Semantic search, RAG retrieval | text-embedding-3-small | Best accuracy per dollar, 5x cheaper than ada-002 |
| Voice apps, transcription | Whisper + TTS | $0.006/min audio + $15/MTok for speech synthesis |
OpenAI has consistently reduced prices as efficiency improves. Prices have dropped by 95%+ since GPT-4 launched in 2023.
| Date | Model / Change | Impact |
|---|---|---|
| Mar 2023 | GPT-4 launches | $30/$60 per MTok β most expensive model at launch |
| Nov 2023 | GPT-4 Turbo launches | $10/$30 per MTok β 67% cheaper than original GPT-4 |
| Jan 2024 | GPT-3.5 Turbo price cut | Input cut from $1.00 to $0.50 per MTok (50% off) |
| May 2024 | GPT-4o launches | $2.50/$10 per MTok β 75% cheaper than GPT-4 Turbo |
| Jul 2024 | GPT-4o mini launches | $0.15/$0.60 β replaces GPT-3.5 Turbo at lower price with better capability |
| Sep 2024 | o1 launches | $15/$60 per MTok β premium reasoning model, new category |
| Apr 2025 | o3 launches | $10/$40 per MTok β most capable model, better value than o1 |
| Ongoing | Prompt Caching (all models) | 50% off on cached input prefix β automatic discount for long prompts |
Both OpenAI and Anthropic offer competitive API pricing. Here's a direct comparison of flagship models:
| Model | Input (per MTok) | Output (per MTok) | Context |
|---|---|---|---|
| GPT-4o (OpenAI) | $2.50 | $10.00 | 128K |
| Claude 3.5 Sonnet (Anthropic) | $3.00 | $15.00 | 200K |
| GPT-4o mini (OpenAI) | $0.15 | $0.60 | 128K |
| Claude 3.5 Haiku (Anthropic) | $0.80 | $4.00 | 200K |
| o3 (OpenAI) | $10.00 | $40.00 | 200K |
| Claude 3 Opus (Anthropic) | $15.00 | $75.00 | 200K |
For a full comparison including benchmark performance and real-world use cases, see the OpenAI vs Claude API pricing comparison.
OpenAI has changed prices 8+ times since 2023. Set up instant alerts so you're never caught off guard during budget planning.
Set Up Price Alerts β Free Free API AccessRelated Reading