AI API Pricing Calculator
Estimate costs across Google Gemini, OpenAI, xAI Grok & Anthropic Claude. Supports text, images, audio & video.
⚡ All Models Compared (Sorted by Cost)
| Provider | Model | Total Tokens | Per Request | Daily | Monthly | + Caching | + Batch |
|---|---|---|---|---|---|---|---|
Frequently Asked Questions
Costs vary significantly by provider and model. Budget models such as Gemini 2.5 Flash-Lite cost $0.10 per 1M input tokens, while flagship models such as GPT-5.5 Pro cost $5.00 per 1M input tokens, a 50× difference between these two. Use the calculator above to compare exact costs for your specific use case.
Image token consumption varies by provider. Google Gemini uses 258 tokens per image tile (768×768 px). OpenAI charges 85 tokens for a low-detail image and 765 or more tokens for a high-detail image. Anthropic Claude estimates roughly (width × height) / 750 tokens per image, with width and height in pixels.
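As a rough illustration of the image figures above, the sketch below estimates tokens per image. The function names are hypothetical, and the Gemini tile count is an assumption (a simple grid of 768×768 tiles covering the image); the provider's actual resizing and cropping rules may differ. The OpenAI high-detail formula is not given here, so only the low-detail constant is included.

```python
import math

# Figures quoted in the text above.
GEMINI_TILE_PX = 768
GEMINI_TOKENS_PER_TILE = 258
OPENAI_LOW_DETAIL_TOKENS = 85


def gemini_image_tokens(width: int, height: int) -> int:
    """Estimate Gemini image tokens.

    Assumption: the image is covered by a grid of 768x768 tiles,
    each costing 258 tokens. Real preprocessing may differ.
    """
    tiles_x = math.ceil(width / GEMINI_TILE_PX)
    tiles_y = math.ceil(height / GEMINI_TILE_PX)
    return tiles_x * tiles_y * GEMINI_TOKENS_PER_TILE


def claude_image_tokens(width: int, height: int) -> int:
    """Estimate Claude image tokens as (width * height) / 750."""
    return round(width * height / 750)


print(gemini_image_tokens(1536, 768))  # 2 tiles -> 516
print(claude_image_tokens(1000, 750))  # 750000 / 750 -> 1000
```

Treat the results as planning estimates; the token counts reported by each API are the billing source of truth.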
Google Gemini processes audio at 32 tokens per second and video at 263 tokens per second. Most other providers (OpenAI, Claude, Grok) do not natively support audio or video input through their standard text APIs; instead, they use separate endpoints with per-minute pricing.
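The per-second rates above translate into token counts as in the following minimal sketch. The function name is hypothetical, and it assumes audio and video durations are metered separately and simply added together.

```python
# Per-second rates quoted in the text above.
GEMINI_AUDIO_TOKENS_PER_SECOND = 32
GEMINI_VIDEO_TOKENS_PER_SECOND = 263


def gemini_media_tokens(audio_seconds: float = 0.0, video_seconds: float = 0.0) -> int:
    """Estimate Gemini tokens for audio and video input.

    Assumption: audio and video are counted independently and summed.
    """
    tokens = (
        audio_seconds * GEMINI_AUDIO_TOKENS_PER_SECOND
        + video_seconds * GEMINI_VIDEO_TOKENS_PER_SECOND
    )
    return round(tokens)


print(gemini_media_tokens(audio_seconds=60))  # one minute of audio -> 1920
print(gemini_media_tokens(video_seconds=60))  # one minute of video -> 15780
```

At these rates, an hour of video is about 946,800 tokens, so long recordings can approach or exceed a model's context window.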
For high-volume simple tasks, Gemini 2.5 Flash-Lite ($0.10/1M input) and GPT-4.1 Nano ($0.10/1M input) are the cheapest options. Grok 4.1 Fast, at $0.20/1M input, is also competitive and offers a 2M-token context window.
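To see what these input prices mean at volume, the sketch below projects monthly input spend and sorts the three models by cost. The workload (1,000 input tokens per request, 100,000 requests per day, 30 days) is an illustrative assumption, as is the function name; output tokens are ignored because only input prices are quoted here.

```python
# Input prices in USD per 1M tokens, as quoted in the text above.
INPUT_PRICE_PER_M = {
    "Gemini 2.5 Flash-Lite": 0.10,
    "GPT-4.1 Nano": 0.10,
    "Grok 4.1 Fast": 0.20,
}


def monthly_input_cost(tokens_per_request: int, requests_per_day: int,
                       price_per_m: float, days: int = 30) -> float:
    """Project monthly input spend in USD (input tokens only)."""
    total_tokens = tokens_per_request * requests_per_day * days
    return total_tokens * price_per_m / 1_000_000


# Hypothetical workload: 1,000 input tokens x 100,000 requests/day.
costs = {
    model: monthly_input_cost(1_000, 100_000, price)
    for model, price in INPUT_PRICE_PER_M.items()
}
for model, cost in sorted(costs.items(), key=lambda item: item[1]):
    print(f"{model}: ${cost:,.2f}/month")
```

Under these assumptions the workload consumes 3 billion input tokens a month, which comes to about $300 at $0.10/1M and about $600 at $0.20/1M.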
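Use Prompt Caching to save up to 90% on repeated input context, and the Batch API (available on most providers) to save 50% on non-urgent workloads. Where both apply, the savings compound: cached input billed at 10% of the list price and then halved by the batch discount costs 5% of the list price, a reduction of up to 95% on that portion of the bill.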
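The arithmetic behind the combined figure can be sketched as follows. The function name and parameters are hypothetical, and the code assumes the two discounts stack multiplicatively and that a provider allows both on the same request, which should be checked against each provider's documentation.

```python
def effective_input_cost(base_cost: float, cached_fraction: float,
                         cache_discount: float = 0.90,
                         batch_discount: float = 0.50,
                         use_batch: bool = True) -> float:
    """Estimate input cost after caching and batch discounts.

    Assumptions: `cached_fraction` of the input is served from cache at
    (1 - cache_discount) of list price, and the batch discount then
    applies to the whole remaining amount.
    """
    cached_part = base_cost * cached_fraction * (1 - cache_discount)
    uncached_part = base_cost * (1 - cached_fraction)
    total = cached_part + uncached_part
    if use_batch:
        total *= 1 - batch_discount
    return total


# $100 of list-price input, fully cached and batched: about $5 (95% saved).
print(round(effective_input_cost(100.0, cached_fraction=1.0), 2))
# Same input with only half of it cached: about $27.50.
print(round(effective_input_cost(100.0, cached_fraction=0.5), 2))
```

The 95% ceiling is reached only when all input is cached; output tokens and uncached input see at most the 50% batch discount.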
For coding, GPT-4.1 and Gemini 3.1 Pro offer the best balance of capability and cost. For maximum reasoning, o3-Pro ($20 input / $80 output per 1M tokens) and Claude Opus 4.7 ($5 input / $25 output per 1M tokens) are top choices, though significantly more expensive.
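Because input and output tokens are priced separately, per-request cost depends on the mix of the two. The sketch below is a minimal illustration, reading the paired prices above as input/output per 1M tokens; the function name and the sample request size (10,000 input tokens, 2,000 output tokens) are assumptions.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Cost in USD of one request, given per-1M-token prices."""
    return (
        input_tokens * input_price_per_m
        + output_tokens * output_price_per_m
    ) / 1_000_000


# Hypothetical request: 10,000 input tokens and 2,000 output tokens.
o3_pro = request_cost(10_000, 2_000, 20.0, 80.0)
claude_opus = request_cost(10_000, 2_000, 5.0, 25.0)
print(f"o3-Pro: ${o3_pro:.2f}")                  # $0.36
print(f"Claude Opus 4.7: ${claude_opus:.2f}")    # $0.10
```

For this request shape, output tokens account for nearly half of the o3-Pro cost despite being a sixth of the token count, so response length is worth budgeting for on reasoning-heavy models.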