AI Cost Estimator

Five tabs: Single, Compare, Batch, Scaling, and Reference. Covers 24+ models with 2026 pricing and includes a token estimator (English, code, multilingual), break-even analysis between models, monthly volume scaling, multi-currency support (USD, EUR, GBP, TRY), and workload presets.

Prices are approximate. Last updated 2026-04-01.

Model & workload

Pick a provider, choose a model, and dial in tokens.

Provider

Model

Input tokens

Roughly 4 characters per English token.
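The 4-characters-per-token rule can be sketched in a few lines of Python. This is only a rough illustration of the heuristic stated above, not the estimator this tool uses; the function name `estimate_tokens` and the constant are hypothetical, and real tokenizers will give different counts, especially for code and non-English text.

```python
import math

# Assumption taken from the hint above: about 4 characters per English token.
CHARS_PER_TOKEN_ENGLISH = 4.0


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN_ENGLISH) -> int:
    """Return a rough token count for text, rounded up to a whole token."""
    if chars_per_token <= 0:
        raise ValueError("chars_per_token must be positive")
    return math.ceil(len(text) / chars_per_token)


if __name__ == "__main__":
    sample = "Summarise the following support ticket in two sentences."
    print(f"{len(sample)} characters, about {estimate_tokens(sample)} tokens")
```

Passing a smaller `chars_per_token` is one way to approximate denser content such as code or multilingual text, though the right ratio for those cases is not given here and would need to be measured.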

Output tokens

Output tokens usually cost 4–5× as much as input tokens.

Cached input tokens

Cached tokens are billed at the prompt-cache rate (often ~90% below the standard input rate).

Number of requests

Batch API: ~50% off for non-time-sensitive workloads.
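As a sketch of how the fields above might combine into one estimate, the following Python computes a total from input, output, and cached token counts, the number of requests, and an optional batch discount. The `ModelPrice` type, the `estimate_cost` function, and the prices in the example are all hypothetical placeholders, not real provider rates, and the sketch assumes cached tokens are entered separately from (not as a subset of) input tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """Prices in USD per million tokens (placeholder values, not real rates)."""
    input_per_mtok: float
    output_per_mtok: float
    cached_input_per_mtok: float


def estimate_cost(
    price: ModelPrice,
    input_tokens: int,
    output_tokens: int,
    cached_tokens: int = 0,
    requests: int = 1,
    batch: bool = False,
    batch_discount: float = 0.5,  # assumed ~50% off, per the hint above
) -> float:
    """Estimate total USD cost across all requests."""
    per_request = (
        input_tokens * price.input_per_mtok
        + cached_tokens * price.cached_input_per_mtok
        + output_tokens * price.output_per_mtok
    ) / 1_000_000
    total = per_request * requests
    if batch:
        total *= 1 - batch_discount
    return total


if __name__ == "__main__":
    # Hypothetical model: output 5x input, cached input at 90% off.
    demo = ModelPrice(input_per_mtok=1.0, output_per_mtok=5.0, cached_input_per_mtok=0.1)
    standard = estimate_cost(demo, 2_000, 500, cached_tokens=8_000, requests=1_000)
    batched = estimate_cost(demo, 2_000, 500, cached_tokens=8_000, requests=1_000, batch=True)
    print(f"standard: ${standard:.2f}, batch: ${batched:.2f}")
```

Keeping the discount as a parameter rather than a constant reflects that the 50% figure is described only as typical.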

Workload presets

Estimate tokens from text

Cost-saving tips

Use cached inputs: Anthropic prompt caching cuts the cost of repeated input context by roughly 90%.
Route routine tasks to smaller models (Haiku, GPT-5 Mini, Gemini Flash).
Batch API is typically 50% off for non-time-sensitive workloads.
Output tokens cost 4–5× as much as input tokens, so keep responses concise and use structured formats.
Stream responses when the UI can start rendering from the first tokens; this improves perceived latency rather than lowering the token bill.
Condense long context with summarisation passes instead of sending everything in full.
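To get a feel for how much each tip might save, the sketch below compares a baseline monthly bill with three variants: caching most of the input, using batch pricing, and halving the output length. All prices, token counts, and the function name `monthly_cost` are hypothetical and chosen only so that output costs 5× input; the 90% cache discount and 50% batch discount are the approximate figures from the tips above.

```python
def monthly_cost(
    input_tokens: int,
    output_tokens: int,
    requests: int,
    input_price: float,   # USD per million input tokens (placeholder)
    output_price: float,  # USD per million output tokens (placeholder)
    cached_fraction: float = 0.0,  # share of input served from the prompt cache
    cache_discount: float = 0.9,   # assumed ~90% off cached input
    batch_discount: float = 0.0,   # set to 0.5 for an assumed ~50% batch discount
) -> float:
    """Estimate monthly USD cost for a fixed per-request token profile."""
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    per_request = (
        fresh * input_price
        + cached * input_price * (1 - cache_discount)
        + output_tokens * output_price
    ) / 1_000_000
    return per_request * requests * (1 - batch_discount)


if __name__ == "__main__":
    base = dict(input_tokens=10_000, output_tokens=1_000, requests=100_000,
                input_price=1.0, output_price=5.0)
    scenarios = {
        "baseline": monthly_cost(**base),
        "80% of input cached": monthly_cost(**base, cached_fraction=0.8),
        "batch pricing": monthly_cost(**base, batch_discount=0.5),
        "output halved": monthly_cost(**{**base, "output_tokens": 500}),
    }
    for name, cost in scenarios.items():
        print(f"{name:>20}: ${cost:,.2f}")
```

Which tip wins depends on the workload: caching helps most when the prompt is long and repetitive, while trimming output matters more when responses dominate the token count.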