$2.50 in / $15.00 out
- Context: 1.05M
- Providers: 30
- Tool calling
- Structured output
- Reasoning
- Vision
Best AI models · 2026-06-29
Models with strong tool calling, structured output and large context windows.
$2.50 in / $15.00 out
$5.00 in / $30.00 out
$1.25 in / $10.00 out
$0.300 in / $2.50 out
$0.500 in / $3.00 out
$2.00 in / $12.00 out
$0.100 in / $0.400 out
$1.50 in / $9.00 out
$0.435 in / $0.870 out
$1.40 in / $4.40 out
Same shortlist sliced four ways — pick the tier that matches your budget and constraints.
Lowest total per-1M-token cost in this list ($0.50).
Lowest-cost option that still meets the use case. Pick this when you have high volume or strict unit-economics.
Median price ($10.50) — typically the safest default.
Good-enough quality at a mid-tier price. The default choice for most production apps.
Highest-priced pick in the list ($35.00) — usually the flagship.
Highest-capability model in this list. Pick when accuracy or reasoning matters more than cost.
Open weights and the cheapest in that subset ($1.30).
Open weights — self-host on your own GPUs, fine-tune on private data, run offline. Pricing here reflects the cheapest API host.
Right now we put GPT-5.4 from OpenAI at the top, primarily because it combines tool calling, structured output and a context window large enough to fit real source files. Rankings are recomputed from live model metadata — see "How we picked these" above for the exact rule.
Gemini 2.5 Flash-Lite (Google) is the lowest-priced pick at $0.100 per 1M input tokens and $0.400 per 1M output tokens. Costs from other entries scale up from there.
Each pick comes from a programmatic rule defined in our use-case-rules config: a hard filter (e.g. tool calling required, context ≥ 100K) plus a numeric score combining capability, context window and price. We never hand-curate the order, but we do hand-curate the rule. Underlying model metadata is refreshed daily from a normalised canonical catalogue.
The underlying model data is refreshed once per day, and the static page is rebuilt when the data changes. The 'Last updated' date below shows the most recent rebuild.
Coding and agent workflows almost always need to invoke external tools — the editor, a shell, a test runner, a database. Without first-class function calling, you have to parse free-form text the model emits, which is fragile in production.
Last updated:
Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.