Comparison · 2026-06-29

Gemini 2.5 Flash vs Claude Haiku 4.5 (latest)

Side-by-side comparison of pricing, context window and capabilities for Gemini 2.5 Flash (Google) and Claude Haiku 4.5 (latest) (Anthropic). Focus: cheap vs premium.

	Gemini 2.5 Flash	Claude Haiku 4.5 (latest)
Vendor	Google	Anthropic
Input price / 1M tokens	$0.300	$1.00
Output price / 1M tokens	$2.50	$5.00
Total per 1M (in + out)	$2.80	$6.00
Context window	1.05M	200K
Max output tokens	66K	64K
Tool calling	✓ Yes	✓ Yes
Structured output	✓ Yes	Unknown
Reasoning	✓ Yes	✓ Yes
Vision input	✓ Yes	✓ Yes
Open weights	✗ No	✗ No
Provider availability	27 providers	23 providers
Release date	2025-06-17	2025-10-15
Knowledge cutoff	2025-01	2025-02-28

Quick takeaway

Gemini 2.5 Flash is 2.1× cheaper per 1M tokens (input + output combined).
Gemini 2.5 Flash has a larger context window (1.05M vs 200K).
Only Gemini 2.5 Flash supports structured output / JSON mode.
Gemini 2.5 Flash is more widely available across providers (27 vs 23).

Gemini 2.5 Flash · See all 27 providers →Claude Haiku 4.5 (latest) · See all 23 providers →

How to read this comparison

This page compares Gemini 2.5 Flash and Claude Haiku 4.5 (latest) on the dimensions that matter most for production LLM selection: per-token cost, context window, declared capabilities (tool calling, structured output, reasoning, vision), and provider availability.

Green highlights in the table indicate which model leads on a given row. "Leads" means lower price or higher context/capability — not necessarily "better for your use case". A model that costs 3× more may still be the right choice if it unlocks a capability you need.

What this comparison does NOT tell you

Quality / accuracy — we have no benchmark data. Declared capabilities ≠ measured performance.
Latency — time-to-first-token varies by provider, region and load. Test with your actual traffic.
Prompt caching savings — if you reuse system prompts, the cheaper model on headline rate may not be cheapest in practice.
Fine-tuning availability — not all models can be fine-tuned, even if they are open-weight.

Data is refreshed daily. If a model's capabilities or pricing change, this page updates automatically on the next build cycle.

Frequently asked questions

Is Gemini 2.5 Flash cheaper than Claude Haiku 4.5 (latest)?

Gemini 2.5 Flash costs $0.300/1M tokens input + $2.50/1M tokens output, while Claude Haiku 4.5 (latest) costs $1.00/1M tokens input + $5.00/1M tokens output. Gemini 2.5 Flash is cheaper per combined 1M tokens.

Which model has a longer context window, Gemini 2.5 Flash or Claude Haiku 4.5 (latest)?

Gemini 2.5 Flash supports a longer context window — 1,048,576 tokens vs 200,000 tokens.

Where can I run Gemini 2.5 Flash and Claude Haiku 4.5 (latest)?

Gemini 2.5 Flash is available on 27 providers; Claude Haiku 4.5 (latest) is available on 23 providers. See each model's detail page for the full provider list.

Model details

By vendor

Last updated: 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.