KI‑Modell‑Intelligenz

Tool · 2026-05-12

LLM‑Preisrechner

Geschätzte monatliche Kosten nach Ihrem Token‑Volumen.

Presets:

Total tokens / month: 465,000,000 (360,000,000 in + 105,000,000 out)

#ModelVendorIn $/1MOut $/1MMonthly costvs cheapest
1Meta-Llama-3.1-8B-InstructMeta$0.020$0.030$10.35
2Gemma 3 27BGoogle$0.027$0.109$21.212.0×
3GPT OSS 20BOpenAI$0.030$0.140$25.502.5×
4GPT OSS 120BOpenAI$0.040$0.160$31.203.0×
5Llama-3.3-70B-InstructMeta$0.050$0.230$42.154.1×
6Qwen3 235B A22B Instruct 2507Alibaba (Qwen)$0.100$0.100$46.504.5×
7Qwen3-235B-A22B-Thinking-2507Alibaba (Qwen)$0.100$0.100$46.504.5×
8GPT-5 NanoOpenAI$0.050$0.400$60.005.8×
9GLM-4.7-FlashZ.AI / Zhipu$0.060$0.400$63.606.1×
10Gemini 2.5 Flash LiteGoogle$0.100$0.400$78.007.5×
11GPT-4.1 nanoOpenAI$0.100$0.400$78.007.5×
12DeepSeek V4 FlashDeepSeek$0.140$0.280$79.807.7×
13GPT-4o miniOpenAI$0.150$0.600$117.0011.3×
14Qwen3 Coder NextAlibaba (Qwen)$0.120$0.750$121.9511.8×
15Grok 4.1 Fast (Non-Reasoning)xAI$0.200$0.500$124.5012.0×
16DeepSeek-V3.2DeepSeek$0.260$0.380$133.5012.9×
17DeepSeek-V3.1DeepSeek$0.200$0.700$145.5014.1×
18DeepSeek V3.1 TerminusDeepSeek$0.250$0.700$163.5015.8×
19DeepSeek-V3-0324DeepSeek$0.250$0.700$163.5015.8×
20GLM-4.5-AirZ.AI / Zhipu$0.200$1.10$187.5018.1×
21DeepSeek-V3.1DeepSeek$0.250$1.00$195.0018.8×
22GLM-4.6VZ.AI / Zhipu$0.300$0.900$202.5019.6×
23GPT-5.4 nanoOpenAI$0.200$1.25$203.2519.6×
24Grok Code Fast 1xAI$0.200$1.50$229.5022.2×
25MiniMax-M2.5MiniMax$0.300$1.20$234.0022.6×
26MiniMax-M2.1MiniMax$0.300$1.20$234.0022.6×
27MiniMax-M2.7MiniMax$0.300$1.20$234.0022.6×
28MiniMax-M2MiniMax$0.300$1.20$234.0022.6×
29Qwen3.6 PlusAlibaba (Qwen)$0.276$1.65$272.7226.3×
30GPT-5 MiniOpenAI$0.250$2.00$300.0029.0×
31GPT-5.1 Codex miniOpenAI$0.250$2.00$300.0029.0×
32GPT-4.1 miniOpenAI$0.400$1.60$312.0030.1×
33DeepSeek-R1-0528DeepSeek$0.400$1.70$322.5031.2×
34DeepSeek-R1DeepSeek$0.400$1.70$322.5031.2×
35Gemini 2.5 FlashGoogle$0.300$2.50$370.5035.8×
36Qwen3-Next 80B-A3B InstructAlibaba (Qwen)$0.500$2.00$390.0037.7×
37Qwen3-Coder 30B-A3B InstructAlibaba (Qwen)$0.450$2.25$398.2538.5×
38GLM-4.5VZ.AI / Zhipu$0.600$1.80$405.0039.1×
39GLM-4.7Z.AI / Zhipu$0.600$2.20$447.0043.2×
40GLM-4.6Z.AI / Zhipu$0.600$2.20$447.0043.2×
41GLM-4.5Z.AI / Zhipu$0.600$2.20$447.0043.2×
42Kimi K2 ThinkingMoonshot AI$0.600$2.50$478.5046.2×
43Gemini 3 Flash PreviewGoogle$0.500$3.00$495.0047.8×
44Kimi K2.5Moonshot AI$0.600$3.00$531.0051.3×
45Qwen3 32BAlibaba (Qwen)$0.700$2.80$546.0052.8×
46Qwen3.5 397B-A17BAlibaba (Qwen)$0.600$3.60$594.0057.4×
47GLM-5Z.AI / Zhipu$1.00$3.20$696.0067.2×
48GPT-5.4 miniOpenAI$0.750$4.50$742.5071.7×
49Kimi K2.6Moonshot AI$0.950$4.00$762.0073.6×
50Qwen3-Next 80B-A3B (Thinking)Alibaba (Qwen)$0.500$6.00$810.0078.3×
51o3-miniOpenAI$1.10$4.40$858.0082.9×
52Claude Haiku 4.5 (latest)Anthropic$1.00$5.00$885.0085.5×
53GLM-5.1Z.AI / Zhipu$1.40$4.40$966.0093.3×
54DeepSeek V4 ProDeepSeek$1.74$3.48$991.8095.8×
55Qwen3-Coder 480B-A35B InstructAlibaba (Qwen)$1.50$7.50$1,328128.3×
56Gemini 2.5 ProGoogle$1.25$10.00$1,500144.9×
57GPT-5OpenAI$1.25$10.00$1,500144.9×
58GPT-5.1OpenAI$1.25$10.00$1,500144.9×
59GPT-5.1 CodexOpenAI$1.25$10.00$1,500144.9×
60GPT-5.1 Codex MaxOpenAI$1.25$10.00$1,500144.9×
61GPT-5-CodexOpenAI$1.25$10.00$1,500144.9×
62GPT-4.1OpenAI$2.00$8.00$1,560150.7×
63o3OpenAI$2.00$8.00$1,560150.7×
64Qwen2.5-VL 72B InstructAlibaba (Qwen)$2.80$8.40$1,890182.6×
65GPT-4oOpenAI$2.50$10.00$1,950188.4×
66Gemini 3.1 Pro PreviewGoogle$2.00$12.00$1,980191.3×
67GPT-5.2OpenAI$1.75$14.00$2,100202.9×
68GPT-5.2 CodexOpenAI$1.75$14.00$2,100202.9×
69GPT-5.3 CodexOpenAI$1.75$14.00$2,100202.9×
70Claude Sonnet 4Anthropic$2.60$13.00$2,301222.3×
71GPT-5.4OpenAI$2.50$15.00$2,475239.1×
72Claude Sonnet 4.6Anthropic$3.00$15.00$2,655256.5×
73Claude Sonnet 4.5 (latest)Anthropic$3.00$15.00$2,655256.5×
74Claude Opus 4.6Anthropic$5.00$25.00$4,425427.5×
75Claude Opus 4.7Anthropic$5.00$25.00$4,425427.5×
76Claude Opus 4.5 (latest)Anthropic$5.00$25.00$4,425427.5×
77GPT-5.5OpenAI$5.00$30.00$4,950478.3×
78Claude Opus 4.1 (latest)Anthropic$15.00$75.00$13,2751282.6×
79GPT-5 ProOpenAI$15.00$120.00$18,0001739.1×
80GPT-5.4 ProOpenAI$30.00$180.00$29,7002869.6×

So wird gerechnet

monthly_cost = requests × ((avg_input_tokens × input_price + avg_output_tokens × output_price) / 1,000,000).

Numbers shown are estimates only. Real-world cost depends on prompt caching, >200K context tier rates, audio/image surcharges and provider-specific overage. Always confirm with the model detail page or the provider's official pricing.

Frequently asked questions

How accurate are the cost estimates?

We multiply your average input/output token counts by the recommended provider's headline rate from models.dev. The math is exact, but the result is only as accurate as your token-count assumptions. Real production cost will also depend on prompt caching, batch-API discounts, >200K-context tier rates and audio/image surcharges — none of which the calculator factors in.

Where do the per-token prices come from?

All prices are pulled daily from the public models.dev API and reflect each provider's published list rate in USD. They are NOT including taxes, prepaid credits or volume discounts.

Can I share or bookmark a calculation?

Yes. The URL captures your selected model and token settings, so you can link a teammate to a specific scenario without retyping the inputs.

Why are some models missing from the picker?

We only include text-LLMs with a published per-token price. Models without published rates (often invite-only or enterprise-gated) and embedding/audio-only models are excluded — the calculator is text-completion-focused.

How do I estimate cost for caching or long context?

Open the model's detail page — the Pricing detail block lists cache_read / cache_write / context_over_200k rates where applicable. The headline calculator number is a 'no-cache, ≤200K' baseline.

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.