AI Model Intelligence

Tool · 2026-06-29

LLM Price Calculator

Estimated monthly cost based on your token volume.

Total tokens / month: 465,000,000 (360,000,000 in + 105,000,000 out)

| # | Model | Vendor | In $/1M | Out $/1M | Monthly cost | vs cheapest |
|---|-------|--------|---------|----------|--------------|-------------|
| 1 | Meta-Llama-3.1-8B-Instruct | Meta | $0.020 | $0.030 | $10.35 | (cheapest) |
| 2 | gpt-oss-20b | OpenAI | $0.029 | $0.140 | $25.14 | 2.4× |
| 3 | gpt-oss-120b | OpenAI | $0.030 | $0.150 | $26.55 | 2.6× |
| 4 | Llama-3.3-70B-Instruct | Meta | $0.050 | $0.230 | $42.15 | 4.1× |
| 5 | GLM-4.7-Flash | Z.AI / Zhipu | $0.040 | $0.300 | $45.90 | 4.4× |
| 6 | Qwen3 235B A22B Instruct 2507 | Alibaba (Qwen) | $0.100 | $0.100 | $46.50 | 4.5× |
| 7 | Qwen3-235B-A22B-Thinking-2507 | Alibaba (Qwen) | $0.100 | $0.100 | $46.50 | 4.5× |
| 8 | Gemma 4 26B A4B IT | Google | $0.060 | $0.330 | $56.25 | 5.4× |
| 9 | GPT-5 Nano | OpenAI | $0.050 | $0.400 | $60.00 | 5.8× |
| 10 | Gemma 4 31B IT | Google | $0.100 | $0.300 | $67.50 | 6.5× |
| 11 | Gemini 2.5 Flash-Lite | Google | $0.100 | $0.400 | $78.00 | 7.5× |
| 12 | GPT-4.1 nano | OpenAI | $0.100 | $0.400 | $78.00 | 7.5× |
| 13 | DeepSeek V4 Flash | DeepSeek | $0.140 | $0.280 | $79.80 | 7.7× |
| 14 | DeepSeek-V3.2 | DeepSeek | $0.180 | $0.350 | $101.55 | 9.8× |
| 15 | Qwen3 Coder Next | Alibaba (Qwen) | $0.108 | $0.675 | $109.76 | 10.6× |
| 16 | GPT-4o mini | OpenAI | $0.150 | $0.600 | $117.00 | 11.3× |
| 17 | DeepSeek-V3.1 | DeepSeek | $0.200 | $0.700 | $145.50 | 14.1× |
| 18 | GLM-4.5-Air | Z.AI / Zhipu | $0.200 | $1.10 | $187.50 | 18.1× |
| 19 | GPT-5.4 nano | OpenAI | $0.200 | $1.25 | $203.25 | 19.6× |
| 20 | MiniMax-M2.5 | MiniMax | $0.300 | $1.20 | $234.00 | 22.6× |
| 21 | MiniMax-M2.7 | MiniMax | $0.300 | $1.20 | $234.00 | 22.6× |
| 22 | MiniMax-M2.1 | MiniMax | $0.300 | $1.20 | $234.00 | 22.6× |
| 23 | MiniMax-M3 | MiniMax | $0.300 | $1.20 | $234.00 | 22.6× |
| 24 | MiniMax-M2 | MiniMax | $0.300 | $1.20 | $234.00 | 22.6× |
| 25 | Qwen3.6 35B-A3B | Alibaba (Qwen) | $0.248 | $1.49 | $245.21 | 23.7× |
| 26 | DeepSeek V4 Pro | DeepSeek | $0.435 | $0.870 | $247.95 | 24.0× |
| 27 | GPT-5 Mini | OpenAI | $0.250 | $2.00 | $300.00 | 29.0× |
| 28 | GPT-5.1 Codex mini | OpenAI | $0.250 | $2.00 | $300.00 | 29.0× |
| 29 | GPT-4.1 mini | OpenAI | $0.400 | $1.60 | $312.00 | 30.1× |
| 30 | DeepSeek-R1 | DeepSeek | $0.400 | $1.70 | $322.50 | 31.2× |
| 31 | DeepSeek-R1-0528 | DeepSeek | $0.400 | $1.70 | $322.50 | 31.2× |
| 32 | Gemini 2.5 Flash | Google | $0.300 | $2.50 | $370.50 | 35.8× |
| 33 | Qwen3-Next 80B-A3B Instruct | Alibaba (Qwen) | $0.500 | $2.00 | $390.00 | 37.7× |
| 34 | Qwen3-Coder 30B-A3B Instruct | Alibaba (Qwen) | $0.450 | $2.25 | $398.25 | 38.5× |
| 35 | GLM-4.7 | Z.AI / Zhipu | $0.600 | $2.20 | $447.00 | 43.2× |
| 36 | GLM-4.6 | Z.AI / Zhipu | $0.600 | $2.20 | $447.00 | 43.2× |
| 37 | GLM-4.5 | Z.AI / Zhipu | $0.600 | $2.20 | $447.00 | 43.2× |
| 38 | Kimi K2 Thinking | Moonshot AI | $0.600 | $2.50 | $478.50 | 46.2× |
| 39 | Gemini 3 Flash Preview | Google | $0.500 | $3.00 | $495.00 | 47.8× |
| 40 | Qwen3.6 Plus | Alibaba (Qwen) | $0.500 | $3.00 | $495.00 | 47.8× |
| 41 | Kimi K2.5 | Moonshot AI | $0.600 | $3.00 | $531.00 | 51.3× |
| 42 | Qwen3 32B | Alibaba (Qwen) | $0.700 | $2.80 | $546.00 | 52.8× |
| 43 | Qwen3.5 397B-A17B | Alibaba (Qwen) | $0.600 | $3.60 | $594.00 | 57.4× |
| 44 | GLM-5 | Z.AI / Zhipu | $1.00 | $3.20 | $696.00 | 67.2× |
| 45 | Grok 4.3 | xAI | $1.25 | $2.50 | $712.50 | 68.8× |
| 46 | GPT-5.4 mini | OpenAI | $0.750 | $4.50 | $742.50 | 71.7× |
| 47 | Kimi K2.6 | Moonshot AI | $0.950 | $4.00 | $762.00 | 73.6× |
| 48 | Kimi K2.7 Code | Moonshot AI | $0.950 | $4.00 | $762.00 | 73.6× |
| 49 | o3-mini | OpenAI | $1.10 | $4.40 | $858.00 | 82.9× |
| 50 | Claude Haiku 4.5 (latest) | Anthropic | $1.00 | $5.00 | $885.00 | 85.5× |
| 51 | GLM-5.1 | Z.AI / Zhipu | $1.40 | $4.40 | $966.00 | 93.3× |
| 52 | GLM-5.2 | Z.AI / Zhipu | $1.40 | $4.40 | $966.00 | 93.3× |
| 53 | Qwen3-Coder 480B-A35B Instruct | Alibaba (Qwen) | $1.50 | $7.50 | $1,328 | 128.3× |
| 54 | Gemini 3.5 Flash | Google | $1.50 | $9.00 | $1,485 | 143.5× |
| 55 | Gemini 2.5 Pro | Google | $1.25 | $10.00 | $1,500 | 144.9× |
| 56 | GPT-5 | OpenAI | $1.25 | $10.00 | $1,500 | 144.9× |
| 57 | GPT-5.1 | OpenAI | $1.25 | $10.00 | $1,500 | 144.9× |
| 58 | GPT-5.1 Codex | OpenAI | $1.25 | $10.00 | $1,500 | 144.9× |
| 59 | GPT-5-Codex | OpenAI | $1.25 | $10.00 | $1,500 | 144.9× |
| 60 | GPT-5.1 Codex Max | OpenAI | $1.25 | $10.00 | $1,500 | 144.9× |
| 61 | GPT-4.1 | OpenAI | $2.00 | $8.00 | $1,560 | 150.7× |
| 62 | o3 | OpenAI | $2.00 | $8.00 | $1,560 | 150.7× |
| 63 | Qwen3.7 Max | Alibaba (Qwen) | $2.50 | $7.50 | $1,688 | 163.0× |
| 64 | GPT-4o | OpenAI | $2.50 | $10.00 | $1,950 | 188.4× |
| 65 | Gemini 3.1 Pro Preview | Google | $2.00 | $12.00 | $1,980 | 191.3× |
| 66 | GPT-5.2 | OpenAI | $1.75 | $14.00 | $2,100 | 202.9× |
| 67 | GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | $2,100 | 202.9× |
| 68 | GPT-5.2 Codex | OpenAI | $1.75 | $14.00 | $2,100 | 202.9× |
| 69 | Claude Sonnet 4 (latest) | Anthropic | $2.60 | $13.00 | $2,301 | 222.3× |
| 70 | GPT-5.4 | OpenAI | $2.50 | $15.00 | $2,475 | 239.1× |
| 71 | Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | $2,655 | 256.5× |
| 72 | Claude Sonnet 4.5 (latest) | Anthropic | $3.00 | $15.00 | $2,655 | 256.5× |
| 73 | Claude Opus 4.6 | Anthropic | $5.00 | $25.00 | $4,425 | 427.5× |
| 74 | Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | $4,425 | 427.5× |
| 75 | Claude Opus 4.8 | Anthropic | $5.00 | $25.00 | $4,425 | 427.5× |
| 76 | Claude Opus 4.5 (latest) | Anthropic | $5.00 | $25.00 | $4,425 | 427.5× |
| 77 | GPT-5.5 | OpenAI | $5.00 | $30.00 | $4,950 | 478.3× |
| 78 | Claude Opus 4.1 (latest) | Anthropic | $15.00 | $75.00 | $13,275 | 1282.6× |
| 79 | GPT-5 Pro | OpenAI | $15.00 | $120.00 | $18,000 | 1739.1× |
| 80 | GPT-5.4 Pro | OpenAI | $30.00 | $180.00 | $29,700 | 2869.6× |
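
The ranking and the "vs cheapest" column can be reproduced from the listed rates. The following is a minimal sketch, not the tool's actual code: it takes four rows from the table above, applies the stated monthly volume (360,000,000 input and 105,000,000 output tokens), sorts by cost, and divides each cost by the lowest one. The names `rates`, `total_cost` and `ranked` are illustrative.

```python
INPUT_TOKENS = 360_000_000   # monthly input tokens, from the totals line
OUTPUT_TOKENS = 105_000_000  # monthly output tokens, from the totals line

# (model, input $/1M, output $/1M), copied from four rows of the table
rates = [
    ("GPT-5 Nano", 0.050, 0.400),
    ("Meta-Llama-3.1-8B-Instruct", 0.020, 0.030),
    ("Claude Haiku 4.5 (latest)", 1.00, 5.00),
    ("gpt-oss-20b", 0.029, 0.140),
]


def total_cost(in_price: float, out_price: float) -> float:
    """Monthly cost in USD for the fixed token volume above."""
    return (INPUT_TOKENS * in_price + OUTPUT_TOKENS * out_price) / 1_000_000


# Sort ascending by monthly cost; the first entry is the cheapest model.
ranked = sorted(
    ((name, total_cost(i, o)) for name, i, o in rates),
    key=lambda row: row[1],
)
cheapest = ranked[0][1]

for rank, (name, cost) in enumerate(ranked, start=1):
    print(f"{rank}. {name}: ${cost:,.2f} ({cost / cheapest:.1f}x)")
```

For these four rows the computed costs and ratios agree with the table. Where displayed rates are rounded to three digits, a recomputed cost can differ from the listed one by a few cents.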

How the cost is calculated

monthly_cost = requests × ((avg_input_tokens × input_price + avg_output_tokens × output_price) / 1,000,000).
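
As a rough illustration, the formula translates directly into a small function. This is a sketch only: the split into 30,000 requests of 12,000 input and 3,500 output tokens each is a hypothetical workload chosen so that it adds up to the 360,000,000 input and 105,000,000 output tokens shown above, and the rates are those listed for Meta-Llama-3.1-8B-Instruct.

```python
def monthly_cost(
    requests: int,
    avg_input_tokens: float,
    avg_output_tokens: float,
    input_price: float,
    output_price: float,
) -> float:
    """Estimated monthly cost in USD; prices are USD per 1M tokens."""
    per_request = (
        avg_input_tokens * input_price + avg_output_tokens * output_price
    ) / 1_000_000
    return requests * per_request


# Hypothetical workload (assumption, not from the tool):
# 30,000 requests x 12,000 input tokens = 360,000,000 input tokens
# 30,000 requests x  3,500 output tokens = 105,000,000 output tokens
cost = monthly_cost(
    requests=30_000,
    avg_input_tokens=12_000,
    avg_output_tokens=3_500,
    input_price=0.020,
    output_price=0.030,
)
print(f"${cost:.2f}")  # prints $10.35
```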

Numbers shown are estimates only. Real-world cost depends on prompt caching, >200K context tier rates, audio/image surcharges and provider-specific overage. Always confirm with the model detail page or the provider's official pricing.

Frequently asked questions

How accurate are the cost estimates?

We multiply your average input/output token counts by the recommended provider's headline rate from our daily-refreshed canonical catalogue. The math is exact, but the result is only as accurate as your token-count assumptions. Real production cost will also depend on prompt caching, batch-API discounts, >200K-context tier rates and audio/image surcharges — none of which the calculator factors in.

Where do the per-token prices come from?

All prices reflect each provider's published list rate in USD, refreshed daily from our normalised catalogue. They do not include taxes, prepaid credits or volume discounts.

Can I share or bookmark a calculation?

Yes. The URL captures your selected model and token settings, so you can link a teammate to a specific scenario without retyping the inputs.
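
One way such a shareable link could work is to store the scenario in the URL's query string. The sketch below only illustrates the idea: the base URL and the parameter names (`model`, `requests`, `in`, `out`) are assumptions, not the calculator's real URL scheme.

```python
from urllib.parse import parse_qs, urlencode, urlparse

# Placeholder address; the real calculator URL is not given in this document.
BASE_URL = "https://example.com/llm-price-calculator"


def build_share_url(model: str, requests: int, avg_in: int, avg_out: int) -> str:
    """Encode a scenario into a query string (parameter names are hypothetical)."""
    query = urlencode(
        {"model": model, "requests": requests, "in": avg_in, "out": avg_out}
    )
    return f"{BASE_URL}?{query}"


def parse_share_url(url: str) -> dict:
    """Recover the scenario from a URL produced by build_share_url."""
    params = parse_qs(urlparse(url).query)
    return {
        "model": params["model"][0],
        "requests": int(params["requests"][0]),
        "avg_in": int(params["in"][0]),
        "avg_out": int(params["out"][0]),
    }


url = build_share_url("Claude Haiku 4.5 (latest)", 30_000, 12_000, 3_500)
scenario = parse_share_url(url)
print(url)
print(scenario)
```

Keeping the whole scenario in the query string means the link carries everything needed to restore the inputs, with no server-side state.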

Why are some models missing from the picker?

We only include text-LLMs with a published per-token price. Models without published rates (often invite-only or enterprise-gated) and embedding/audio-only models are excluded — the calculator is text-completion-focused.

How do I estimate cost for caching or long context?

Open the model's detail page — the Pricing detail block lists cache_read / cache_write / context_over_200k rates where applicable. The headline calculator number is a 'no-cache, ≤200K' baseline.
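
To get a feel for how a cache_read rate shifts the baseline, the headline formula can be extended so that a fraction of the input tokens is billed at the cache rate. This is a simplified sketch under several assumptions: the rates and the 50% cache-hit fraction are hypothetical, cache_write charges and the context_over_200k tier are ignored, and real providers may bill caching differently.

```python
def monthly_cost_with_cache(
    requests: int,
    avg_input_tokens: float,
    avg_output_tokens: float,
    input_price: float,
    output_price: float,
    cache_read_price: float,
    cached_fraction: float,
) -> float:
    """Monthly cost in USD when part of the input is served from cache.

    Prices are USD per 1M tokens. cached_fraction is the assumed share of
    input tokens billed at cache_read_price instead of input_price.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = avg_input_tokens * cached_fraction
    fresh = avg_input_tokens - cached
    per_request = (
        fresh * input_price
        + cached * cache_read_price
        + avg_output_tokens * output_price
    ) / 1_000_000
    return requests * per_request


# Hypothetical rates: $1.00 input, $5.00 output, $0.10 cache read per 1M tokens.
baseline = monthly_cost_with_cache(30_000, 12_000, 3_500, 1.00, 5.00, 0.10, 0.0)
half_cached = monthly_cost_with_cache(30_000, 12_000, 3_500, 1.00, 5.00, 0.10, 0.5)
print(f"no cache:   ${baseline:,.2f}")
print(f"50% cached: ${half_cached:,.2f}")
```

With these assumed numbers, only the input share of the bill shrinks; output tokens are billed at the full rate either way, so output-heavy workloads gain less from caching.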

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.