Tools · 2026-05-22
Context Window Calculator
Convert LLM token budgets into something you can plan around: book pages, English words, lines of code or minutes of transcribed audio.
| Context | English words | Book pages | Code lines | Transcript | CJK chars |
|---|---|---|---|---|---|
| 8K tokens | 6.0K | 23 | 889 | 40 min | 8.0K |
| 32K tokens | 24K | 91 | 3.6K | 160 min | 32K |
| 128K tokens | 96K | 366 | 14K | 640 min | 128K |
| 200K tokens | 150K | 571 | 22K | 1.0K min | 200K |
| 1M tokens | 750K | 2.9K | 111K | 5.0K min | 1.0M |
| 2M tokens | 1.5M | 5.7K | 222K | 10K min | 2.0M |
Conversion assumptions
- 1 token ≈ 0.75 English words or 4 characters (BPE tokenizers)
- 1 standard book page ≈ 350 tokens
- 1 line of typical TypeScript / Python ≈ 9 tokens
- 1 minute of transcribed speech ≈ 200 tokens (≈150 wpm × 1.33 tokens/word)
- CJK scripts: ~1 token per character (worst case for non-Latin scripts)
Real-world numbers vary by tokenizer family (GPT-4o, Llama 3, DeepSeek V3 use slightly different vocabularies). Treat these as planning estimates, not exact measurements.
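As a rough illustration, the ratio-based columns (words, pages, code lines, CJK characters) can be reproduced with a few lines of Python. This is a minimal sketch, not the calculator's actual implementation: the names `BudgetEstimate` and `estimate_budget` are hypothetical, and it assumes "K" means 1,000 tokens, which is what the table's rounding suggests.

```python
from dataclasses import dataclass

# Ratios copied from the assumptions list; planning estimates only.
WORDS_PER_TOKEN = 0.75
TOKENS_PER_PAGE = 350
TOKENS_PER_CODE_LINE = 9
CJK_CHARS_PER_TOKEN = 1.0


@dataclass(frozen=True)
class BudgetEstimate:
    tokens: int
    english_words: int
    book_pages: int
    code_lines: int
    cjk_chars: int


def estimate_budget(tokens: int) -> BudgetEstimate:
    """Convert a token budget into rounded planning units."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return BudgetEstimate(
        tokens=tokens,
        english_words=round(tokens * WORDS_PER_TOKEN),
        book_pages=round(tokens / TOKENS_PER_PAGE),
        code_lines=round(tokens / TOKENS_PER_CODE_LINE),
        cjk_chars=round(tokens * CJK_CHARS_PER_TOKEN),
    )


if __name__ == "__main__":
    # 128K row of the table: 96K words, 366 pages, about 14K code lines.
    print(estimate_budget(128_000))
```

Swapping the constants for ratios measured with your own tokenizer should give tighter estimates for a specific model family.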
Models per budget tier
≥ 128K tokens
- DeepSeek-V3.2 · DeepSeek · 128K
- Meta-Llama-3.1-8B-Instruct · Meta · 128K
- Llama-3.3-70B-Instruct · Meta · 128K
- GPT-4o · OpenAI · 128K
≥ 200K tokens
- GLM-5.1 · Z.AI / Zhipu · 200K
- Claude Haiku 4.5 (latest) · Anthropic · 200K
- GLM-4.7-Flash · Z.AI / Zhipu · 200K
- Claude Sonnet 4.5 (latest) · Anthropic · 200K
≥ 1M tokens
- DeepSeek V4 Pro · DeepSeek · 1M
- Claude Opus 4.6 · Anthropic · 1M
- Claude Sonnet 4.6 · Anthropic · 1M
- Claude Opus 4.7 · Anthropic · 1M
≥ 2M tokens
- Grok 4 Fast (Reasoning) · xAI · 2M
- xAI Grok 4 Fast Non-Reasoning · xAI · 2M
- Grok 4.20 Multi-Agent · xAI · 2M
- Grok 4 Fast · xAI · 2M
See the full ranked table at best long-context LLMs or every model meeting the threshold at /capabilities/long-context.
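If you keep your own shortlist, filtering it by a required budget is a one-liner. The sketch below is illustrative only: the `MODELS` list holds a handful of entries copied from the tiers above, `models_with_at_least` is a hypothetical helper, and the window sizes assume "K" means 1,000 tokens.

```python
# (name, provider, context window in tokens); a small sample, not the full list.
MODELS = [
    ("GPT-4o", "OpenAI", 128_000),
    ("Claude Sonnet 4.5 (latest)", "Anthropic", 200_000),
    ("Claude Opus 4.7", "Anthropic", 1_000_000),
    ("Grok 4 Fast", "xAI", 2_000_000),
]


def models_with_at_least(required_tokens, models=MODELS):
    """Return the names of models whose window is at least required_tokens."""
    return [name for name, _provider, window in models if window >= required_tokens]


if __name__ == "__main__":
    # A 500K-token job rules out the 128K and 200K tiers.
    print(models_with_at_least(500_000))
```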
Frequently asked questions
Why doesn't 1 million tokens equal 1 million words?
Tokens are a model-specific unit, each roughly equivalent to 0.75 English words or 4 characters. A 1M-token context window holds about 750,000 English words, or roughly 2,900 standard book pages at 350 tokens per page.
Does code take more or fewer tokens than English?
More. Source code tokenises at roughly 1 token per 3 characters, versus about 4 characters per token for English prose, because programming languages use a lot of punctuation and short keywords that BPE tokenizers don't merge. At about 9 tokens per line, a 100K-token window typically fits around 11,000 lines of TypeScript or Python.
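When no tokenizer is at hand, a character count gives a quick estimate. The sketch below applies the characters-per-token ratios quoted on this page (3 for code, 4 for English); `estimate_tokens` is a hypothetical helper, and a real tokenizer will give different counts.

```python
import math

CHARS_PER_TOKEN_CODE = 3     # rough ratio for source code
CHARS_PER_TOKEN_ENGLISH = 4  # rough ratio for English prose


def estimate_tokens(text: str, chars_per_token: float) -> int:
    """Estimate a token count from character length, rounding up."""
    if chars_per_token <= 0:
        raise ValueError("chars_per_token must be positive")
    return math.ceil(len(text) / chars_per_token)


if __name__ == "__main__":
    line = "x" * 27  # stand-in for a 27-character line of code
    print(estimate_tokens(line, CHARS_PER_TOKEN_CODE))
```

A 27-character line comes out at 9 tokens under the code ratio, which lines up with the 9 tokens per line used in the conversion assumptions.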
What about Chinese, Japanese or Korean (CJK) text?
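CJK scripts tokenise much less efficiently per character: typically about 1 token per character, versus roughly 4 characters per token for English. A 200K-token window therefore fits about 200,000 CJK characters, and the English book-page estimate for the same budget (about 570 pages) does not carry over.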
Should I just use the biggest context window I can?
Not always. Cost scales with input length, and effective recall (retrieving the right detail back) often degrades for content buried in the middle of very long prompts. For many apps, classical retrieval-augmented generation (RAG) over a 200K-window model is a better fit than stuffing a 1M window.
Does the context budget include the model's reply?
Yes. The context window is the total of input plus output tokens that fit in one call. The output limit is a separate, smaller cap on how much the model can generate per reply (e.g. 128K context with 16K max output).
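In planning terms, the room left for your prompt is the window minus whatever you reserve for the reply. A minimal sketch using the 128K / 16K example (the function names are hypothetical, and individual providers may account for limits differently):

```python
def max_input_tokens(context_window: int, reserved_output: int) -> int:
    """Tokens left for the prompt after reserving room for the reply."""
    if reserved_output < 0 or reserved_output > context_window:
        raise ValueError("reserved_output must be between 0 and context_window")
    return context_window - reserved_output


def prompt_fits(prompt_tokens: int, context_window: int, reserved_output: int) -> bool:
    """True if the prompt plus the reserved reply fits in the window."""
    return prompt_tokens <= max_input_tokens(context_window, reserved_output)


if __name__ == "__main__":
    # 128K context with a 16K reply reserved leaves 112K for input.
    print(max_input_tokens(128_000, 16_000))
    print(prompt_fits(120_000, 128_000, 16_000))
```

Here a 120K-token prompt would not fit once 16K is set aside for output, even though it is smaller than the nominal 128K window.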
Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.