能力 · 2026-05-12
支持超长上下文的 AI 模型
对比支持 200K tokens 及以上上下文窗口的 AI 模型 —— 长文档与大规模代码场景。
这是什么?
- 长上下文 LLM 可一次接受 200K tokens 或更长的输入 —— 足以装下整本书、多文件代码库或数小时转写稿。
- 部分模型可扩展到 1M、2M 甚至 10M tokens 的上下文。
为什么重要
- 长上下文是 RAG 的替代或补充 —— 你可以直接粘贴全部内容,而不只检索片段。
- 注意:有效召回会随输入变长而下降,且按百万 token 计价会让长提示很贵。
- 部分厂商在 200K 以上有阶梯价 —— 详见各模型详情页的 >200K 费率。
397 个模型支持此能力
显示前 60 / 共 397 项。 用 完整目录 进一步筛选。
Frequently asked questions
How many AI models support 200K+ 上下文窗口?
397 canonical models in our database currently support 200K+ 上下文窗口. The list is regenerated on every data refresh, so it always reflects the latest model releases from models.dev.
What is the cheapest model with 200K+ 上下文窗口?
Gemini 1.5 Flash-8B from Google is currently the lowest-priced option, at $0.037 per 1M input tokens and $0.150 per 1M output tokens. The full table above is sorted price-ascending.
Which model with 200K+ 上下文窗口 has the largest context window?
Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you also need long-document understanding alongside 200K+ 上下文窗口.
Which models are available on the most providers?
Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.5 (45), MiniMax-M2.5 (40), GLM-5 (38).
How is 200K+ 上下文窗口 different from a regular LLM?
Long-context models accept ≥ 200K input tokens — enough for entire books, codebases or hours of transcripts in one prompt. Effective recall and per-token pricing both degrade with input length, so 'big context' is not always the right choice over RAG.
How often is this list updated?
Daily. Our data pipeline pulls models.dev once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.
Explore more
Top models with this capability
- Gemini 1.5 Flash-8B$0.04 in / $0.15 out
- Qwen3 235B A22B Instruct 2507$0.10 in / $0.10 out
- Qwen3-235B-A22B-Thinking-2507$0.10 in / $0.10 out
- Qwen3 30B A3B Instruct 2507$0.10 in / $0.10 out
- Qwen3 30B A3B Thinking 2507$0.10 in / $0.10 out
Other capabilities
Best-of lists you might also want
Pricing comparisons
Vendors in this list
最近更新:
Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.