AI 模型情報

能力 · 2026-06-29

支援超長上下文的 AI 模型

對比支援 200K tokens 及以上上下文視窗的 AI 模型 —— 長文件與大規模程式碼場景。

這是什麼?

  • 長上下文 LLM 可一次接受 200K tokens 或更長的輸入 —— 足以裝下整本書、多檔案程式庫或數小時轉寫稿。
  • 部分模型可擴展到 1M、2M 甚至 10M tokens 的上下文。

為什麼重要

  • 長上下文是 RAG 的替代或補充 —— 你可以直接貼上全部內容,而不只檢索片段。
  • 注意:有效召回會隨輸入變長而下降,且按百萬 token 計價會讓長提示很貴。
  • 部分廠商在 200K 以上有階梯價 —— 詳見各模型詳情頁的 >200K 費率。

509 個模型支援此能力

模型廠商輸入 / 1M輸出 / 1M上下文服務商
Ling-2.6-flashopenrouter$0.010$0.030262K1
Google Gemma 3 27B InstructGoogle$0.030$0.110203K10
Qwen3 235B A22B 2507Alibaba (Qwen)$0.071$0.100262K3
Qwen3.5 9BAlibaba (Qwen)$0.040$0.150262K14
Qwen3 235B A22B Instruct 2507Alibaba (Qwen)$0.100$0.100262K16
Qwen3-235B-A22B-Thinking-2507Alibaba (Qwen)$0.100$0.100262K16
Greg 1 Minicrof$0.070$0.150229K1
Qwen3 30B A3B Instruct 2507Alibaba (Qwen)$0.048$0.193262K12
Qwen TurboAlibaba (Qwen)$0.050$0.2001M5
Hy3 previewopenrouter$0.063$0.210262K1
Amazon Nova Lite 1.0nano-gpt$0.059$0.238300K1
Ministral 3 8B 2512Mistral$0.150$0.150262K3
Nova Litevercel$0.060$0.240300K1
Ministral 8Bllmgateway$0.150$0.150262K1
Amazon: Nova Lite 1.0kilo$0.060$0.240300K1
Nova Liteamazon-bedrock$0.060$0.240300K1
Nova Lite 1.0openrouter$0.060$0.240300K1
Laguna XS.2openrouter$0.100$0.200262K1
inclusionAI: Ling-2.6 Flashkilo$0.080$0.240262K1
Ling 2.6 Flashnano-gpt$0.080$0.240262K1
Hy3 previewsiliconflow$0.066$0.260262K1
Tencent: Hy3 Previewkilo$0.066$0.260262K1
Tencent: Hy3 previewnano-gpt$0.066$0.260262K1
GLM-4.7-FlashZ.AI / Zhipu$0.040$0.300200K19
Qwen LongAlibaba (Qwen)$0.072$0.28710M2
Seed 1.6 Flash (250715)llmgateway$0.070$0.300256K1
Gemini 2.0 Flash-LiteGoogle$0.075$0.3001.05M4
ByteDance Seed: Seed 1.6 Flashkilo$0.075$0.300262K1
Seed 1.6 Flashopenrouter$0.075$0.300262K1
Llama 4 ScoutMeta$0.080$0.300328K5
Step 3.5 Flashrouting-run$0.096$0.288262K1
Gemma 4 26B A4B ITGoogle$0.060$0.330262K16
Qwen3 30B A3B Thinking 2507Alibaba (Qwen)$0.051$0.340262K4
Gemma 4 31B ITGoogle$0.100$0.300262K26
Step 3.5 FlashStepFun$0.100$0.300256K11
MiMo-V2-Flashxiaomi$0.100$0.300262K6
Ministral 3 14B 2512Mistral$0.200$0.200262K3
Step 3.5 Flash 2603StepFun$0.100$0.300256K2
Mimo-V2-Flashqiniu-ai$0.100$0.300256K1
MiMo-V2-Flashhuggingface$0.100$0.300262K1
Ling-2.6-flashnovita-ai$0.100$0.300262K1
XiaomiMiMo/MiMo-V2-Flashnovita-ai$0.100$0.300262K1
Step 3.5 Flashstepfun-ai$0.100$0.300256K1
Step 3.5 Flash 2603stepfun-ai$0.100$0.300256K1
Ministral 14Bllmgateway$0.200$0.200262K1
MiMo V2 Flashmeganova$0.100$0.300262K1
Greg (Roleplay)crof$0.100$0.300229K1
Step 3.5 Flash 2603routing-run$0.100$0.300262K1
OWLnano-gpt$0.100$0.3001.05M1
MiMo V2 Flash Originalxiaomi$0.102$0.306256K1
MiMo V2 Flash (Thinking) Originalxiaomi$0.102$0.306256K1
MiMo V2 Flash (Thinking)xiaomi$0.102$0.306256K1
DeepSeek V4 FlashDeepSeek$0.140$0.2801M31
DeepSeek ChatDeepSeek$0.140$0.2801M8
DeepSeek ReasonerDeepSeek$0.140$0.2801M5
MiMo V2.5opencode-go$0.140$0.2801M1
MiMo-V2.5llmgateway$0.140$0.2801M1
Coding Router Lownano-gpt$0.140$0.2801M1
Coding Router Mediumnano-gpt$0.140$0.2801M1
Gemini 2.5 Flash Lite Preview 09-2025Google$0.090$0.3601.05M6

顯示前 60 項,共 509 項。 完整目錄 進一步篩選。

Frequently asked questions

How many AI models support 200K+ 上下文視窗?

509 canonical models in our database currently support 200K+ 上下文視窗. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 200K+ 上下文視窗?

Ling-2.6-flash from openrouter is currently the lowest-priced option, at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The full table above is sorted price-ascending.

Which model with 200K+ 上下文視窗 has the largest context window?

Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you also need long-document understanding alongside 200K+ 上下文視窗.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is 200K+ 上下文視窗 different from a regular LLM?

Long-context models accept ≥ 200K input tokens — enough for entire books, codebases or hours of transcripts in one prompt. Effective recall and per-token pricing both degrade with input length, so 'big context' is not always the right choice over RAG.

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

最近更新:

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.