能力 · 2026-06-29

支援超長上下文的 AI 模型

對比支援 200K tokens 及以上上下文視窗的 AI 模型 —— 長文件與大規模程式碼場景。

這是什麼？

長上下文 LLM 可一次接受 200K tokens 或更長的輸入 —— 足以裝下整本書、多檔案程式庫或數小時轉寫稿。
部分模型可擴展到 1M、2M 甚至 10M tokens 的上下文。

為什麼重要

長上下文是 RAG 的替代或補充 —— 你可以直接貼上全部內容，而不只檢索片段。
注意：有效召回會隨輸入變長而下降，且按百萬 token 計價會讓長提示很貴。
部分廠商在 200K 以上有階梯價 —— 詳見各模型詳情頁的 >200K 費率。

509 個模型支援此能力

模型	廠商	輸入 / 1M	輸出 / 1M	上下文	服務商
Ling-2.6-flash	openrouter	$0.010	$0.030	262K	1
Google Gemma 3 27B Instruct	Google	$0.030	$0.110	203K	10
Qwen3 235B A22B 2507	Alibaba (Qwen)	$0.071	$0.100	262K	3
Qwen3.5 9B	Alibaba (Qwen)	$0.040	$0.150	262K	14
Qwen3 235B A22B Instruct 2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Qwen3-235B-A22B-Thinking-2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Greg 1 Mini	crof	$0.070	$0.150	229K	1
Qwen3 30B A3B Instruct 2507	Alibaba (Qwen)	$0.048	$0.193	262K	12
Qwen Turbo	Alibaba (Qwen)	$0.050	$0.200	1M	5
Hy3 preview	openrouter	$0.063	$0.210	262K	1
Amazon Nova Lite 1.0	nano-gpt	$0.059	$0.238	300K	1
Ministral 3 8B 2512	Mistral	$0.150	$0.150	262K	3
Nova Lite	vercel	$0.060	$0.240	300K	1
Ministral 8B	llmgateway	$0.150	$0.150	262K	1
Amazon: Nova Lite 1.0	kilo	$0.060	$0.240	300K	1
Nova Lite	amazon-bedrock	$0.060	$0.240	300K	1
Nova Lite 1.0	openrouter	$0.060	$0.240	300K	1
Laguna XS.2	openrouter	$0.100	$0.200	262K	1
inclusionAI: Ling-2.6 Flash	kilo	$0.080	$0.240	262K	1
Ling 2.6 Flash	nano-gpt	$0.080	$0.240	262K	1
Hy3 preview	siliconflow	$0.066	$0.260	262K	1
Tencent: Hy3 Preview	kilo	$0.066	$0.260	262K	1
Tencent: Hy3 preview	nano-gpt	$0.066	$0.260	262K	1
GLM-4.7-Flash	Z.AI / Zhipu	$0.040	$0.300	200K	19
Qwen Long	Alibaba (Qwen)	$0.072	$0.287	10M	2
Seed 1.6 Flash (250715)	llmgateway	$0.070	$0.300	256K	1
Gemini 2.0 Flash-Lite	Google	$0.075	$0.300	1.05M	4
ByteDance Seed: Seed 1.6 Flash	kilo	$0.075	$0.300	262K	1
Seed 1.6 Flash	openrouter	$0.075	$0.300	262K	1
Llama 4 Scout	Meta	$0.080	$0.300	328K	5
Step 3.5 Flash	routing-run	$0.096	$0.288	262K	1
Gemma 4 26B A4B IT	Google	$0.060	$0.330	262K	16
Qwen3 30B A3B Thinking 2507	Alibaba (Qwen)	$0.051	$0.340	262K	4
Gemma 4 31B IT	Google	$0.100	$0.300	262K	26
Step 3.5 Flash	StepFun	$0.100	$0.300	256K	11
MiMo-V2-Flash	xiaomi	$0.100	$0.300	262K	6
Ministral 3 14B 2512	Mistral	$0.200	$0.200	262K	3
Step 3.5 Flash 2603	StepFun	$0.100	$0.300	256K	2
Mimo-V2-Flash	qiniu-ai	$0.100	$0.300	256K	1
MiMo-V2-Flash	huggingface	$0.100	$0.300	262K	1
Ling-2.6-flash	novita-ai	$0.100	$0.300	262K	1
XiaomiMiMo/MiMo-V2-Flash	novita-ai	$0.100	$0.300	262K	1
Step 3.5 Flash	stepfun-ai	$0.100	$0.300	256K	1
Step 3.5 Flash 2603	stepfun-ai	$0.100	$0.300	256K	1
Ministral 14B	llmgateway	$0.200	$0.200	262K	1
MiMo V2 Flash	meganova	$0.100	$0.300	262K	1
Greg (Roleplay)	crof	$0.100	$0.300	229K	1
Step 3.5 Flash 2603	routing-run	$0.100	$0.300	262K	1
OWL	nano-gpt	$0.100	$0.300	1.05M	1
MiMo V2 Flash Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking) Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking)	xiaomi	$0.102	$0.306	256K	1
DeepSeek V4 Flash	DeepSeek	$0.140	$0.280	1M	31
DeepSeek Chat	DeepSeek	$0.140	$0.280	1M	8
DeepSeek Reasoner	DeepSeek	$0.140	$0.280	1M	5
MiMo V2.5	opencode-go	$0.140	$0.280	1M	1
MiMo-V2.5	llmgateway	$0.140	$0.280	1M	1
Coding Router Low	nano-gpt	$0.140	$0.280	1M	1
Coding Router Medium	nano-gpt	$0.140	$0.280	1M	1
Gemini 2.5 Flash Lite Preview 09-2025	Google	$0.090	$0.360	1.05M	6

顯示前 60 項，共 509 項。用完整目錄進一步篩選。

Frequently asked questions

How many AI models support 200K+ 上下文視窗?

509 canonical models in our database currently support 200K+ 上下文視窗. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 200K+ 上下文視窗?

Ling-2.6-flash from openrouter is currently the lowest-priced option, at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The full table above is sorted price-ascending.

Which model with 200K+ 上下文視窗 has the largest context window?

Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you also need long-document understanding alongside 200K+ 上下文視窗.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is 200K+ 上下文視窗 different from a regular LLM?

Long-context models accept ≥ 200K input tokens — enough for entire books, codebases or hours of transcripts in one prompt. Effective recall and per-token pricing both degrade with input length, so 'big context' is not always the right choice over RAG.

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Top models with this capability

Ling-2.6-flash$0.01 in / $0.03 out
Google Gemma 3 27B Instruct$0.03 in / $0.11 out
Qwen3 235B A22B 2507$0.07 in / $0.10 out
Qwen3.5 9B$0.04 in / $0.15 out
Qwen3 235B A22B Instruct 2507$0.10 in / $0.10 out

Other capabilities

Best-of lists you might also want

Pricing comparisons

Vendors in this list

最近更新： 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.