能力 · 2026-06-29

支持超长上下文的 AI 模型

对比支持 200K tokens 及以上上下文窗口的 AI 模型 —— 长文档与大规模代码场景。

这是什么？

长上下文 LLM 可一次接受 200K tokens 或更长的输入 —— 足以装下整本书、多文件代码库或数小时转写稿。
部分模型可扩展到 1M、2M 甚至 10M tokens 的上下文。

为什么重要

长上下文是 RAG 的替代或补充 —— 你可以直接粘贴全部内容，而不只检索片段。
注意：有效召回会随输入变长而下降，且按百万 token 计价会让长提示很贵。
部分厂商在 200K 以上有阶梯价 —— 详见各模型详情页的 >200K 费率。

509 个模型支持此能力

模型	厂商	输入 / 1M	输出 / 1M	上下文	服务商
Ling-2.6-flash	openrouter	$0.010	$0.030	262K	1
Google Gemma 3 27B Instruct	Google	$0.030	$0.110	203K	10
Qwen3 235B A22B 2507	Alibaba (Qwen)	$0.071	$0.100	262K	3
Qwen3.5 9B	Alibaba (Qwen)	$0.040	$0.150	262K	14
Qwen3 235B A22B Instruct 2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Qwen3-235B-A22B-Thinking-2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Greg 1 Mini	crof	$0.070	$0.150	229K	1
Qwen3 30B A3B Instruct 2507	Alibaba (Qwen)	$0.048	$0.193	262K	12
Qwen Turbo	Alibaba (Qwen)	$0.050	$0.200	1M	5
Hy3 preview	openrouter	$0.063	$0.210	262K	1
Amazon Nova Lite 1.0	nano-gpt	$0.059	$0.238	300K	1
Ministral 3 8B 2512	Mistral	$0.150	$0.150	262K	3
Nova Lite	vercel	$0.060	$0.240	300K	1
Ministral 8B	llmgateway	$0.150	$0.150	262K	1
Amazon: Nova Lite 1.0	kilo	$0.060	$0.240	300K	1
Nova Lite	amazon-bedrock	$0.060	$0.240	300K	1
Nova Lite 1.0	openrouter	$0.060	$0.240	300K	1
Laguna XS.2	openrouter	$0.100	$0.200	262K	1
inclusionAI: Ling-2.6 Flash	kilo	$0.080	$0.240	262K	1
Ling 2.6 Flash	nano-gpt	$0.080	$0.240	262K	1
Hy3 preview	siliconflow	$0.066	$0.260	262K	1
Tencent: Hy3 Preview	kilo	$0.066	$0.260	262K	1
Tencent: Hy3 preview	nano-gpt	$0.066	$0.260	262K	1
GLM-4.7-Flash	Z.AI / Zhipu	$0.040	$0.300	200K	19
Qwen Long	Alibaba (Qwen)	$0.072	$0.287	10M	2
Seed 1.6 Flash (250715)	llmgateway	$0.070	$0.300	256K	1
Gemini 2.0 Flash-Lite	Google	$0.075	$0.300	1.05M	4
ByteDance Seed: Seed 1.6 Flash	kilo	$0.075	$0.300	262K	1
Seed 1.6 Flash	openrouter	$0.075	$0.300	262K	1
Llama 4 Scout	Meta	$0.080	$0.300	328K	5
Step 3.5 Flash	routing-run	$0.096	$0.288	262K	1
Gemma 4 26B A4B IT	Google	$0.060	$0.330	262K	16
Qwen3 30B A3B Thinking 2507	Alibaba (Qwen)	$0.051	$0.340	262K	4
Gemma 4 31B IT	Google	$0.100	$0.300	262K	26
Step 3.5 Flash	StepFun	$0.100	$0.300	256K	11
MiMo-V2-Flash	xiaomi	$0.100	$0.300	262K	6
Ministral 3 14B 2512	Mistral	$0.200	$0.200	262K	3
Step 3.5 Flash 2603	StepFun	$0.100	$0.300	256K	2
Mimo-V2-Flash	qiniu-ai	$0.100	$0.300	256K	1
MiMo-V2-Flash	huggingface	$0.100	$0.300	262K	1
Ling-2.6-flash	novita-ai	$0.100	$0.300	262K	1
XiaomiMiMo/MiMo-V2-Flash	novita-ai	$0.100	$0.300	262K	1
Step 3.5 Flash	stepfun-ai	$0.100	$0.300	256K	1
Step 3.5 Flash 2603	stepfun-ai	$0.100	$0.300	256K	1
Ministral 14B	llmgateway	$0.200	$0.200	262K	1
MiMo V2 Flash	meganova	$0.100	$0.300	262K	1
Greg (Roleplay)	crof	$0.100	$0.300	229K	1
Step 3.5 Flash 2603	routing-run	$0.100	$0.300	262K	1
OWL	nano-gpt	$0.100	$0.300	1.05M	1
MiMo V2 Flash Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking) Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking)	xiaomi	$0.102	$0.306	256K	1
DeepSeek V4 Flash	DeepSeek	$0.140	$0.280	1M	31
DeepSeek Chat	DeepSeek	$0.140	$0.280	1M	8
DeepSeek Reasoner	DeepSeek	$0.140	$0.280	1M	5
MiMo V2.5	opencode-go	$0.140	$0.280	1M	1
MiMo-V2.5	llmgateway	$0.140	$0.280	1M	1
Coding Router Low	nano-gpt	$0.140	$0.280	1M	1
Coding Router Medium	nano-gpt	$0.140	$0.280	1M	1
Gemini 2.5 Flash Lite Preview 09-2025	Google	$0.090	$0.360	1.05M	6

显示前 60 / 共 509 项。用完整目录进一步筛选。

Frequently asked questions

How many AI models support 200K+ 上下文窗口?

509 canonical models in our database currently support 200K+ 上下文窗口. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 200K+ 上下文窗口?

Ling-2.6-flash from openrouter is currently the lowest-priced option, at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The full table above is sorted price-ascending.

Which model with 200K+ 上下文窗口 has the largest context window?

Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you also need long-document understanding alongside 200K+ 上下文窗口.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is 200K+ 上下文窗口 different from a regular LLM?

Long-context models accept ≥ 200K input tokens — enough for entire books, codebases or hours of transcripts in one prompt. Effective recall and per-token pricing both degrade with input length, so 'big context' is not always the right choice over RAG.

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Top models with this capability

Ling-2.6-flash$0.01 in / $0.03 out
Google Gemma 3 27B Instruct$0.03 in / $0.11 out
Qwen3 235B A22B 2507$0.07 in / $0.10 out
Qwen3.5 9B$0.04 in / $0.15 out
Qwen3 235B A22B Instruct 2507$0.10 in / $0.10 out

Other capabilities

Best-of lists you might also want

Pricing comparisons

Vendors in this list

最近更新： 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.