Capability · 2026-06-29

Long-context AI models

Models with a context window of 200K tokens or more.

What is it?

Long-context LLMs accept 200K tokens or more in a single prompt: entire books, multi-file repositories, or hours of transcripts.
Some models go up to 1M, 2M, or even more tokens of context.
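
As a rough illustration of what "fits in a single prompt" means, the sketch below estimates a text's token count and checks it against a context window. The 4-characters-per-token ratio is an assumed rule of thumb for English prose, not a property of any particular tokenizer, and `estimate_tokens`, `fits_in_context` and `reserve_for_output` are hypothetical names introduced here. It also assumes the window is shared between input and output.

```python
import math

# Assumption: about 4 characters per token for English prose.
# Real tokenizers differ by model and by language.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Return a rough token estimate based on character count."""
    return math.ceil(len(text) / chars_per_token)


def fits_in_context(text: str, context_window: int, reserve_for_output: int = 4_000) -> bool:
    """Check whether the prompt plus a reserved output budget fits in the window."""
    return estimate_tokens(text) + reserve_for_output <= context_window


if __name__ == "__main__":
    book = "x" * 600_000  # stand-in for a book of about 600,000 characters
    print(estimate_tokens(book))           # 150000
    print(fits_in_context(book, 200_000))  # True
    print(fits_in_context(book, 128_000))  # False
```

For anything beyond a quick estimate, count tokens with the tokenizer of the model you plan to call.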

Why it matters

Long context complements or replaces RAG: you can paste in the entire content instead of retrieving only fragments.
Effective recall can degrade with length, and long prompts become expensive at per-million-token prices.
Some providers apply tiered pricing beyond 200K; check each model's detail page.
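
To make the pricing point concrete, here is a minimal sketch of a per-request cost estimate. The base prices are the DeepSeek V4 Flash figures from the table on this page ($0.140 input, $0.280 output per 1M tokens). The tier rule is an assumption made for illustration: the 2x multiplier is hypothetical, and the sketch assumes the whole request is billed at the higher rate once input exceeds the threshold. Providers that use tiers may bill differently. `request_cost` is a hypothetical helper name.

```python
def request_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
    tier_threshold: int = 200_000,
    tier_multiplier: float = 1.0,
) -> float:
    """Estimate the USD cost of one request at per-1M-token prices.

    Assumption: when input_tokens exceeds tier_threshold, the whole
    request is billed at tier_multiplier times the base price.
    """
    multiplier = tier_multiplier if input_tokens > tier_threshold else 1.0
    base = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return base * multiplier / 1_000_000


if __name__ == "__main__":
    # 500K input tokens and 2K output tokens, no tiering.
    flat = request_cost(500_000, 2_000, 0.140, 0.280)
    # Same request with a hypothetical 2x tier above 200K input tokens.
    tiered = request_cost(500_000, 2_000, 0.140, 0.280, tier_multiplier=2.0)
    print(f"${flat:.4f}")    # $0.0706
    print(f"${tiered:.4f}")  # $0.1411
```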

509 models with this capability

Model	Publisher	Input / 1M	Output / 1M	Context	Providers
Ling-2.6-flash	openrouter	$0.010	$0.030	262K	1
Google Gemma 3 27B Instruct	Google	$0.030	$0.110	203K	10
Qwen3 235B A22B 2507	Alibaba (Qwen)	$0.071	$0.100	262K	3
Qwen3.5 9B	Alibaba (Qwen)	$0.040	$0.150	262K	14
Qwen3 235B A22B Instruct 2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Qwen3-235B-A22B-Thinking-2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Greg 1 Mini	crof	$0.070	$0.150	229K	1
Qwen3 30B A3B Instruct 2507	Alibaba (Qwen)	$0.048	$0.193	262K	12
Qwen Turbo	Alibaba (Qwen)	$0.050	$0.200	1M	5
Hy3 preview	openrouter	$0.063	$0.210	262K	1
Amazon Nova Lite 1.0	nano-gpt	$0.059	$0.238	300K	1
Ministral 3 8B 2512	Mistral	$0.150	$0.150	262K	3
Nova Lite	vercel	$0.060	$0.240	300K	1
Ministral 8B	llmgateway	$0.150	$0.150	262K	1
Amazon: Nova Lite 1.0	kilo	$0.060	$0.240	300K	1
Nova Lite	amazon-bedrock	$0.060	$0.240	300K	1
Nova Lite 1.0	openrouter	$0.060	$0.240	300K	1
Laguna XS.2	openrouter	$0.100	$0.200	262K	1
inclusionAI: Ling-2.6 Flash	kilo	$0.080	$0.240	262K	1
Ling 2.6 Flash	nano-gpt	$0.080	$0.240	262K	1
Hy3 preview	siliconflow	$0.066	$0.260	262K	1
Tencent: Hy3 Preview	kilo	$0.066	$0.260	262K	1
Tencent: Hy3 preview	nano-gpt	$0.066	$0.260	262K	1
GLM-4.7-Flash	Z.AI / Zhipu	$0.040	$0.300	200K	19
Qwen Long	Alibaba (Qwen)	$0.072	$0.287	10M	2
Seed 1.6 Flash (250715)	llmgateway	$0.070	$0.300	256K	1
Gemini 2.0 Flash-Lite	Google	$0.075	$0.300	1.05M	4
ByteDance Seed: Seed 1.6 Flash	kilo	$0.075	$0.300	262K	1
Seed 1.6 Flash	openrouter	$0.075	$0.300	262K	1
Llama 4 Scout	Meta	$0.080	$0.300	328K	5
Step 3.5 Flash	routing-run	$0.096	$0.288	262K	1
Gemma 4 26B A4B IT	Google	$0.060	$0.330	262K	16
Qwen3 30B A3B Thinking 2507	Alibaba (Qwen)	$0.051	$0.340	262K	4
Gemma 4 31B IT	Google	$0.100	$0.300	262K	26
Step 3.5 Flash	StepFun	$0.100	$0.300	256K	11
MiMo-V2-Flash	xiaomi	$0.100	$0.300	262K	6
Ministral 3 14B 2512	Mistral	$0.200	$0.200	262K	3
Step 3.5 Flash 2603	StepFun	$0.100	$0.300	256K	2
Mimo-V2-Flash	qiniu-ai	$0.100	$0.300	256K	1
MiMo-V2-Flash	huggingface	$0.100	$0.300	262K	1
Ling-2.6-flash	novita-ai	$0.100	$0.300	262K	1
XiaomiMiMo/MiMo-V2-Flash	novita-ai	$0.100	$0.300	262K	1
Step 3.5 Flash	stepfun-ai	$0.100	$0.300	256K	1
Step 3.5 Flash 2603	stepfun-ai	$0.100	$0.300	256K	1
Ministral 14B	llmgateway	$0.200	$0.200	262K	1
MiMo V2 Flash	meganova	$0.100	$0.300	262K	1
Greg (Roleplay)	crof	$0.100	$0.300	229K	1
Step 3.5 Flash 2603	routing-run	$0.100	$0.300	262K	1
OWL	nano-gpt	$0.100	$0.300	1.05M	1
MiMo V2 Flash Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking) Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking)	xiaomi	$0.102	$0.306	256K	1
DeepSeek V4 Flash	DeepSeek	$0.140	$0.280	1M	31
DeepSeek Chat	DeepSeek	$0.140	$0.280	1M	8
DeepSeek Reasoner	DeepSeek	$0.140	$0.280	1M	5
MiMo V2.5	opencode-go	$0.140	$0.280	1M	1
MiMo-V2.5	llmgateway	$0.140	$0.280	1M	1
Coding Router Low	nano-gpt	$0.140	$0.280	1M	1
Coding Router Medium	nano-gpt	$0.140	$0.280	1M	1
Gemini 2.5 Flash Lite Preview 09-2025	Google	$0.090	$0.360	1.05M	6

Showing the top 60 of 509. Use the full directory to filter further.
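
If you copy rows from the table into a script, a short parser makes this kind of filtering easy to reproduce. The sketch below assumes tab-separated rows in the column order shown above and treats "K" as 1,000 and "M" as 1,000,000 tokens, which is an assumption about how the page rounds context sizes. `ModelRow`, `parse_row` and `filter_models` are hypothetical names. The sort key (input plus output price) matches the order of the rows shown here, but that is an observation from the listed rows, not a documented rule.

```python
from dataclasses import dataclass


@dataclass
class ModelRow:
    name: str
    publisher: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    context_tokens: int
    providers: int


def parse_context(value: str) -> int:
    """Convert '262K', '1.05M' or '10M' to a token count (K=1e3, M=1e6 assumed)."""
    units = {"K": 1_000, "M": 1_000_000}
    return round(float(value[:-1]) * units[value[-1]])


def parse_row(line: str) -> ModelRow:
    """Parse one tab-separated table row."""
    name, publisher, inp, out, ctx, providers = line.split("\t")
    return ModelRow(name, publisher, float(inp.lstrip("$")),
                    float(out.lstrip("$")), parse_context(ctx), int(providers))


def filter_models(rows: list, min_context: int = 200_000, min_providers: int = 1) -> list:
    """Keep rows meeting both minimums, cheapest combined price first."""
    kept = [r for r in rows
            if r.context_tokens >= min_context and r.providers >= min_providers]
    return sorted(kept, key=lambda r: r.input_price + r.output_price)


# A few rows copied from the table above.
ROWS = """\
Ling-2.6-flash\topenrouter\t$0.010\t$0.030\t262K\t1
Qwen Turbo\tAlibaba (Qwen)\t$0.050\t$0.200\t1M\t5
Qwen Long\tAlibaba (Qwen)\t$0.072\t$0.287\t10M\t2
Gemini 2.0 Flash-Lite\tGoogle\t$0.075\t$0.300\t1.05M\t4
DeepSeek V4 Flash\tDeepSeek\t$0.140\t$0.280\t1M\t31
"""

if __name__ == "__main__":
    rows = [parse_row(line) for line in ROWS.splitlines()]
    for m in filter_models(rows, min_context=1_000_000, min_providers=4):
        print(m.name, m.context_tokens, m.providers)
```

With these five rows, asking for at least 1M tokens of context and at least 4 providers keeps Qwen Turbo, Gemini 2.0 Flash-Lite and DeepSeek V4 Flash, in that order.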

Frequently asked questions

How many AI models support 200K+ context?

509 canonical models in our database currently support 200K+ context. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 200K+ context?

Ling-2.6-flash from openrouter is currently the lowest-priced option, at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The full table above is sorted price-ascending.

Which model with 200K+ context has the largest context window?

Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you need to process very long documents in a single prompt.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is a 200K+ context model different from a regular LLM?

Long-context models accept ≥ 200K input tokens, enough for entire books, codebases or hours of transcripts in one prompt. Effective recall can degrade as input grows, and cost rises with every token sent, so "big context" is not always the right choice over RAG.
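
A back-of-the-envelope comparison shows why. The sketch below compares input cost for sending a whole corpus on every query against sending only retrieved fragments. All workload numbers (a 300K-token corpus, 4K retrieved tokens, 100 queries) are hypothetical. The price is the $0.140 per 1M input tokens listed for DeepSeek V4 Flash above. It ignores output tokens, prompt caching, tiered pricing and the cost of running retrieval, so treat it as an illustration rather than a benchmark.

```python
def total_input_cost(tokens_per_query: int, queries: int, price_per_m: float) -> float:
    """USD input cost for a number of queries at a per-1M-token price."""
    return tokens_per_query * queries * price_per_m / 1_000_000


if __name__ == "__main__":
    PRICE = 0.140          # USD per 1M input tokens (from the table above)
    QUERIES = 100          # hypothetical workload
    CORPUS_TOKENS = 300_000     # hypothetical: whole corpus pasted each time
    RETRIEVED_TOKENS = 4_000    # hypothetical: fragments returned by RAG

    long_ctx = total_input_cost(CORPUS_TOKENS, QUERIES, PRICE)
    rag = total_input_cost(RETRIEVED_TOKENS, QUERIES, PRICE)

    print(f"long context: ${long_ctx:.3f}")  # long context: $4.200
    print(f"RAG:          ${rag:.3f}")       # RAG:          $0.056
    print(f"ratio:        {long_ctx / rag:.0f}x")  # ratio:        75x
```

The ratio is simply the corpus size divided by the retrieved size, so the gap widens as the corpus grows. Whether the extra spend buys better answers depends on the task.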

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Top models with this capability

Ling-2.6-flash: $0.01 in / $0.03 out
Google Gemma 3 27B Instruct: $0.03 in / $0.11 out
Qwen3 235B A22B 2507: $0.07 in / $0.10 out
Qwen3.5 9B: $0.04 in / $0.15 out
Qwen3 235B A22B Instruct 2507: $0.10 in / $0.10 out

Last updated: 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.