AIモデルインテリジェンス

機能 · 2026-06-29

ロングコンテキストに対応した AI モデル

200K トークン以上のコンテキストを扱える LLM の比較。

これは何か

  • ロングコンテキスト LLM は一度のプロンプトに 200K トークン以上を受け付けられます —— 書籍一枚、複数ファイルのコードベース、長時間の書き起こしなど。
  • 一部モデルは 1M・2M トークン、さらに大規模まで拡張します。

なぜ重要か

  • RAG の代替または補完になります —— チャンク検索の代わりに全文を貼れる場合があります。
  • 入力が長くなると実効的な想起は低下しやすく、長プロンプトは百万トークン単価で高くなりがちです。
  • 200K 超で段階価格を設けるベンダーもあります —— 詳細は各モデルページの >200K レートを参照してください。

この機能に対応するモデル 509 件

モデルベンダー入力 / 1M出力 / 1Mコンテキストプロバイダー
Ling-2.6-flashopenrouter$0.010$0.030262K1
Google Gemma 3 27B InstructGoogle$0.030$0.110203K10
Qwen3 235B A22B 2507Alibaba (Qwen)$0.071$0.100262K3
Qwen3.5 9BAlibaba (Qwen)$0.040$0.150262K14
Qwen3 235B A22B Instruct 2507Alibaba (Qwen)$0.100$0.100262K16
Qwen3-235B-A22B-Thinking-2507Alibaba (Qwen)$0.100$0.100262K16
Greg 1 Minicrof$0.070$0.150229K1
Qwen3 30B A3B Instruct 2507Alibaba (Qwen)$0.048$0.193262K12
Qwen TurboAlibaba (Qwen)$0.050$0.2001M5
Hy3 previewopenrouter$0.063$0.210262K1
Amazon Nova Lite 1.0nano-gpt$0.059$0.238300K1
Ministral 3 8B 2512Mistral$0.150$0.150262K3
Nova Litevercel$0.060$0.240300K1
Ministral 8Bllmgateway$0.150$0.150262K1
Amazon: Nova Lite 1.0kilo$0.060$0.240300K1
Nova Liteamazon-bedrock$0.060$0.240300K1
Nova Lite 1.0openrouter$0.060$0.240300K1
Laguna XS.2openrouter$0.100$0.200262K1
inclusionAI: Ling-2.6 Flashkilo$0.080$0.240262K1
Ling 2.6 Flashnano-gpt$0.080$0.240262K1
Hy3 previewsiliconflow$0.066$0.260262K1
Tencent: Hy3 Previewkilo$0.066$0.260262K1
Tencent: Hy3 previewnano-gpt$0.066$0.260262K1
GLM-4.7-FlashZ.AI / Zhipu$0.040$0.300200K19
Qwen LongAlibaba (Qwen)$0.072$0.28710M2
Seed 1.6 Flash (250715)llmgateway$0.070$0.300256K1
Gemini 2.0 Flash-LiteGoogle$0.075$0.3001.05M4
ByteDance Seed: Seed 1.6 Flashkilo$0.075$0.300262K1
Seed 1.6 Flashopenrouter$0.075$0.300262K1
Llama 4 ScoutMeta$0.080$0.300328K5
Step 3.5 Flashrouting-run$0.096$0.288262K1
Gemma 4 26B A4B ITGoogle$0.060$0.330262K16
Qwen3 30B A3B Thinking 2507Alibaba (Qwen)$0.051$0.340262K4
Gemma 4 31B ITGoogle$0.100$0.300262K26
Step 3.5 FlashStepFun$0.100$0.300256K11
MiMo-V2-Flashxiaomi$0.100$0.300262K6
Ministral 3 14B 2512Mistral$0.200$0.200262K3
Step 3.5 Flash 2603StepFun$0.100$0.300256K2
Mimo-V2-Flashqiniu-ai$0.100$0.300256K1
MiMo-V2-Flashhuggingface$0.100$0.300262K1
Ling-2.6-flashnovita-ai$0.100$0.300262K1
XiaomiMiMo/MiMo-V2-Flashnovita-ai$0.100$0.300262K1
Step 3.5 Flashstepfun-ai$0.100$0.300256K1
Step 3.5 Flash 2603stepfun-ai$0.100$0.300256K1
Ministral 14Bllmgateway$0.200$0.200262K1
MiMo V2 Flashmeganova$0.100$0.300262K1
Greg (Roleplay)crof$0.100$0.300229K1
Step 3.5 Flash 2603routing-run$0.100$0.300262K1
OWLnano-gpt$0.100$0.3001.05M1
MiMo V2 Flash Originalxiaomi$0.102$0.306256K1
MiMo V2 Flash (Thinking) Originalxiaomi$0.102$0.306256K1
MiMo V2 Flash (Thinking)xiaomi$0.102$0.306256K1
DeepSeek V4 FlashDeepSeek$0.140$0.2801M31
DeepSeek ChatDeepSeek$0.140$0.2801M8
DeepSeek ReasonerDeepSeek$0.140$0.2801M5
MiMo V2.5opencode-go$0.140$0.2801M1
MiMo-V2.5llmgateway$0.140$0.2801M1
Coding Router Lownano-gpt$0.140$0.2801M1
Coding Router Mediumnano-gpt$0.140$0.2801M1
Gemini 2.5 Flash Lite Preview 09-2025Google$0.090$0.3601.05M6

全 509 件中、上位 60 件を表示。 さらに絞り込むには モデル一覧 をご利用ください。

Frequently asked questions

How many AI models support 200K+ コンテキスト?

509 canonical models in our database currently support 200K+ コンテキスト. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 200K+ コンテキスト?

Ling-2.6-flash from openrouter is currently the lowest-priced option, at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The full table above is sorted price-ascending.

Which model with 200K+ コンテキスト has the largest context window?

Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you also need long-document understanding alongside 200K+ コンテキスト.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is 200K+ コンテキスト different from a regular LLM?

Long-context models accept ≥ 200K input tokens — enough for entire books, codebases or hours of transcripts in one prompt. Effective recall and per-token pricing both degrade with input length, so 'big context' is not always the right choice over RAG.

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

最終更新:

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.