기능 · 2026-06-29

롱 컨텍스트를 지원하는 AI 모델

200K 토큰 이상의 컨텍스트 윈도우를 가진 모델 비교.

이게 뭔가요?

롱 컨텍스트 LLM은 단일 프롬프트에 200K 토큰 이상을 받을 수 있습니다 — 책 한 권, 멀티파일 저장소, 수 시간 분량의 트랜스크립트 등.
일부 모델은 1M, 2M 토큰 이상으로 확장됩니다.

왜 중요한가

롱 컨텍스트는 RAG를 보완하거나 대체합니다 — 검색된 조각만이 아닌 전체 콘텐츠를 붙여넣을 수 있습니다.
유효 리콜은 길이에 따라 저하될 수 있으며, 긴 프롬프트는 백만 토큰당 가격으로 비용이 높아집니다.
일부 제공사는 200K 이상에 단계별 요금을 적용합니다 — 각 모델 상세 페이지를 확인하세요.

이 기능을 지원하는 모델 509개

모델	벤더	입력 / 1M	출력 / 1M	컨텍스트	제공자
Ling-2.6-flash	openrouter	$0.010	$0.030	262K	1
Google Gemma 3 27B Instruct	Google	$0.030	$0.110	203K	10
Qwen3 235B A22B 2507	Alibaba (Qwen)	$0.071	$0.100	262K	3
Qwen3.5 9B	Alibaba (Qwen)	$0.040	$0.150	262K	14
Qwen3 235B A22B Instruct 2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Qwen3-235B-A22B-Thinking-2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Greg 1 Mini	crof	$0.070	$0.150	229K	1
Qwen3 30B A3B Instruct 2507	Alibaba (Qwen)	$0.048	$0.193	262K	12
Qwen Turbo	Alibaba (Qwen)	$0.050	$0.200	1M	5
Hy3 preview	openrouter	$0.063	$0.210	262K	1
Amazon Nova Lite 1.0	nano-gpt	$0.059	$0.238	300K	1
Ministral 3 8B 2512	Mistral	$0.150	$0.150	262K	3
Nova Lite	vercel	$0.060	$0.240	300K	1
Ministral 8B	llmgateway	$0.150	$0.150	262K	1
Amazon: Nova Lite 1.0	kilo	$0.060	$0.240	300K	1
Nova Lite	amazon-bedrock	$0.060	$0.240	300K	1
Nova Lite 1.0	openrouter	$0.060	$0.240	300K	1
Laguna XS.2	openrouter	$0.100	$0.200	262K	1
inclusionAI: Ling-2.6 Flash	kilo	$0.080	$0.240	262K	1
Ling 2.6 Flash	nano-gpt	$0.080	$0.240	262K	1
Hy3 preview	siliconflow	$0.066	$0.260	262K	1
Tencent: Hy3 Preview	kilo	$0.066	$0.260	262K	1
Tencent: Hy3 preview	nano-gpt	$0.066	$0.260	262K	1
GLM-4.7-Flash	Z.AI / Zhipu	$0.040	$0.300	200K	19
Qwen Long	Alibaba (Qwen)	$0.072	$0.287	10M	2
Seed 1.6 Flash (250715)	llmgateway	$0.070	$0.300	256K	1
Gemini 2.0 Flash-Lite	Google	$0.075	$0.300	1.05M	4
ByteDance Seed: Seed 1.6 Flash	kilo	$0.075	$0.300	262K	1
Seed 1.6 Flash	openrouter	$0.075	$0.300	262K	1
Llama 4 Scout	Meta	$0.080	$0.300	328K	5
Step 3.5 Flash	routing-run	$0.096	$0.288	262K	1
Gemma 4 26B A4B IT	Google	$0.060	$0.330	262K	16
Qwen3 30B A3B Thinking 2507	Alibaba (Qwen)	$0.051	$0.340	262K	4
Gemma 4 31B IT	Google	$0.100	$0.300	262K	26
Step 3.5 Flash	StepFun	$0.100	$0.300	256K	11
MiMo-V2-Flash	xiaomi	$0.100	$0.300	262K	6
Ministral 3 14B 2512	Mistral	$0.200	$0.200	262K	3
Step 3.5 Flash 2603	StepFun	$0.100	$0.300	256K	2
Mimo-V2-Flash	qiniu-ai	$0.100	$0.300	256K	1
MiMo-V2-Flash	huggingface	$0.100	$0.300	262K	1
Ling-2.6-flash	novita-ai	$0.100	$0.300	262K	1
XiaomiMiMo/MiMo-V2-Flash	novita-ai	$0.100	$0.300	262K	1
Step 3.5 Flash	stepfun-ai	$0.100	$0.300	256K	1
Step 3.5 Flash 2603	stepfun-ai	$0.100	$0.300	256K	1
Ministral 14B	llmgateway	$0.200	$0.200	262K	1
MiMo V2 Flash	meganova	$0.100	$0.300	262K	1
Greg (Roleplay)	crof	$0.100	$0.300	229K	1
Step 3.5 Flash 2603	routing-run	$0.100	$0.300	262K	1
OWL	nano-gpt	$0.100	$0.300	1.05M	1
MiMo V2 Flash Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking) Original	xiaomi	$0.102	$0.306	256K	1
MiMo V2 Flash (Thinking)	xiaomi	$0.102	$0.306	256K	1
DeepSeek V4 Flash	DeepSeek	$0.140	$0.280	1M	31
DeepSeek Chat	DeepSeek	$0.140	$0.280	1M	8
DeepSeek Reasoner	DeepSeek	$0.140	$0.280	1M	5
MiMo V2.5	opencode-go	$0.140	$0.280	1M	1
MiMo-V2.5	llmgateway	$0.140	$0.280	1M	1
Coding Router Low	nano-gpt	$0.140	$0.280	1M	1
Coding Router Medium	nano-gpt	$0.140	$0.280	1M	1
Gemini 2.5 Flash Lite Preview 09-2025	Google	$0.090	$0.360	1.05M	6

전체 509개 중 상위 60개 표시. 추가 필터링은 전체 목록을 이용하세요.

Frequently asked questions

How many AI models support 200K+ 컨텍스트?

509 canonical models in our database currently support 200K+ 컨텍스트. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 200K+ 컨텍스트?

Ling-2.6-flash from openrouter is currently the lowest-priced option, at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The full table above is sorted price-ascending.

Which model with 200K+ 컨텍스트 has the largest context window?

Qwen Long (Alibaba (Qwen)) leads on context at 10M tokens. This may matter if you also need long-document understanding alongside 200K+ 컨텍스트.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is 200K+ 컨텍스트 different from a regular LLM?

Long-context models accept ≥ 200K input tokens — enough for entire books, codebases or hours of transcripts in one prompt. Effective recall and per-token pricing both degrade with input length, so 'big context' is not always the right choice over RAG.

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Top models with this capability

Ling-2.6-flash$0.01 in / $0.03 out
Google Gemma 3 27B Instruct$0.03 in / $0.11 out
Qwen3 235B A22B 2507$0.07 in / $0.10 out
Qwen3.5 9B$0.04 in / $0.15 out
Qwen3 235B A22B Instruct 2507$0.10 in / $0.10 out

Other capabilities

Best-of lists you might also want

Pricing comparisons

Vendors in this list

마지막 업데이트: 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.