AI Model Intelligence

Feature · 2026-05-12

AI Models with Structured Output Support

AI models that support JSON mode / structured output for reliable data extraction.

What is this?

  • Structured output (sometimes called JSON mode or response_format=json_schema) constrains the model to emit a JSON document that matches a schema you provide.
  • Unlike a plain prompt that says "reply in JSON", structured output is enforced at decode time — the model literally cannot emit invalid JSON.

Why it matters

  • It eliminates JSON parsing errors and chatty preambles like "Sure! Here's the JSON: ...".
  • It is essential for any pipeline that pipes LLM output into a typed system: data extraction, classification, structured summarisation.
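As a concrete sketch of the request shape, here is an OpenAI-style structured-output call body; the schema, field names, and the sample reply below are illustrative assumptions, not part of the source data:

```python
import json

# Hypothetical extraction schema -- the field names are illustrative.
invoice_schema = {
    "type": "object",
    "properties": {
        "vendor": {"type": "string"},
        "total_usd": {"type": "number"},
    },
    "required": ["vendor", "total_usd"],
    "additionalProperties": False,
}

# Request body in the OpenAI-style "response_format" shape.
request_body = {
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "Extract the invoice fields."}],
    "response_format": {
        "type": "json_schema",
        "json_schema": {"name": "invoice", "schema": invoice_schema, "strict": True},
    },
}

# With structured output enforced at decode time, the reply body is
# guaranteed to parse -- there is no preamble to strip first.
reply = '{"vendor": "Acme Corp", "total_usd": 41.5}'
parsed = json.loads(reply)
print(parsed["vendor"], parsed["total_usd"])  # → Acme Corp 41.5
```

Because the decoder is constrained by the schema, the `json.loads` call above cannot fail on a well-behaved provider, which is what makes this mode safe to feed into typed downstream code.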

206 models support this feature

| Model | Vendor | Input / 1M | Output / 1M | Context | Providers |
|---|---|---|---|---|---|
| Voxtral Small 24B 2507 | Mistral | $0.002 | $0.002 | 32K | 3 |
| dots.ocr | chutes | $0.010 | $0.011 | 131K | 1 |
| Hermes 4 14B | chutes | $0.014 | $0.054 | 41K | 1 |
| Sao10k L3 8B Lunaris | novita-ai | $0.050 | $0.050 | 8K | 1 |
| Gemma 3 12B | Google | $0.030 | $0.100 | 33K | 10 |
| Gemma 3 27B | Google | $0.027 | $0.109 | 131K | 14 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | $0.027 | $0.109 | 8K | 5 |
| GPT OSS 20B | OpenAI | $0.030 | $0.140 | 131K | 23 |
| GPT OSS 120B | OpenAI | $0.040 | $0.160 | 131K | 33 |
| Qwen/Qwen3-VL-30B-A3B-Thinking | Alibaba (Qwen) | $0.100 | $0.100 | 262K | 6 |
| Qwen/Qwen3-VL-30B-A3B-Instruct | Alibaba (Qwen) | $0.100 | $0.100 | 262K | 6 |
| Qwen/Qwen3-VL-8B-Instruct | Alibaba (Qwen) | $0.100 | $0.100 | 262K | 5 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | DeepSeek | $0.100 | $0.100 | 131K | 4 |
| Ministral 3B | llmgateway | $0.100 | $0.100 | 131K | 1 |
| Mistral Small 3.2 24B Instruct | Mistral | $0.060 | $0.180 | 96K | 3 |
| GPT OSS 20B | databricks | $0.050 | $0.200 | 131K | 1 |
| GPT OSS Safeguard 20B | OpenAI | $0.070 | $0.200 | 128K | 6 |
| Qwen/Qwen2.5-VL-32B-Instruct | Alibaba (Qwen) | $0.050 | $0.220 | 131K | 6 |
| GPT OSS 20B | frogbot | $0.070 | $0.200 | 131K | 1 |
| Hermes 2 Pro Llama 3 8B | Meta | $0.140 | $0.140 | 8K | 4 |
| Qwen 2.5 72B Instruct | Alibaba (Qwen) | $0.062 | $0.231 | 32K | 3 |
| Ministral 8B | llmgateway | $0.150 | $0.150 | 262K | 1 |
| Qwen3 Coder 30B A3B Instruct | Alibaba (Qwen) | $0.070 | $0.270 | 262K | 3 |
| inclusionAI/Ling-mini-2.0 | siliconflow-cn | $0.070 | $0.280 | 131K | 1 |
| inclusionAI/Ling-mini-2.0 | siliconflow | $0.070 | $0.280 | 131K | 1 |
| GPT OSS 120B | databricks | $0.072 | $0.280 | 131K | 1 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | DeepSeek | $0.180 | $0.180 | 131K | 6 |
| Seed 1.6 Flash (250715) | llmgateway | $0.070 | $0.300 | 256K | 1 |
| Gemini 2.0 Flash Lite | Google | $0.075 | $0.300 | 1.05M | 8 |
| MiMo V2 Flash TEE | chutes | $0.090 | $0.290 | 262K | 1 |
| Gemma 4 26B | Google | $0.100 | $0.300 | 256K | 8 |
| Ling-2.6-flash | novita-ai | $0.100 | $0.300 | 262K | 1 |
| XiaomiMiMo/MiMo-V2-Flash | novita-ai | $0.100 | $0.300 | 262K | 1 |
| Ministral 14B | llmgateway | $0.200 | $0.200 | 262K | 1 |
| Llama 3.2 11B Instruct | Meta | $0.070 | $0.330 | 128K | 1 |
| DeepSeek V4 Flash | DeepSeek | $0.140 | $0.280 | 1M | 15 |
| Phi-4-mini-instruct | Microsoft | $0.080 | $0.350 | 128K | 4 |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 | 400K | 17 |
| Gemini 2.5 Flash Lite | Google | $0.100 | $0.400 | 1.05M | 13 |
| GPT-4.1 nano | OpenAI | $0.100 | $0.400 | 1.05M | 12 |
| Gemini 2.5 Flash Lite Preview 09-25 | Google | $0.100 | $0.400 | 1.05M | 9 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.05M | 6 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.05M | 3 |
| Qwen/Qwen3-Omni-30B-A3B-Thinking | Alibaba (Qwen) | $0.100 | $0.400 | 66K | 3 |
| Qwen/Qwen3-Omni-30B-A3B-Instruct | Alibaba (Qwen) | $0.100 | $0.400 | 66K | 3 |
| Qwen3.5 Flash | Alibaba (Qwen) | $0.100 | $0.400 | 1M | 3 |
| Gemini Flash-Lite Latest | Google | $0.100 | $0.400 | 1.05M | 2 |
| Gemma 4 31B | Google | $0.130 | $0.380 | 256K | 11 |
| Qwen/Qwen3-VL-32B-Instruct | Alibaba (Qwen) | $0.104 | $0.416 | 262K | 3 |
| Hermes 4 70B | openrouter | $0.130 | $0.400 | 131K | 1 |
| Hermes-4-70B | nebius | $0.130 | $0.400 | 128K | 1 |
| GPT OSS 20B | llmgateway | $0.100 | $0.500 | 131K | 1 |
| inclusionAI/Ling-flash-2.0 | siliconflow-cn | $0.140 | $0.570 | 131K | 1 |
| inclusionAI/Ring-flash-2.0 | siliconflow-cn | $0.140 | $0.570 | 131K | 1 |
| tencent/Hunyuan-A13B-Instruct | siliconflow-cn | $0.140 | $0.570 | 131K | 1 |
| inclusionAI/Ring-flash-2.0 | siliconflow | $0.140 | $0.570 | 131K | 1 |
| inclusionAI/Ling-flash-2.0 | siliconflow | $0.140 | $0.570 | 131K | 1 |
| tencent/Hunyuan-A13B-Instruct | siliconflow | $0.140 | $0.570 | 131K | 1 |
| GPT-4o mini | OpenAI | $0.150 | $0.600 | 128K | 15 |
| Gemini 2.5 Flash Preview 05-20 | Google | $0.150 | $0.600 | 1.05M | 4 |

Showing the top 60 of 206 models. Use the full list for further filtering.

Frequently asked questions

How many AI models support structured output?

206 canonical models in our database currently support structured output. The list is regenerated on every data refresh, so it always reflects the latest model releases from models.dev.

What is the cheapest model with structured output?

Voxtral Small 24B 2507 from Mistral is currently the lowest-priced option, at $0.002 per 1M input tokens and $0.002 per 1M output tokens. The full table above is sorted price-ascending.
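At those rates, per-request cost is simply tokens × price ÷ 1M. A quick sketch using the Voxtral prices from the table (the token counts are made-up illustration values):

```python
# Voxtral Small 24B 2507 list prices from the table above (USD per 1M tokens).
INPUT_PRICE = 0.002
OUTPUT_PRICE = 0.002

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one request at per-1M-token prices."""
    return (input_tokens / 1_000_000) * INPUT_PRICE + (output_tokens / 1_000_000) * OUTPUT_PRICE

# Example: a 10K-token prompt with a 2K-token reply.
cost = request_cost(10_000, 2_000)
print(f"${cost:.6f}")  # → $0.000024
```

Even a million such requests would cost about $24 at these prices, which is why the table is sorted cheapest-first.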

Which model with structured output has the largest context window?

GPT-5.4 (OpenAI) leads on context at 1.05M tokens. This may matter if you also need long-document understanding alongside structured output.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.5 (45), GPT OSS 120B (33), GLM-5.1 (33).

How is structured output different from a regular LLM?

Structured output (a.k.a. JSON mode / response_format=json_schema) constrains the model at decode time so it cannot emit invalid JSON. This is stricter than just prompting 'reply in JSON' and removes a whole class of parsing errors.
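The difference is easy to demonstrate: a prompt-only reply often carries a conversational preamble that breaks a naive parse, while a decode-constrained reply parses directly. The two reply strings below are fabricated examples:

```python
import json

# What a prompt-only model might return despite being told "reply in JSON".
prompted_reply = 'Sure! Here\'s the JSON: {"label": "spam"}'

# What a decode-constrained (structured output) model returns.
constrained_reply = '{"label": "spam"}'

# The prompted reply fails a direct parse because of the preamble.
try:
    json.loads(prompted_reply)
    prompted_parse_ok = True
except json.JSONDecodeError:
    prompted_parse_ok = False

# The constrained reply is valid JSON by construction.
print(prompted_parse_ok, json.loads(constrained_reply)["label"])  # → False spam
```

Structured output removes the `try/except` recovery logic (regex stripping, retry loops) that pipelines otherwise need around prompt-only JSON.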

How often is this list updated?

Daily. Our data pipeline pulls models.dev once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Last updated:

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.