AI Model Information

Capability · 2026-05-12

AI Models with Structured Output Support

AI models that support JSON mode / structured output for reliable data extraction.

What is this?

  • Structured output (sometimes called JSON mode or response_format=json_schema) constrains the model to emit a JSON document that matches a schema you provide.
  • Unlike a plain prompt that says "reply in JSON", structured output is enforced at decode time — the model literally cannot emit invalid JSON.
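With an OpenAI-compatible API, structured output is typically requested through the `response_format` field. A minimal sketch of such a request payload is shown below; the model name, schema, and field names are illustrative assumptions, not values from any particular provider's documentation:

```python
# JSON Schema the model's output must conform to (illustrative example).
contact_schema = {
    "name": "contact",
    "strict": True,
    "schema": {
        "type": "object",
        "properties": {
            "name": {"type": "string"},
            "email": {"type": "string"},
        },
        "required": ["name", "email"],
        "additionalProperties": False,
    },
}

# Request body for an OpenAI-style /chat/completions endpoint.
payload = {
    "model": "gpt-4o-mini",  # any structured-output-capable model
    "messages": [
        {"role": "user",
         "content": "Extract the contact from: John <john@example.com>"}
    ],
    # Decode-time enforcement: the response must match contact_schema.
    "response_format": {"type": "json_schema", "json_schema": contact_schema},
}
```

With `"strict": True`, a conforming backend guarantees the returned message content parses as JSON matching the schema, so no retry-on-parse-failure logic is needed.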

Why it matters

  • It eliminates JSON parsing errors and chatty preambles like "Sure! Here's the JSON: ...".
  • It is essential for any pipeline that pipes LLM output into a typed system: data extraction, classification, structured summarisation.
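As a concrete illustration of both points (hypothetical strings, standard library only): a chatty, unconstrained reply breaks `json.loads`, while a schema-constrained reply parses straight into a typed record:

```python
import json
from dataclasses import dataclass

@dataclass
class Invoice:
    vendor: str
    total: float

# What a chatty, unconstrained model might return:
prefixed = 'Sure! Here\'s the JSON: {"vendor": "Acme", "total": 12.5}'

# What a schema-constrained model is forced to return:
strict = '{"vendor": "Acme", "total": 12.5}'

def parse_invoice(text: str) -> Invoice:
    """Pipe raw model output into a typed record."""
    return Invoice(**json.loads(text))

try:
    parse_invoice(prefixed)
except json.JSONDecodeError:
    print("prefixed reply: parse failed")  # this branch runs

invoice = parse_invoice(strict)
print(invoice)  # Invoice(vendor='Acme', total=12.5)
```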

206 models support this capability

| Model | Vendor | Input / 1M | Output / 1M | Context | Providers |
|---|---|---|---|---|---|
| Voxtral Small 24B 2507 | Mistral | $0.002 | $0.002 | 32K | 3 |
| dots.ocr | chutes | $0.010 | $0.011 | 131K | 1 |
| Hermes 4 14B | chutes | $0.014 | $0.054 | 41K | 1 |
| Sao10k L3 8B Lunaris | novita-ai | $0.050 | $0.050 | 8K | 1 |
| Gemma 3 12B | Google | $0.030 | $0.100 | 33K | 10 |
| Gemma 3 27B | Google | $0.027 | $0.109 | 131K | 14 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | $0.027 | $0.109 | 8K | 5 |
| GPT OSS 20B | OpenAI | $0.030 | $0.140 | 131K | 23 |
| GPT OSS 120B | OpenAI | $0.040 | $0.160 | 131K | 33 |
| Qwen/Qwen3-VL-30B-A3B-Thinking | Alibaba (Qwen) | $0.100 | $0.100 | 262K | 6 |
| Qwen/Qwen3-VL-30B-A3B-Instruct | Alibaba (Qwen) | $0.100 | $0.100 | 262K | 6 |
| Qwen/Qwen3-VL-8B-Instruct | Alibaba (Qwen) | $0.100 | $0.100 | 262K | 5 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | DeepSeek | $0.100 | $0.100 | 131K | 4 |
| Ministral 3B | llmgateway | $0.100 | $0.100 | 131K | 1 |
| Mistral Small 3.2 24B Instruct | Mistral | $0.060 | $0.180 | 96K | 3 |
| GPT OSS 20B | databricks | $0.050 | $0.200 | 131K | 1 |
| GPT OSS Safeguard 20B | OpenAI | $0.070 | $0.200 | 128K | 6 |
| Qwen/Qwen2.5-VL-32B-Instruct | Alibaba (Qwen) | $0.050 | $0.220 | 131K | 6 |
| GPT OSS 20B | frogbot | $0.070 | $0.200 | 131K | 1 |
| Hermes 2 Pro Llama 3 8B | Meta | $0.140 | $0.140 | 8K | 4 |
| Qwen 2.5 72B Instruct | Alibaba (Qwen) | $0.062 | $0.231 | 32K | 3 |
| Ministral 8B | llmgateway | $0.150 | $0.150 | 262K | 1 |
| Qwen3 Coder 30B A3B Instruct | Alibaba (Qwen) | $0.070 | $0.270 | 262K | 3 |
| inclusionAI/Ling-mini-2.0 | siliconflow-cn | $0.070 | $0.280 | 131K | 1 |
| inclusionAI/Ling-mini-2.0 | siliconflow | $0.070 | $0.280 | 131K | 1 |
| GPT OSS 120B | databricks | $0.072 | $0.280 | 131K | 1 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | DeepSeek | $0.180 | $0.180 | 131K | 6 |
| Seed 1.6 Flash (250715) | llmgateway | $0.070 | $0.300 | 256K | 1 |
| Gemini 2.0 Flash Lite | Google | $0.075 | $0.300 | 1.05M | 8 |
| MiMo V2 Flash TEE | chutes | $0.090 | $0.290 | 262K | 1 |
| Gemma 4 26B | Google | $0.100 | $0.300 | 256K | 8 |
| Ling-2.6-flash | novita-ai | $0.100 | $0.300 | 262K | 1 |
| XiaomiMiMo/MiMo-V2-Flash | novita-ai | $0.100 | $0.300 | 262K | 1 |
| Ministral 14B | llmgateway | $0.200 | $0.200 | 262K | 1 |
| Llama 3.2 11B Instruct | Meta | $0.070 | $0.330 | 128K | 1 |
| DeepSeek V4 Flash | DeepSeek | $0.140 | $0.280 | 1M | 15 |
| Phi-4-mini-instruct | Microsoft | $0.080 | $0.350 | 128K | 4 |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 | 400K | 17 |
| Gemini 2.5 Flash Lite | Google | $0.100 | $0.400 | 1.05M | 13 |
| GPT-4.1 nano | OpenAI | $0.100 | $0.400 | 1.05M | 12 |
| Gemini 2.5 Flash Lite Preview 09-25 | Google | $0.100 | $0.400 | 1.05M | 9 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.05M | 6 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.05M | 3 |
| Qwen/Qwen3-Omni-30B-A3B-Thinking | Alibaba (Qwen) | $0.100 | $0.400 | 66K | 3 |
| Qwen/Qwen3-Omni-30B-A3B-Instruct | Alibaba (Qwen) | $0.100 | $0.400 | 66K | 3 |
| Qwen3.5 Flash | Alibaba (Qwen) | $0.100 | $0.400 | 1M | 3 |
| Gemini Flash-Lite Latest | Google | $0.100 | $0.400 | 1.05M | 2 |
| Gemma 4 31B | Google | $0.130 | $0.380 | 256K | 11 |
| Qwen/Qwen3-VL-32B-Instruct | Alibaba (Qwen) | $0.104 | $0.416 | 262K | 3 |
| Hermes 4 70B | openrouter | $0.130 | $0.400 | 131K | 1 |
| Hermes-4-70B | nebius | $0.130 | $0.400 | 128K | 1 |
| GPT OSS 20B | llmgateway | $0.100 | $0.500 | 131K | 1 |
| inclusionAI/Ling-flash-2.0 | siliconflow-cn | $0.140 | $0.570 | 131K | 1 |
| inclusionAI/Ring-flash-2.0 | siliconflow-cn | $0.140 | $0.570 | 131K | 1 |
| tencent/Hunyuan-A13B-Instruct | siliconflow-cn | $0.140 | $0.570 | 131K | 1 |
| inclusionAI/Ring-flash-2.0 | siliconflow | $0.140 | $0.570 | 131K | 1 |
| inclusionAI/Ling-flash-2.0 | siliconflow | $0.140 | $0.570 | 131K | 1 |
| tencent/Hunyuan-A13B-Instruct | siliconflow | $0.140 | $0.570 | 131K | 1 |
| GPT-4o mini | OpenAI | $0.150 | $0.600 | 128K | 15 |
| Gemini 2.5 Flash Preview 05-20 | Google | $0.150 | $0.600 | 1.05M | 4 |

Showing the first 60 of 206 entries. See the full catalog for further filtering.

Frequently asked questions

How many AI models support structured output?

206 canonical models in our database currently support structured output. The list is regenerated on every data refresh, so it always reflects the latest model releases from models.dev.

What is the cheapest model with structured output?

Voxtral Small 24B 2507 from Mistral is currently the lowest-priced option, at $0.002 per 1M input tokens and $0.002 per 1M output tokens. The full table above is sorted by price, ascending.
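Since prices are quoted per 1M tokens, estimating a job's cost is simple arithmetic. A small helper, using the Voxtral Small figures above and a hypothetical 500-document extraction job:

```python
def cost_usd(input_tokens: int, output_tokens: int,
             in_per_m: float, out_per_m: float) -> float:
    """Total cost in USD given per-1M-token input and output prices."""
    return (input_tokens * in_per_m + output_tokens * out_per_m) / 1_000_000

# Voxtral Small 24B 2507: $0.002 input / $0.002 output per 1M tokens.
# Assume 500 documents of ~2,000 input and ~300 output tokens each.
total = cost_usd(500 * 2_000, 500 * 300, 0.002, 0.002)
print(f"${total:.4f}")  # $0.0023
```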

Which model with structured output has the largest context window?

GPT-5.4 (OpenAI) leads on context at 1.05M tokens. This may matter if you also need long-document understanding alongside structured output.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.5 (45), GPT OSS 120B (33), GLM-5.1 (33).

How is structured output different from ordinary LLM output?

Structured output (a.k.a. JSON mode / response_format=json_schema) constrains the model at decode time so it cannot emit invalid JSON. This is stricter than just prompting 'reply in JSON' and removes a whole class of parsing errors.

How often is this list updated?

Daily. Our data pipeline pulls models.dev once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Last updated:

Prices are in USD per 1M tokens. "Unknown" means the provider does not publish per-token pricing.

Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.