能力 · 2026-06-29

开放权重的 AI 模型

对比在许可下公开训练权重的 AI 模型 —— 适合自托管、私域微调和受监管环境。

这是什么？

开放权重模型在许可（常见 Apache 2.0、MIT 或自定义开放研究许可）下发布可下载的权重。
注意：开放权重 ≠ 开源 —— 训练数据与代码通常并不公开。

为什么重要

可在自有 GPU 上自托管、用私域数据微调、离线运行，或部署到受监管环境。
本页价格反映最便宜的 API 托管价；若自建硬件，同一模型也可零 API 成本运行。

487 个模型支持此能力

模型	厂商	输入 / 1M	输出 / 1M	上下文	服务商
Whisper Large v3	scaleway	$0.003	Unknown	Unknown	1
Voxtral Small 24B 2507	Mistral	$0.002	$0.002	32K	4
Whisper Large v3	OpenAI	$0.002	$0.002	448	3
KB Whisper	evroc	$0.002	$0.002	448	1
All-MiniLM-L6-v2	digitalocean	$0.009	Unknown	256	1
Multi-QA-mpnet-base-dot-v1	digitalocean	$0.009	Unknown	512	1
BGE Reranker v2 M3	digitalocean	$0.010	Unknown	8K	1
Llama 3.2 1B Instruct	Meta	$0.010	$0.010	60K	6
llama-3.1-nemotron-safety-guard-8b-v3	NVIDIA	$0.010	$0.010	128K	2
Prompt Guard 2 86M	Meta	$0.010	$0.010	512	2
Llama Prompt Guard 2 22M	Meta	$0.010	$0.010	512	2
E5 Large v2	digitalocean	$0.020	Unknown	512	1
BGE M3	digitalocean	$0.020	Unknown	8K	1
Llama 3.2 3B Instruct	Meta	$0.020	$0.020	80K	9
PaddleOCR-VL	novita-ai	$0.020	$0.020	16K	1
Meta-Llama-3.1-8B-Instruct	Meta	$0.020	$0.030	128K	20
Nomic Embed Text v1.5	tinfoil	$0.050	Unknown	8K	1
Mistral Nemo	Mistral	$0.020	$0.040	128K	6
Gemma 3n 4B	Google	$0.020	$0.040	33K	4
Meta-Llama-3-8B-Instruct	Meta	$0.030	$0.040	8K	9
Llama Guard 3 8B	Meta	$0.020	$0.060	131K	3
Ministral 3B (latest)	Mistral	$0.040	$0.040	128K	1
Stable Diffusion 3.5 Large	digitalocean	$0.080	Unknown	256	1
Ministral 3B	azure	$0.040	$0.040	128K	1
Ministral 3B	azure-cognitive-services	$0.040	$0.040	128K	1
Llama 3 8B Lunaris	Meta	$0.040	$0.050	8K	2
GTE Large (v1.5)	digitalocean	$0.090	Unknown	8K	1
Llama-3.2-11B-Vision-Instruct	Meta	$0.049	$0.049	128K	9
L3 8B Stheno V3.2	novita-ai	$0.050	$0.050	8K	1
Sao10k L3 8B Lunaris	novita-ai	$0.050	$0.050	8K	1
MythoMax 13B	kilo	$0.060	$0.060	4K	1
MythoMax 13B	openrouter	$0.060	$0.060	4K	1
Sarvam 30B	fastrouter	$0.020	$0.100	128K	1
IBM: Granite 4.0 Micro	kilo	$0.017	$0.110	131K	1
Granite 4.0 H Micro	cloudflare-workers-ai	$0.017	$0.112	131K	1
Granite 4.0 Micro	openrouter	$0.017	$0.112	131K	1
Llama 3.1 8B	Meta	$0.050	$0.080	131K	2
Google Gemma 3 27B Instruct	Google	$0.030	$0.110	203K	10
baichuan-m2-32b	novita-ai	$0.070	$0.070	131K	1
LiquidAI: LFM2-24B-A2B	kilo	$0.030	$0.120	33K	1
LFM2-24B-A2B	togetherai	$0.030	$0.120	33K	1
LFM2-24B-A2B	openrouter	$0.030	$0.120	33K	1
Granite 4.1 8B	openrouter	$0.050	$0.100	131K	1
R1 Distill Llama 70B	DeepSeek	$0.030	$0.140	8K	4
Qwen3 235B A22B 2507	Alibaba (Qwen)	$0.071	$0.100	262K	3
Mythomax L2 13B	novita-ai	$0.090	$0.090	4K	1
Command R7B	Cohere	$0.037	$0.150	128K	4
Command R7B Arabic	Cohere	$0.037	$0.150	128K	1
Qwen3.5 9B	Alibaba (Qwen)	$0.040	$0.150	262K	14
Arcee AI: Trinity Mini	kilo	$0.045	$0.150	131K	1
Trinity Mini	openrouter	$0.045	$0.150	131K	1
Trinity Mini	clarifai	$0.045	$0.150	131K	1
Qwen3 235B A22B Instruct 2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
Qwen3-235B-A22B-Thinking-2507	Alibaba (Qwen)	$0.100	$0.100	262K	16
nvidia-nemotron-nano-9b-v2	NVIDIA	$0.040	$0.160	131K	5
Phi-4	Microsoft	$0.060	$0.140	128K	5
Ministral 3 3B 2512	Mistral	$0.100	$0.100	131K	3
Ministral 8B (latest)	Mistral	$0.100	$0.100	128K	1
Ministral 3B	llmgateway	$0.100	$0.100	131K	1
Reka Edge	kilo	$0.100	$0.100	16K	1

显示前 60 / 共 487 项。用完整目录进一步筛选。

Frequently asked questions

How many AI models support 开放权重?

487 canonical models in our database currently support 开放权重. The list is regenerated on every data refresh, so it always reflects the latest releases tracked in our catalogue.

What is the cheapest model with 开放权重?

Whisper Large v3 from scaleway is currently the lowest-priced option, at $0.003 per 1M input tokens and Unknown per 1M output tokens. The full table above is sorted price-ascending.

Which model with 开放权重 has the largest context window?

Llama 4 Scout 17B Instruct (US) (Meta) leads on context at 3.50M tokens. This may matter if you also need long-document understanding alongside 开放权重.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.6 (49), Kimi K2.5 (48), GLM-5.1 (47).

How is 开放权重 different from a regular LLM?

Open-weight models publish their trained weights publicly. You can self-host on your own GPUs, fine-tune on private data or run offline. Note: open weights ≠ open source — training data and code are usually not released.

How often is this list updated?

Daily. Our data pipeline syncs once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

Top models with this capability

Whisper Large v3$0.00 in / $0.00 out
Voxtral Small 24B 2507$0.00 in / $0.00 out
Whisper Large v3$0.00 in / $0.00 out
KB Whisper$0.00 in / $0.00 out
All-MiniLM-L6-v2$0.01 in / $0.00 out

Other capabilities

Best-of lists you might also want

Pricing comparisons

Vendors in this list

最近更新： 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.