AI 模型情报

能力 · 2026-05-12

开放权重的 AI 模型

对比在许可下公开训练权重的 AI 模型 —— 适合自托管、私域微调和受监管环境。

这是什么?

  • 开放权重模型在许可(常见 Apache 2.0、MIT 或自定义开放研究许可)下发布可下载的权重。
  • 注意:开放权重 ≠ 开源 —— 训练数据与代码通常并不公开。

为什么重要

  • 可在自有 GPU 上自托管、用私域数据微调、离线运行,或部署到受监管环境。
  • 本页价格反映最便宜的 API 托管价;若自建硬件,同一模型也可零 API 成本运行。

414 个模型支持此能力

模型厂商输入 / 1M输出 / 1M上下文服务商
Whisper Large v3scaleway$0.003UnknownUnknown1
Voxtral Small 24B 2507Mistral$0.002$0.00232K3
KB Whisperevroc$0.002$0.0024481
Multi-QA-mpnet-base-dot-v1digitalocean$0.009Unknown5121
All-MiniLM-L6-v2digitalocean$0.009Unknown2561
BGE Reranker v2 M3digitalocean$0.010Unknown8K1
Llama 3.2 1B InstructMeta$0.010$0.01016K5
Llama Prompt Guard 2 22MMeta$0.010$0.0105122
Llama Prompt Guard 2 86MMeta$0.010$0.0105122
E5 Large v2digitalocean$0.020Unknown5121
BGE M3digitalocean$0.020Unknown8K1
dots.ocrchutes$0.010$0.011131K1
Gemma 3 4BGoogle$0.010$0.02733K7
PaddleOCR-VLnovita-ai$0.020$0.02016K1
Meta-Llama-3.1-8B-InstructMeta$0.020$0.030128K25
Llama 3.1 8B TurboMeta$0.020$0.030131K2
Mistral Nemo Instruct 2407Mistral$0.020$0.040128K8
Gemma 3n 4BGoogle$0.020$0.0408K5
Hermes 4 14Bchutes$0.014$0.05441K1
Meta-Llama-3-8B-InstructMeta$0.030$0.0408K8
Llama Guard 3 8BMeta$0.020$0.0608K3
Ministral 3Bazure-cognitive-services$0.040$0.040128K1
Stable Diffusion 3.5 Largedigitalocean$0.080Unknown2561
Ministral 3B (latest)Mistral$0.040$0.040128K1
Ministral 3Bazure$0.040$0.040128K1
Sao10K: Llama 3 8B LunarisMeta$0.040$0.0508K1
GTE Large (v1.5)digitalocean$0.090Unknown8K1
Llama-3.2-11B-Vision-InstructMeta$0.049$0.049128K8
Sao10k L3 8B Lunaris novita-ai$0.050$0.0508K1
L3 8B Stheno V3.2novita-ai$0.050$0.0508K1
MythoMax 13Bkilo$0.060$0.0604K1
IBM: Granite 4.0 Microkilo$0.017$0.110131K1
Gemma 3 12BGoogle$0.030$0.10033K10
Llama 3.1 8B InstantMeta$0.050$0.080131K2
Llama 3 8BMeta$0.050$0.0808K1
Gemma 3 27BGoogle$0.027$0.109131K14
Qwen2.5-Coder 32B InstructAlibaba (Qwen)$0.027$0.109131K7
DeepSeek R1 Distill Llama 70BDeepSeek$0.027$0.1098K5
baichuan-m2-32bnovita-ai$0.070$0.070131K1
LiquidAI: LFM2-24B-A2Bkilo$0.030$0.12033K1
DeepSeek R1 Distill Llama 70BMeta$0.030$0.130131K5
GPT OSS 20BOpenAI$0.030$0.140131K23
Mythomax L2 13Bnovita-ai$0.090$0.0904K1
Command R7BCohere$0.037$0.150128K2
Command R7B ArabicCohere$0.037$0.150128K1
Arcee AI: Trinity Minikilo$0.045$0.150131K1
Trinity Miniclarifai$0.045$0.150131K1
GPT OSS 120BOpenAI$0.040$0.160131K33
Qwen3 235B A22B Instruct 2507Alibaba (Qwen)$0.100$0.100262K18
Qwen3-235B-A22B-Thinking-2507Alibaba (Qwen)$0.100$0.100262K17
Qwen3 30B A3B Instruct 2507Alibaba (Qwen)$0.100$0.100262K12
Qwen3 30B A3B Thinking 2507Alibaba (Qwen)$0.100$0.100262K7
nvidia-nemotron-nano-9b-v2NVIDIA$0.040$0.160131K6
Qwen/Qwen3.5-9BAlibaba (Qwen)$0.050$0.150262K6
Phi-4Microsoft$0.060$0.140128K4
Reka Edgekilo$0.100$0.10016K1
Llama 3.1 8BMeta$0.100$0.10032K1
GPT OSS 120Bsynthetic$0.100$0.100128K1
Ministral 8B (latest)Mistral$0.100$0.100128K1
Ministral 3Bllmgateway$0.100$0.100131K1

显示前 60 / 共 414 项。 完整目录 进一步筛选。

Frequently asked questions

How many AI models support 开放权重?

414 canonical models in our database currently support 开放权重. The list is regenerated on every data refresh, so it always reflects the latest model releases from models.dev.

What is the cheapest model with 开放权重?

Whisper Large v3 from scaleway is currently the lowest-priced option, at $0.003 per 1M input tokens and Unknown per 1M output tokens. The full table above is sorted price-ascending.

Which model with 开放权重 has the largest context window?

Llama 4 Scout 17B Instruct (Meta) leads on context at 3.50M tokens. This may matter if you also need long-document understanding alongside 开放权重.

Which models are available on the most providers?

Production-readiness usually correlates with how many independent providers host the same weights. The top three by provider count are: Kimi K2.5 (45), MiniMax-M2.5 (40), GLM-5 (38).

How is 开放权重 different from a regular LLM?

Open-weight models publish their trained weights publicly. You can self-host on your own GPUs, fine-tune on private data or run offline. Note: open weights ≠ open source — training data and code are usually not released.

How often is this list updated?

Daily. Our data pipeline pulls models.dev once a day, regenerates the canonical model list, and rebuilds these pages so newly released models appear within 24 hours.

最近更新:

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.