AI Model Intelligence

Provider · 2026-06-29

llmgateway

20 canonical models · 36 entries in total (derivatives included)
| Model | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context | Providers | Tags |
| --- | --- | --- | --- | --- | --- |
| GPT OSS 20B | $0.040 | $0.150 | 131K | 1 | tools · json · reasoning |
| Ministral 3B | $0.100 | $0.100 | 131K | 1 | json · vision · open-weights |
| Ministral 8B | $0.150 | $0.150 | 262K | 1 | json · vision · open-weights |
| GPT OSS 120B | $0.050 | $0.250 | 131K | 1 | tools · json · reasoning |
| Seed 1.6 Flash (250715) | $0.070 | $0.300 | 256K | 1 | tools · json · reasoning · vision · open-weights |
| Ministral 14B | $0.200 | $0.200 | 262K | 1 | json · vision · open-weights |
| MiMo-V2.5 | $0.140 | $0.280 | 1M | 1 | tools · reasoning · vision · open-weights |
| MiMo-V2.5-Pro | $0.435 | $0.870 | 1M | 1 | tools · reasoning · open-weights |
| Seed 1.8 (251228) | $0.250 | $2.00 | 256K | 1 | tools · json · reasoning · vision · open-weights |
| Seed 1.6 (250615) | $0.250 | $2.00 | 256K | 1 | tools · json · reasoning · vision · open-weights |
| Seed 1.6 (250915) | $0.250 | $2.00 | 256K | 1 | tools · json · reasoning · vision · open-weights |
| MiMo-V2-Omni | $0.400 | $2.00 | 256K | 1 | tools · vision |
| QwQ Plus | $0.800 | $2.40 | 131K | 1 | tools · reasoning |
| MiMo-V2-Pro | $1.00 | $3.00 | 1M | 1 | tools · reasoning |
| o4-mini | $1.10 | $4.40 | 200K | 1 | tools · json · reasoning · vision |
| Pixtral Large (latest) | $4.00 | $12.00 | 128K | 1 | tools · vision · open-weights |
| Fugu Ultra | $5.00 | $30.00 | 1M | 1 | tools · reasoning · vision |
| o1 | $15.00 | $60.00 | 200K | 1 | tools · json · reasoning · vision |
| Auto Route | Unknown | Unknown | 128K | 1 | tools · json · vision |
| Custom Model | Unknown | Unknown | 128K | 1 | tools · json · vision |
| Qwen3 4B FP8 [derived] | $0.030 | $0.030 | 128K | 1 | tools · json · reasoning · open-weights |
| GLM-4 32B (0414-128k) [derived] | $0.100 | $0.100 | 128K | 1 | tools · json |
| GLM-4.6V FlashX [derived] | $0.040 | $0.400 | 128K | 1 | tools · json · reasoning · vision |
| Qwen3 VL Flash [derived] | $0.050 | $0.400 | 262K | 1 | tools · json · vision |
| MiniMax M2.1 Lightning [derived] | $0.120 | $0.480 | 197K | 1 | reasoning · open-weights |
| MiniMax Text 01 [derived] | $0.200 | $1.10 | 1M | 1 | reasoning · open-weights |
| Qwen Coder Plus [derived] | $0.502 | $1.00 | 131K | 1 | tools · json |
| Qwen Plus Latest [derived] | $0.400 | $1.20 | 1M | 1 | tools · json · vision |
| Nemotron 3 Ultra 550B A55B [derived] | $0.500 | $2.50 | 262K | 1 | tools · reasoning · open-weights |
| Qwen3.5 397B-A17B [derived] | $0.600 | $3.60 | 262K | 1 | tools · json · reasoning · vision · open-weights |
| GLM-4.5 AirX [derived] | $1.10 | $4.50 | 128K | 1 | tools · json |
| Grok 4.20 (Non-Reasoning) [derived] | $2.00 | $6.00 | 2M | 1 | tools · json · vision |
| Grok 4.20 (Reasoning) [derived] | $2.00 | $6.00 | 2M | 1 | tools · json · reasoning · vision |
| Qwen Max Latest [derived] | $1.60 | $6.40 | 33K | 1 | tools · json · vision |
| GLM-4.5 X [derived] | $2.20 | $8.90 | 128K | 1 | tools · json · reasoning |
| GLM-4.7 Flash (Free) [derived] | Unknown | Unknown | 200K | 1 | tools · reasoning |

Frequently asked questions

How many AI models does llmgateway offer?

We track 20 canonical llmgateway models plus 16 community fine-tunes / derivatives, for 36 entries in total. Derivatives are flagged as derived in the table above. The list is recomputed daily.

Which llmgateway model is the cheapest?

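Among canonical llmgateway models, GPT OSS 20B currently has the lowest input price, at $0.040 per 1M input tokens and $0.150 per 1M output tokens. Ministral 3B has the lowest canonical output price ($0.100 per 1M output tokens), and the derived entry Qwen3 4B FP8 is listed lower still at $0.030 per 1M tokens for both input and output. For the full apples-to-apples list, see /pricing/cheapest-llm-api.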
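Which model is cheapest depends on the ratio of input to output tokens in a workload. The sketch below ranks three rows from the table by a blended price. The `blended_price` and `cheapest` helpers and the `input_share` parameter are illustrative assumptions, not part of any llmgateway API.

```python
# Prices in USD per 1M tokens, copied from three rows of the table above.
PRICES = {
    "GPT OSS 20B": (0.040, 0.150),
    "Ministral 3B": (0.100, 0.100),
    "GPT OSS 120B": (0.050, 0.250),
}


def blended_price(input_price, output_price, input_share):
    """Average USD price per 1M tokens when `input_share` of tokens are input."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_price + (1.0 - input_share) * output_price


def cheapest(prices, input_share):
    """Return the model name with the lowest blended price for this mix."""
    return min(prices, key=lambda name: blended_price(*prices[name], input_share))


print(cheapest(PRICES, 0.75))  # input-heavy workload
print(cheapest(PRICES, 0.10))  # output-heavy workload
```

With these three rows, an input-heavy mix (75% input) selects GPT OSS 20B, while an output-heavy mix (10% input) selects Ministral 3B, because its output price is lower.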

Which llmgateway model has the largest context window?

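Among canonical models, MiMo-V2.5 is tied for the largest window at 1M tokens with MiMo-V2.5-Pro, MiMo-V2-Pro, and Fugu Ultra. The derived Grok 4.20 entries list 2M tokens. The context figure is the total of prompt + completion tokens.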
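Because the context figure covers prompt and completion together, a long prompt reduces the room left for output. A minimal sketch of that budget check follows. The helper names are hypothetical, and it assumes K means 1,000 and M means 1,000,000, which the page does not confirm, so treat the results as approximate.

```python
def parse_context(label):
    """Convert a table label such as '131K' or '1M' to a token count.

    Assumption: K = 1,000 and M = 1,000,000. The page does not say whether
    its labels are rounded, so the result is approximate.
    """
    units = {"K": 1_000, "M": 1_000_000}
    label = label.strip().upper()
    if label and label[-1] in units:
        return int(float(label[:-1]) * units[label[-1]])
    return int(label)


def completion_budget(context_label, prompt_tokens):
    """Tokens left for the completion once the prompt is counted."""
    return max(0, parse_context(context_label) - prompt_tokens)


def fits(context_label, prompt_tokens, max_completion_tokens):
    """True if prompt plus requested completion stays within the window."""
    return prompt_tokens + max_completion_tokens <= parse_context(context_label)


print(completion_budget("131K", 120_000))
print(fits("33K", 30_000, 4_000))
```

Under these assumptions, a 120,000-token prompt on a 131K model leaves about 11,000 tokens for the completion, and a 30,000-token prompt with a 4,000-token completion does not fit in a 33K window.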

Which llmgateway models support tool calling?

Most llmgateway models support tool calling (31 of the 36 entries listed above), GPT OSS 20B among them. The Tags column in the table above marks each such model with 'tools'.

Which llmgateway models accept image input?

Ministral 3B accepts image input. Other vision-capable llmgateway models are tagged 'vision' in the table above. See /capabilities/vision for a cross-vendor comparison.
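Capability questions like the two above reduce to filtering on the table's tags. The sketch below parses a tags cell and selects models that carry every required tag. `parse_tags` and `models_with` are hypothetical helper names, and the rows are a small sample copied from the table, not a live data feed.

```python
def parse_tags(cell):
    """Split a Tags cell such as 'tools · json · vision' into a set."""
    return {tag.strip() for tag in cell.split("·") if tag.strip()}


# Tags copied from four rows of the table above.
ROWS = {
    "GPT OSS 20B": "tools · json · reasoning",
    "Ministral 3B": "json · vision · open-weights",
    "MiMo-V2-Omni": "tools · vision",
    "o4-mini": "tools · json · reasoning · vision",
}


def models_with(rows, required):
    """Names of models whose tags include every tag in `required`."""
    required = set(required)
    return [name for name, cell in rows.items() if required <= parse_tags(cell)]


print(models_with(ROWS, {"tools"}))
print(models_with(ROWS, {"tools", "vision"}))
```

In this sample, Ministral 3B is tagged 'vision' but not 'tools', so it drops out when both capabilities are required.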

What are the best alternatives to llmgateway?

Depends on the use case. For raw cost savings, look at /pricing/cheapest-llm-api. For agent-oriented workloads, /best/best-ai-model-for-agents. For long-document workflows, /best/best-long-context-llm.

How fresh is this llmgateway pricing data?

Daily. Our pipeline syncs every morning and rebuilds these pages on data change, so list-price moves and new model releases land within roughly 24 hours.

Last updated:

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
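As a worked example of per-1M-token pricing, the sketch below converts list prices into the cost of a single request. `request_cost` is a hypothetical helper. It uses `Decimal` to avoid floating-point rounding on currency and returns `None` when a price is Unknown.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def request_cost(input_price, output_price, input_tokens, output_tokens):
    """Return the USD cost of one request, or None if a price is Unknown.

    Prices are strings such as "1.10" (USD per 1M tokens); None stands for
    a table cell that reads Unknown.
    """
    if input_price is None or output_price is None:
        return None
    input_cost = Decimal(input_price) * input_tokens / TOKENS_PER_UNIT
    output_cost = Decimal(output_price) * output_tokens / TOKENS_PER_UNIT
    return input_cost + output_cost


# o4-mini row: $1.10 input, $4.40 output; 2,000 prompt and 500 completion tokens.
print(request_cost("1.10", "4.40", 2_000, 500))
# Auto Route row: both prices are Unknown.
print(request_cost(None, None, 2_000, 500))
```

For the o4-mini row, 2,000 input tokens cost 2,000 × 1.10 / 1,000,000 = 0.0022 USD and 500 output tokens cost 500 × 4.40 / 1,000,000 = 0.0022 USD, for a total of 0.0044 USD.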

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.