gpt-oss-120b

openai/gpt-oss-120b

Von OpenAI · Familie: gpt-oss · veröffentlicht 2025-08-05 · Wissensstand: 2025-08

$0.030

Eingabe / 1 Mio. Tokens

$0.150

Ausgabe / 1 Mio. Tokens

128K

Kontextfenster

16K

Max. Ausgabe

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Fähigkeiten

✓ Tool Calling✓ Reasoning✓ Strukturierte Ausgabe✗ Anhänge✗ Offene Gewichte✓ Temperatur-Steuerung

Modalitäten: Eingabe text · Ausgabe text

Model fit scores

0–100 · higher is better

These scores reward declared capabilities, context size, price and provider availability — they are not benchmark results. Use them as a directional signal alongside your own evaluation.

Coding82

Tool calling40/40
Structured output20/20
Reasoning10/10
Context window (100K → 1M)2/20
Provider availability10/10

Agents95

Tool calling35/35
Structured output25/25
Reasoning15/15
Output token limit10/15
Provider availability10/10

JSON / structured output100

Structured output / JSON mode50/50
Tool calling20/20
Temperature control10/10
Price-friendly for high-volume20/20

Cost efficiency81

Headline price (log-scaled)81/95
Has prompt-cache pricing0/5

Long context45

Context window (100K → 2M)35/90
Has published price for full window10/10

Production-readiness94

Number of independent providers40/40
Has published per-token price20/20
Context window ≥ 8K15/15
No data inconsistencies across providers4/10
Official model (not derivative)15/15

Cost Efficiency Index

Open full calculator →

Estimated cost using the recommended provider's headline rate. Each scenario fixes average input/output tokens — the assumptions are shown in the third column.

Scenario	Cost	Assumption
RAG answer per 1,000 RAG answers	$0.23 < $0.01 per request	5K input tokens (query + 4 retrieved chunks of ~1K each) and a 500-token answer. Typical SaaS knowledge-base bot.
Support ticket triage per 10,000 tickets	$0.45 < $0.01 per request	1K input tokens (ticket body + system prompt) and a 100-token JSON classification reply. High-volume customer support.
Data extraction per 1,000 documents	$0.14 < $0.01 per request	2K input tokens (a single document page) and a 500-token JSON extraction. ETL / invoice / form pipelines.
Code review per 1,000 PRs	$0.39 < $0.01 per request	8K input tokens (diff + surrounding files) and a 1K-token review comment. PR-bot workloads.
Agent step per 1,000 steps	$0.45 < $0.01 per request	12K input tokens (long-running tool history) and a 600-token tool-call decision. Cost per agent step.

Preis-Details

Empfohlene Preise von openrouter · openai/gpt-oss-120b

$0.030

Eingabe

$0.150

Ausgabe

Günstigster Anbieter: privatemode-ai · Unknown Eingabe + Unknown Ausgabe

Bei 37 Anbietern verfügbar

Anbieter	Anbieter-Modell-ID	Eingabe / 1M	Ausgabe / 1M	Kontext	Veröffentlicht
Amazon Bedrock amazon-bedrock	openai.gpt-oss-120b	$0.150	$0.600	128K	2025-08-05
OpenRouter openrouter	openai/gpt-oss-120b	$0.030	$0.150	131K	2025-08-05
Vercel AI Gateway vercel	openai/gpt-oss-120b	$0.100	$0.500	131K	2025-08-05
Groq groq	openai/gpt-oss-120b	$0.150	$0.600	131K	2025-08-05
Together AI togetherai	openai/gpt-oss-120b	$0.150	$0.600	131K	2025-08-05
Deep Infra deepinfra	openai/gpt-oss-120b	$0.039	$0.190	131K	2025-08-05
Hugging Face huggingface	openai/gpt-oss-120b	$0.250	$0.690	131K	2025-08-05
Qiniu qiniu-ai	gpt-oss-120b	Unknown	Unknown	128K	2025-08-06
Regolo AI regolo-ai	gpt-oss-120b	$1.00	$4.20	128K	2025-08-05
STACKIT stackit	openai/gpt-oss-120b	$0.490	$0.710	131K	2025-08-05
submodel submodel	openai/gpt-oss-120b	$0.100	$0.500	131K	2025-08-23
NovitaAI novita-ai	openai/gpt-oss-120b	$0.050	$0.250	131K	2025-08-06
Privatemode AI privatemode-ai	gpt-oss-120b	Unknown	Unknown	128K	2025-08-04
Nebius Token Factory nebius	openai/gpt-oss-120b	$0.150	$0.600	128K	2026-01-10
Tinfoil tinfoil	gpt-oss-120b	$0.150	$0.600	131K	2025-08-05
Cloudflare Workers AI cloudflare-workers-ai	@cf/openai/gpt-oss-120b	$0.350	$0.750	128K	2025-08-05
DigitalOcean digitalocean	openai-gpt-oss-120b	$0.100	$0.700	131K	2025-08-05
Venice AI venice	openai-gpt-oss-120b	$0.070	$0.300	128K	2025-11-06
Berget.AI berget	openai/gpt-oss-120b	$0.220	$0.830	128K	2025-08-05
Helicone helicone	gpt-oss-120b	$0.040	$0.160	131K	2024-06-01
NEAR AI Cloud nearai	openai/gpt-oss-120b	$0.150	$0.550	131K	2025-08-05
Abacus abacus	openai/gpt-oss-120b	$0.080	$0.440	128K	2025-08-05
CloudFerro Sherlock cloudferro-sherlock	openai/gpt-oss-120b	$2.92	$2.92	131K	2025-08-28
Ollama Cloud ollama-cloud	gpt-oss:120b	Unknown	Unknown	131K	2025-08-05
Cloudflare AI Gateway cloudflare-ai-gateway	workers-ai/@cf/openai/gpt-oss-120b	$0.350	$0.750	128K	2025-08-05
Nvidia nvidia	openai/gpt-oss-120b	Unknown	Unknown	128K	2025-08-04
evroc evroc	openai/gpt-oss-120b	$0.230	$0.920	66K	2025-08-05
SiliconFlow siliconflow	openai/gpt-oss-120b	$0.050	$0.450	131K	2025-08-13
IO.NET io-net	openai/gpt-oss-120b	$0.040	$0.400	131K	2024-12-01
Scaleway scaleway	gpt-oss-120b	$0.150	$0.600	128K	2024-01-01
OVHcloud AI Endpoints ovhcloud	gpt-oss-120b	$0.090	$0.470	131K	2025-08-28
Weights & Biases wandb	openai/gpt-oss-120b	$0.150	$0.600	131K	2025-08-05
Kilo Gateway kilo	openai/gpt-oss-120b	$0.039	$0.190	131K	2025-08-05
FastRouter fastrouter	openai/gpt-oss-120b	$0.150	$0.600	131K	2025-08-05
Baseten baseten	openai/gpt-oss-120b	$0.100	$0.500	128K	2025-08-05
NanoGPT nano-gpt	openai/gpt-oss-120b	$0.050	$0.250	128K	2025-08-05
NanoGPT nano-gpt	TEE/gpt-oss-120b	$2.00	$2.00	131K	2025-08-05

Datenunterschiede zwischen Anbietern

context_window varies: 128000, 128072, 131000, 131072, 65536
release_date varies (span 740d): 2024-01-01, 2024-06-01, 2024-12-01, 2025-08-04, 2025-08-05, 2025-08-06, 2025-08-13, 2025-08-23, 2025-08-28, 2025-11-06, 2026-01-10
modalities varies across offerings

Anbieter melden unterschiedliche Werte für dieses Modell. Die Schnellinfos oben nutzen den repräsentativen Anbieter; pro Anbieter siehe Tabelle.

Frequently asked questions

How much does gpt-oss-120b cost?

gpt-oss-120b costs $0.030 per 1M input tokens and $0.150 per 1M output tokens, sourced from openrouter. Cache reads, audio tokens and >200K-context tiers (where applicable) are listed in the Pricing detail block above.

What is the context window of gpt-oss-120b?

gpt-oss-120b has a context window of 128K tokens, with a max output of 16K tokens per reply. This is the total combined size of prompt + completion.

Does gpt-oss-120b support tool calling?

Yes. gpt-oss-120b supports tool calling (function calling). This makes it suitable for production agent and automation workloads where the model has to invoke external functions reliably.

Does gpt-oss-120b support structured output / JSON mode?

Yes. gpt-oss-120b supports structured output / JSON-schema-constrained decoding. This makes it suitable for production agent and automation workloads where the model has to invoke external functions reliably.

Can gpt-oss-120b accept image input?

No. gpt-oss-120b only accepts text as input. If you need image input, see our /capabilities/vision list for current vision-capable models.

Is gpt-oss-120b open-weight?

No. gpt-oss-120b is a proprietary model — only OpenAI (and any approved hosting partners) can serve it. The pricing above reflects the cheapest API access.

What are the best alternatives to gpt-oss-120b?

If gpt-oss-120b doesn't fit, consider GPT-5.4, GPT-5.2, GPT-5 Mini. Each one targets the same use case — see the Related links below for direct head-to-head pages.

Where does this data come from?

All numbers are normalised into a single canonical model record and reconciled with each provider's official documentation. We re-pull daily and write any changes (price, context, capability) to the /changelog page.

More OpenAI models

GPT-5.4$2.50 in / $15.00 out
GPT-5.2$1.75 in / $14.00 out
GPT-5 Mini$0.25 in / $2.00 out
GPT-5.5$5.00 in / $30.00 out
GPT-5$1.25 in / $10.00 out

gpt-oss-120b

Fähigkeiten

Model fit scores

Cost Efficiency Index

Preis-Details

Bei 37 Anbietern verfügbar

Datenunterschiede zwischen Anbietern

Frequently asked questions

More OpenAI models

Capability lists this model is in

See also

Fähigkeiten

Model fit scores

Cost Efficiency Index

Preis-Details

Bei 37 Anbietern verfügbar

Datenunterschiede zwischen Anbietern

Frequently asked questions

Explore more

More OpenAI models

Capability lists this model is in

See also