AI Model Intelligence

Provider · 2026-05-12

Cerebras

1 canonical model · 2 total entries (including derivatives)
| Model | Input / 1M | Output / 1M | Context | Providers | Tags |
| --- | --- | --- | --- | --- | --- |
| GPT OSS 120B | $0.250 | $0.690 | 131K | 1 | tools · reasoning · open-weights |
| Qwen 3 235B Instruct (derivative) | $0.600 | $1.20 | 131K | 1 | tools · open-weights |

Frequently asked questions

How many AI models does Cerebras offer?

We track 1 canonical Cerebras model plus 1 community fine-tune / derivative (marked as a derivative in the table above). The list is recomputed daily from models.dev.

Which Cerebras model is the cheapest?

GPT OSS 120B is currently the lowest-priced Cerebras model, at $0.250 per 1M input tokens and $0.690 per 1M output tokens. For the full apples-to-apples list, see /pricing/cheapest-llm-api.
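As a rough illustration of how per-1M list prices translate into the cost of a single request, here is a minimal sketch. The prices are copied from the table above; the function names `request_cost` and `cheapest` are hypothetical helpers introduced for this example, and actual billing may differ from list price.

```python
from decimal import Decimal

# List prices from the table above: (input, output) in USD per 1M tokens.
PRICES_PER_1M = {
    "GPT OSS 120B": (Decimal("0.250"), Decimal("0.690")),
    "Qwen 3 235B Instruct": (Decimal("0.600"), Decimal("1.20")),
}

MILLION = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the list-price cost in USD of one request (hypothetical helper)."""
    input_price, output_price = PRICES_PER_1M[model]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Return the model with the lowest list-price cost for this token mix."""
    return min(
        PRICES_PER_1M,
        key=lambda model: request_cost(model, input_tokens, output_tokens),
    )


if __name__ == "__main__":
    # 10,000 prompt tokens and 2,000 completion tokens on each listed model.
    for name in PRICES_PER_1M:
        print(name, request_cost(name, 10_000, 2_000))
    print("cheapest:", cheapest(10_000, 2_000))
```

Because output tokens are priced higher than input tokens on both entries, the ranking can in principle depend on the input/output mix, which is why `cheapest` takes both counts instead of comparing a single number.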

Which Cerebras model has the largest context window?

GPT OSS 120B leads at 131K tokens; the derivative Qwen 3 235B Instruct lists the same 131K window. The figure is the combined budget for prompt and completion tokens.
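Since the window covers prompt and completion together, a longer prompt leaves less room for output. The sketch below shows that arithmetic. It assumes "131K" means 131,000 tokens; the exact limit may differ (for example 131,072), so check the provider's documentation. `max_completion_tokens` and `fits` are hypothetical helper names.

```python
# Assumption: "131K" is read as 131,000 tokens; the exact limit may differ.
CONTEXT_WINDOW = 131_000


def max_completion_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many completion tokens remain after the prompt is counted."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_window - prompt_tokens, 0)


def fits(prompt_tokens: int, completion_tokens: int, context_window: int = CONTEXT_WINDOW) -> bool:
    """Return True if prompt plus completion stays within the context window."""
    return prompt_tokens + completion_tokens <= context_window


if __name__ == "__main__":
    # A 100,000-token prompt leaves 31,000 tokens for the completion.
    print(max_completion_tokens(100_000))
    print(fits(100_000, 31_000))
```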

Which Cerebras models support tool calling?

Both entries in the table above, GPT OSS 120B and Qwen 3 235B Instruct, support tool calling. The Tags column marks every Cerebras model with tool-calling support using the tools tag.
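For readers who want to filter the listing by capability programmatically, here is a minimal sketch. The entries and tags are transcribed from the table above; the `ModelEntry` class and `with_tag` function are hypothetical and do not reflect the schema used by models.dev.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One row of the table above (hypothetical structure for illustration)."""
    name: str
    tags: frozenset
    derivative: bool = False


# Rows transcribed by hand from the table above.
ENTRIES = [
    ModelEntry("GPT OSS 120B", frozenset({"tools", "reasoning", "open-weights"})),
    ModelEntry("Qwen 3 235B Instruct", frozenset({"tools", "open-weights"}), derivative=True),
]


def with_tag(entries, tag, include_derivatives=True):
    """Return names of entries carrying `tag`, optionally skipping derivatives."""
    return [
        entry.name
        for entry in entries
        if tag in entry.tags and (include_derivatives or not entry.derivative)
    ]


if __name__ == "__main__":
    print(with_tag(ENTRIES, "tools"))
    print(with_tag(ENTRIES, "tools", include_derivatives=False))
```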

What are the best alternatives to Cerebras?

Depends on the use case. For raw cost savings, look at /pricing/cheapest-llm-api. For agent-oriented workloads, /best/best-ai-model-for-agents. For long-document workflows, /best/best-long-context-llm.

How fresh is this Cerebras pricing data?

Daily. Our pipeline pulls models.dev each morning and rebuilds these pages on data change, so list-price moves and new model releases land within roughly 24 hours.

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.