Provider · 2026-05-12
inference
| Model | Input / 1M | Output / 1M | Context | Providers | Tags |
|---|---|---|---|---|---|
| Osmosis Structure 0.6B | $0.100 | $0.500 | 4K | 1 | tools · open-weights |
| Mistral Nemo 12B Instruct (derivative) | $0.038 | $0.100 | 16K | 1 | tools · open-weights |
| Qwen 2.5 7B Vision Instruct (derivative) | $0.200 | $0.200 | 125K | 1 | tools · vision · open-weights |
| Google Gemma 3 (derivative) | $0.150 | $0.300 | 125K | 1 | tools · vision · open-weights |
Frequently asked questions
How many AI models does inference offer?
We track 1 canonical inference model plus 3 community fine-tunes / derivatives (excluded from the main table). The list is recomputed daily from models.dev.
Which inference model is the cheapest?
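Osmosis Structure 0.6B is currently the lowest-priced canonical inference model, at $0.100 per 1M input tokens and $0.500 per 1M output tokens. Among the derivatives in the table above, Mistral Nemo 12B Instruct is listed lower, at $0.038 per 1M input tokens and $0.100 per 1M output tokens. For the full apples-to-apples list, see /pricing/cheapest-llm-api.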
Which inference model has the largest context window?
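Among canonical models, Osmosis Structure 0.6B leads at 4K tokens. The derivatives Qwen 2.5 7B Vision Instruct and Google Gemma 3 in the table above are listed at 125K. The context window is the total of prompt + completion tokens.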
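Because the context window covers prompt and completion together, a request only fits if both token counts sum to no more than the window. The following is a minimal sketch of that check. The function names are hypothetical, the token counts are made up for illustration, and "4K" is assumed here to mean 4,000 tokens (the provider may define it as 4,096).

```python
def fits_context(prompt_tokens: int, max_completion_tokens: int, context_window: int) -> bool:
    """Return True if prompt plus requested completion fits in the window."""
    return prompt_tokens + max_completion_tokens <= context_window


def completion_budget(prompt_tokens: int, context_window: int) -> int:
    """Return how many completion tokens remain after the prompt (never negative)."""
    return max(0, context_window - prompt_tokens)


# Assumption: "4K" is read as 4,000 tokens; adjust if the provider means 4,096.
CONTEXT_WINDOW = 4_000

print(fits_context(3_500, 400, CONTEXT_WINDOW))   # prompt + completion = 3,900
print(fits_context(3_500, 600, CONTEXT_WINDOW))   # prompt + completion = 4,100
print(completion_budget(3_500, CONTEXT_WINDOW))   # tokens left for the completion
```

The first call fits and the second does not, since 4,100 exceeds the assumed 4,000-token window.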
Which inference models support tool calling?
All four models in the table above carry the tools tag, with Osmosis Structure 0.6B being a popular pick. The last column of the table lists each model's capabilities, including tool-calling support.
What are the best alternatives to inference?
Depends on the use case. For raw cost savings, look at /pricing/cheapest-llm-api. For agent-oriented workloads, /best/best-ai-model-for-agents. For long-document workflows, /best/best-long-context-llm.
How fresh is this inference pricing data?
Daily. Our pipeline pulls models.dev each morning and rebuilds these pages on data change, so list-price moves and new model releases land within roughly 24 hours.
Explore more
Top inference models
- Osmosis Structure 0.6B: $0.10 in / $0.50 out
Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
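Since prices are quoted per 1M tokens, the list-price cost of a single request is each token count multiplied by its rate and divided by one million. The sketch below illustrates that arithmetic. The function name and the token counts are hypothetical; the rates are the Osmosis Structure 0.6B list prices quoted on this page.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1m: float,
    output_price_per_1m: float,
) -> float:
    """Return the list-price cost in USD of one request."""
    total = input_tokens * input_price_per_1m + output_tokens * output_price_per_1m
    return total / 1_000_000


# Osmosis Structure 0.6B list prices from this page: $0.100 in, $0.500 out per 1M tokens.
# The token counts (2,000 in, 500 out) are made-up example values.
cost = request_cost_usd(2_000, 500, 0.100, 0.500)
print(f"${cost:.6f}")
```

With these example values the input side contributes $0.000200 and the output side $0.000250. Actual billing may differ from list price, so treat the result as an estimate.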
Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.