Provider · 2026-05-12
venice
| Model | Input / 1M | Output / 1M | Context | Providers | Tags |
|---|---|---|---|---|---|
| Venice Uncensored 1.2 | $0.200 | $0.900 | 128K | 1 | tools · json · vision · open-weights |
| Mercury 2 | $0.313 | $0.938 | 128K | 1 | tools · json · reasoning |
| Trinity Large Thinking | $0.313 | $1.13 | 256K | 1 | tools · json · reasoning · open-weights |
| Venice Role Play Uncensored | $0.500 | $2.00 | 128K | 1 | tools · json · vision · open-weights |
| Aion 2.0 | $1.00 | $2.00 | 128K | 1 | reasoning |
| Gemma 4 Uncensored (derivative) | $0.163 | $0.500 | 256K | 1 | tools · json · vision · open-weights |
| Grok 4.1 Fast (derivative) | $0.230 | $0.570 | 1M | 1 | tools · json · reasoning · vision |
| GPT-4o Mini (derivative) | $0.188 | $0.750 | 128K | 1 | tools · json · vision |
| Nemotron Cascade 2 30B A3B (derivative) | $0.140 | $0.800 | 256K | 1 | tools · json · reasoning · open-weights |
| GLM 4.7 Flash Heretic (derivative) | $0.140 | $0.800 | 200K | 1 | tools · json · reasoning · open-weights |
| MiniMax M2.5 (derivative) | $0.340 | $1.19 | 198K | 1 | tools · reasoning |
| Qwen 3 Coder 480B Turbo (derivative) | $0.350 | $1.50 | 256K | 1 | tools · json · open-weights |
| MiniMax M2.7 (derivative) | $0.375 | $1.50 | 198K | 1 | tools · reasoning |
| Qwen 3 Next 80b (derivative) | $0.350 | $1.90 | 256K | 1 | tools · json · open-weights |
| Grok 4.20 (derivative) | $1.42 | $2.83 | 2M | 1 | tools · json · reasoning · vision |
| GPT-5.4 Mini (derivative) | $0.938 | $5.63 | 400K | 1 | tools · json · reasoning · vision |
| GPT-5.3 Codex (derivative) | $2.19 | $17.50 | 400K | 1 | tools · json · reasoning · vision |
| GPT-5.2 (derivative) | $2.19 | $17.50 | 256K | 1 | tools · json · reasoning |
| GPT-5.2 Codex (derivative) | $2.19 | $17.50 | 256K | 1 | tools · json · reasoning · vision |
| GPT-5.4 (derivative) | $3.13 | $18.80 | 1M | 1 | tools · json · reasoning · vision |
| GPT-5.5 (derivative) | $6.25 | $37.50 | 1M | 1 | tools · json · reasoning · vision |
| Claude Opus 4.6 Fast (derivative) | $36.00 | $180.00 | 1M | 1 | tools · json · reasoning · vision |
| GPT-5.5 Pro (derivative) | $37.50 | $225.00 | 1M | 1 | tools · json · reasoning · vision |
| GPT-5.4 Pro (derivative) | $37.50 | $225.00 | 1M | 1 | tools · json · reasoning · vision |
Frequently asked questions
How many AI models does venice offer?
We track 5 canonical venice models plus 19 community fine-tunes and derivatives, which are labeled "derivative" in the table above. The list is recomputed daily from models.dev.
Which venice model is the cheapest?
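Venice Uncensored 1.2 is currently the lowest-priced canonical venice model, at $0.200 per 1M input tokens and $0.900 per 1M output tokens. Several derivatives in the table list lower input prices, starting at $0.140 per 1M tokens. For the full apples-to-apples list, see /pricing/cheapest-llm-api.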
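Because prices are quoted per 1M tokens, the cost of a single request is a simple proportion. The following is a minimal sketch of that arithmetic, not an official calculator: the function name `request_cost_usd` and the token counts are hypothetical illustration values, and only the two list prices come from the table.

```python
TOKENS_PER_MILLION = 1_000_000


def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate the list-price cost of one request in USD.

    Prices are given in USD per 1M tokens, as in the table.
    """
    input_cost = input_tokens / TOKENS_PER_MILLION * input_price_per_m
    output_cost = output_tokens / TOKENS_PER_MILLION * output_price_per_m
    return input_cost + output_cost


# Venice Uncensored 1.2 list prices: $0.200 in, $0.900 out per 1M tokens.
# The token counts (10,000 in, 2,000 out) are made up for illustration.
cost = request_cost_usd(10_000, 2_000, 0.200, 0.900)
print(f"${cost:.4f}")  # $0.0038
```

Output tokens are priced several times higher than input tokens for every model listed, so workloads with long completions can rank models differently than the input price alone suggests.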
Which venice model has the largest context window?
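Trinity Large Thinking leads the canonical models at 256K tokens; derivatives in the table go up to 2M (Grok 4.20). The context window is the combined total of prompt and completion tokens.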
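Since the context window covers prompt and completion together, the room left for a completion shrinks as the prompt grows. A minimal sketch of that budget check follows; the helper name `max_completion_tokens` is hypothetical, and reading "256K" as exactly 256,000 tokens is an assumption (a provider may mean 262,144 or enforce a separate output cap).

```python
def max_completion_tokens(context_window: int, prompt_tokens: int) -> int:
    """Return how many tokens remain for the completion.

    The context window is the total of prompt + completion, so the
    remainder is the window minus the prompt, never below zero.
    """
    if context_window < 0 or prompt_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return max(context_window - prompt_tokens, 0)


# Assumption: "256K" is taken as 256,000 tokens for illustration.
CONTEXT_256K = 256_000

print(max_completion_tokens(CONTEXT_256K, 200_000))  # 56000
print(max_completion_tokens(CONTEXT_256K, 300_000))  # 0 (prompt alone overflows)
```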
Which venice models support tool calling?
Most venice models support tool calling, Venice Uncensored 1.2 among them. The Tags column in the table above marks each tool-calling model with 'tools'; Aion 2.0 is the only listed model without that tag.
Which venice models accept image input?
Venice Uncensored 1.2 accepts image input. Other vision-capable venice models are tagged 'vision' in the table above. See /capabilities/vision for a cross-vendor comparison.
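Capability questions like the two above reduce to filtering rows on the Tags column. Here is a small sketch, assuming the tags arrive as a single '·'-separated string as they appear in the table; the `ROWS` list copies only the five canonical rows, and the helper names `parse_tags` and `models_with_tag` are hypothetical.

```python
# (model name, Tags cell) pairs copied from the five canonical table rows.
ROWS = [
    ("Venice Uncensored 1.2", "tools · json · vision · open-weights"),
    ("Mercury 2", "tools · json · reasoning"),
    ("Trinity Large Thinking", "tools · json · reasoning · open-weights"),
    ("Venice Role Play Uncensored", "tools · json · vision · open-weights"),
    ("Aion 2.0", "reasoning"),
]


def parse_tags(cell: str) -> set[str]:
    """Split a Tags cell on the middle dot and trim whitespace."""
    return {tag.strip() for tag in cell.split("·") if tag.strip()}


def models_with_tag(rows: list[tuple[str, str]], tag: str) -> list[str]:
    """Return model names whose Tags cell contains the given tag."""
    return [name for name, cell in rows if tag in parse_tags(cell)]


print(models_with_tag(ROWS, "vision"))
# ['Venice Uncensored 1.2', 'Venice Role Play Uncensored']
print(models_with_tag(ROWS, "tools"))
# ['Venice Uncensored 1.2', 'Mercury 2', 'Trinity Large Thinking', 'Venice Role Play Uncensored']
```

Parsing into a set makes the match exact, so a tag such as 'json' cannot accidentally match a longer tag that merely contains it.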
What are the best alternatives to venice?
It depends on the use case. For raw cost savings, see /pricing/cheapest-llm-api; for agent-oriented workloads, see /best/best-ai-model-for-agents; for long-document workflows, see /best/best-long-context-llm.
How fresh is this venice pricing data?
Daily. Our pipeline pulls models.dev each morning and rebuilds these pages on data change, so list-price moves and new model releases land within roughly 24 hours.
Explore more
Top venice models
- Venice Uncensored 1.2: $0.20 in / $0.90 out
- Mercury 2: $0.31 in / $0.94 out
- Trinity Large Thinking: $0.31 in / $1.13 out
- Venice Role Play Uncensored: $0.50 in / $2.00 out
- Aion 2.0: $1.00 in / $2.00 out
Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.