AI Model Intelligence

Provider · 2026-05-12

venice

5 canonical models24 total entries (including derivatives)
ModelInput / 1MOutput / 1MContextProvidersTags
Venice Uncensored 1.2$0.200$0.900128K1tools · json · vision · open-weights
Mercury 2$0.313$0.938128K1tools · json · reasoning
Trinity Large Thinking$0.313$1.13256K1tools · json · reasoning · open-weights
Venice Role Play Uncensored$0.500$2.00128K1tools · json · vision · open-weights
Aion 2.0$1.00$2.00128K1reasoning
Gemma 4 Uncensoredderivative$0.163$0.500256K1tools · json · vision · open-weights
Grok 4.1 Fastderivative$0.230$0.5701M1tools · json · reasoning · vision
GPT-4o Miniderivative$0.188$0.750128K1tools · json · vision
Nemotron Cascade 2 30B A3Bderivative$0.140$0.800256K1tools · json · reasoning · open-weights
GLM 4.7 Flash Hereticderivative$0.140$0.800200K1tools · json · reasoning · open-weights
MiniMax M2.5derivative$0.340$1.19198K1tools · reasoning
Qwen 3 Coder 480B Turboderivative$0.350$1.50256K1tools · json · open-weights
MiniMax M2.7derivative$0.375$1.50198K1tools · reasoning
Qwen 3 Next 80bderivative$0.350$1.90256K1tools · json · open-weights
Grok 4.20derivative$1.42$2.832M1tools · json · reasoning · vision
GPT-5.4 Miniderivative$0.938$5.63400K1tools · json · reasoning · vision
GPT-5.3 Codexderivative$2.19$17.50400K1tools · json · reasoning · vision
GPT-5.2derivative$2.19$17.50256K1tools · json · reasoning
GPT-5.2 Codexderivative$2.19$17.50256K1tools · json · reasoning · vision
GPT-5.4derivative$3.13$18.801M1tools · json · reasoning · vision
GPT-5.5derivative$6.25$37.501M1tools · json · reasoning · vision
Claude Opus 4.6 Fastderivative$36.00$180.001M1tools · json · reasoning · vision
GPT-5.5 Proderivative$37.50$225.001M1tools · json · reasoning · vision
GPT-5.4 Proderivative$37.50$225.001M1tools · json · reasoning · vision

Frequently asked questions

How many AI models does venice offer?

We track 5 canonical venice models plus 19 community fine-tunes / derivatives (excluded from the main table). The list is recomputed daily from models.dev.

Which venice model is the cheapest?

Venice Uncensored 1.2 is currently the lowest-priced venice model, at $0.200 per 1M input tokens and $0.900 per 1M output tokens. For the full apples-to-apples list, see /pricing/cheapest-llm-api.

Which venice model has the largest context window?

Trinity Large Thinking leads at 256K tokens. This is the total of prompt + completion.

Which venice models support tool calling?

Multiple venice models support tool calling, with Venice Uncensored 1.2 being a popular pick. The capability column in the table above marks every model with venice tool-calling support.

Which venice models accept image input?

Venice Uncensored 1.2 accepts image input. Other vision-capable venice models are tagged 'vision' in the table above. See /capabilities/vision for a cross-vendor comparison.

What are the best alternatives to venice?

Depends on the use case. For raw cost savings, look at /pricing/cheapest-llm-api. For agent-oriented workloads, /best/best-ai-model-for-agents. For long-document workflows, /best/best-long-context-llm.

How fresh is this venice pricing data?

Daily. Our pipeline pulls models.dev each morning and rebuilds these pages on data change, so list-price moves and new model releases land within roughly 24 hours.

Last updated:

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Data is sourced from models.dev and normalized for comparison. Prices and capabilities may change. Always verify critical production decisions with the provider's official documentation.