Tarifs · 2026-06-29

Les APIs LLM les moins chères

Toutes les APIs LLM texte-entrée / texte-sortie classées de la moins chère à la plus chère.

À propos de cette liste

Classement par prix total par token (input + output).
Les modèles à prix $0 fictif (tiers promotionnels, miroirs Github Copilot) sont exclus — « Inconnu » ne signifie pas gratuit.
Utilisez cette liste pour trouver le fournisseur le moins cher répondant à vos besoins de contexte et de fonctionnalités.

#	Modèle	Éditeur	Entrée / 1M	Sortie / 1M	Total	Contexte
1	BGE Reranker Base	cloudflare-ai-gateway	$0.003	Unknown	$0.003	128K
2	Voxtral Small 24B 2507	Mistral	$0.002	$0.002	$0.005	32K
3	All-MiniLM-L6-v2	digitalocean	$0.009	Unknown	$0.009	256
4	Multi-QA-mpnet-base-dot-v1	digitalocean	$0.009	Unknown	$0.009	512
5	Qwen3 Embedding 8B	Alibaba (Qwen)	$0.010	Unknown	$0.010	33K
6	Qwen3 Embedding 0.6B	Alibaba (Qwen)	$0.010	Unknown	$0.010	33K
7	Qwen3 Embedding 4B	Alibaba (Qwen)	$0.010	Unknown	$0.010	33K
8	BGE Reranker v2 M3	digitalocean	$0.010	Unknown	$0.010	8K
9	BGE M3	cloudflare-ai-gateway	$0.012	Unknown	$0.012	128K
10	PLaMo Embedding 1B	cloudflare-ai-gateway	$0.019	Unknown	$0.019	128K
11	Llama 3.2 1B Instruct	Meta	$0.010	$0.010	$0.020	60K
12	text-embedding-3-small	OpenAI	$0.020	Unknown	$0.020	8K
13	llama-3.1-nemotron-safety-guard-8b-v3	NVIDIA	$0.010	$0.010	$0.020	128K
14	Prompt Guard 2 86M	Meta	$0.010	$0.010	$0.020	512
15	Llama Prompt Guard 2 22M	Meta	$0.010	$0.010	$0.020	512
16	E5 Large v2	digitalocean	$0.020	Unknown	$0.020	512
17	BGE M3	digitalocean	$0.020	Unknown	$0.020	8K
18	BGE Small EN v1.5	cloudflare-ai-gateway	$0.020	Unknown	$0.020	128K
19	text-embedding-3-small	azure	$0.020	Unknown	$0.020	8K
20	text-embedding-3-small	azure-cognitive-services	$0.020	Unknown	$0.020	8K
21	DistilBERT SST-2 INT8	cloudflare-ai-gateway	$0.026	Unknown	$0.026	128K
22	Llama 3.2 3B Instruct	Meta	$0.020	$0.020	$0.040	80K
23	PaddleOCR-VL	novita-ai	$0.020	$0.020	$0.040	16K
24	Ling-2.6-flash	openrouter	$0.010	$0.030	$0.040	262K
25	Meta-Llama-3.1-8B-Instruct	Meta	$0.020	$0.030	$0.050	128K
26	Nomic Embed Text v1.5	tinfoil	$0.050	Unknown	$0.050	8K
27	Meta Llama 3.1 8B Instruct Turbo	Meta	$0.020	$0.030	$0.050	128K
28	Mistral Nemo	Mistral	$0.020	$0.040	$0.060	128K
29	Gemma 3n 4B	Google	$0.020	$0.040	$0.060	33K
30	BGE Base EN v1.5	cloudflare-ai-gateway	$0.067	Unknown	$0.067	128K
31	Meta-Llama-3-8B-Instruct	Meta	$0.030	$0.040	$0.070	8K
32	Llama Guard 3 8B	Meta	$0.020	$0.060	$0.080	131K
33	Ministral 3B (latest)	Mistral	$0.040	$0.040	$0.080	128K
34	Ministral 3B	azure	$0.040	$0.040	$0.080	128K
35	Ministral 3B	azure-cognitive-services	$0.040	$0.040	$0.080	128K
36	Llama 3 8B Lunaris	Meta	$0.040	$0.050	$0.090	8K
37	GTE Large (v1.5)	digitalocean	$0.090	Unknown	$0.090	8K
38	Llama-3.2-11B-Vision-Instruct	Meta	$0.049	$0.049	$0.098	128K
39	Mistral Embed	Mistral	$0.100	Unknown	$0.100	8K
40	text-embedding-ada-002	OpenAI	$0.100	Unknown	$0.100	8K
41	L3 8B Stheno V3.2	novita-ai	$0.050	$0.050	$0.100	8K
42	Sao10k L3 8B Lunaris	novita-ai	$0.050	$0.050	$0.100	8K
43	text-embedding-ada-002	azure	$0.100	Unknown	$0.100	8K
44	text-embedding-ada-002	azure-cognitive-services	$0.100	Unknown	$0.100	8K
45	Gemma 3 4B IT	Google	$0.040	$0.080	$0.120	128K
46	MythoMax 13B	kilo	$0.060	$0.060	$0.120	4K
47	MythoMax 13B	openrouter	$0.060	$0.060	$0.120	4K
48	Sarvam 30B	fastrouter	$0.020	$0.100	$0.120	128K
49	IBM Granite 4.0 H Micro	cloudflare-ai-gateway	$0.017	$0.110	$0.127	128K
50	IBM: Granite 4.0 Micro	kilo	$0.017	$0.110	$0.127	131K
51	Granite 4.0 H Micro	cloudflare-workers-ai	$0.017	$0.112	$0.129	131K
52	Granite 4.0 Micro	openrouter	$0.017	$0.112	$0.129	131K
53	text-embedding-3-large	OpenAI	$0.130	Unknown	$0.130	8K
54	Llama 3.1 8B	Meta	$0.050	$0.080	$0.130	131K
55	text-embedding-3-large	azure	$0.130	Unknown	$0.130	8K
56	text-embedding-3-large	azure-cognitive-services	$0.130	Unknown	$0.130	8K
57	Sarvam 30B	nano-gpt	$0.028	$0.111	$0.139	66K
58	Google Gemma 3 27B Instruct	Google	$0.030	$0.110	$0.140	203K
59	baichuan-m2-32b	novita-ai	$0.070	$0.070	$0.140	131K
60	Model Router	azure	$0.140	Unknown	$0.140	128K
61	Model Router	azure-cognitive-services	$0.140	Unknown	$0.140	128K
62	Google Gemma 3 12B	Google	$0.050	$0.100	$0.150	131K
63	Gemini Embedding 001	Google	$0.150	Unknown	$0.150	2K
64	LiquidAI: LFM2-24B-A2B	kilo	$0.030	$0.120	$0.150	33K
65	LFM2-24B-A2B	togetherai	$0.030	$0.120	$0.150	33K
66	LFM2-24B-A2B	openrouter	$0.030	$0.120	$0.150	33K
67	LFM2 24B A2B	nano-gpt	$0.030	$0.120	$0.150	33K
68	IBM: Granite 4.1 8B	kilo	$0.050	$0.100	$0.150	131K
69	Granite 4.1 8B	openrouter	$0.050	$0.100	$0.150	131K
70	Granite 4.1 8B	nano-gpt	$0.050	$0.100	$0.150	131K
71	DeepSeek R1 Distill Llama 70B	Meta	$0.030	$0.130	$0.160	33K
72	gpt-oss-20b	OpenAI	$0.029	$0.140	$0.169	128K
73	R1 Distill Llama 70B	DeepSeek	$0.030	$0.140	$0.170	8K
74	Qwen3 235B A22B 2507	Alibaba (Qwen)	$0.071	$0.100	$0.171	262K
75	Nova Micro	vercel	$0.035	$0.140	$0.175	128K
76	Amazon: Nova Micro 1.0	kilo	$0.035	$0.140	$0.175	128K
77	Nova Micro	amazon-bedrock	$0.035	$0.140	$0.175	128K
78	Nova Micro 1.0	openrouter	$0.035	$0.140	$0.175	128K
79	Amazon Nova Micro 1.0	nano-gpt	$0.036	$0.139	$0.175	128K
80	gpt-oss-120b	OpenAI	$0.030	$0.150	$0.180	128K

Top 80 sur 977 affichés. Voir le reste dans le répertoire complet.

Frequently asked questions

What is the cheapest LLM API right now?

BGE Reranker Base is the lowest-priced AI APIs on this list, at $0.003 per 1M input tokens and Unknown per 1M output tokens. The total column above sums input + output per 1M tokens for direct comparison.

Why are prices shown per 1M tokens?

Almost every commercial LLM provider publishes rates per million tokens. Per-1K-tokens or per-token rates are easy to misread (off by 1000×). Standardising on $/1M lets you compare vendors directly.

Are these prices including or excluding tax?

All prices are the provider's headline list rate in USD, before any volume discounts, prepaid credits, batch-API discounts, prompt-caching discounts or jurisdictional sales tax. Always verify with the provider before committing to a budget.

How often do these prices change?

Vendor list-price moves are typically picked up within hours of an announcement, and our pipeline re-syncs daily. Each change is also written to /changelog so you can audit historical pricing over time.

Why are some models showing 'Unknown' instead of a price?

We deliberately do not coerce missing data to $0. 'Unknown' means the provider does not publish a public rate (often models behind enterprise sales or invite-only access). Treating Unknown as free would push paid-but-unpriced models to the top of every cheap list — broken UX and broken SEO.

Top picks · model details

BGE Reranker Base$0.00 in / $0.00 out
Voxtral Small 24B 2507$0.00 in / $0.00 out
All-MiniLM-L6-v2$0.01 in / $0.00 out
Multi-QA-mpnet-base-dot-v1$0.01 in / $0.00 out
Qwen3 Embedding 8B$0.01 in / $0.00 out

Other pricing comparisons

Best-of lists

Capability comparisons

Tools

Dernière mise à jour : 2026-06-29

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.