$0.180 輸入 / $0.450 輸出
- 上下文: 2M
- 服務商: 7
- 工具呼叫
- 推理
- 視覺
AI 模型推薦 · 2026-06-29
支援 image input + text output 的視覺語言模型推薦。涵蓋 OCR、UI 截圖分析、圖表理解、產品照片分析等多模態場景。
$0.180 輸入 / $0.450 輸出
$0.180 輸入 / $0.450 輸出
$0.170 輸入 / $0.660 輸出
$0.170 輸入 / $0.660 輸出
$1.25 輸入 / $2.50 輸出
$1.25 輸入 / $2.50 輸出
$0.075 輸入 / $0.300 輸出
$0.090 輸入 / $0.360 輸出
$0.080 輸入 / $0.400 輸出
$0.100 輸入 / $0.400 輸出
Same shortlist sliced four ways — pick the tier that matches your budget and constraints.
Lowest total per-1M-token cost in this list ($0.38).
Lowest-cost option that still meets the use case. Pick this when you have high volume or strict unit-economics.
Median price ($0.63) — typically the safest default.
Good-enough quality at a mid-tier price. The default choice for most production apps.
Highest-priced pick in the list ($3.75) — usually the flagship.
Highest-capability model in this list. Pick when accuracy or reasoning matters more than cost.
Open weights and the cheapest in that subset ($0.48).
Open weights — self-host on your own GPUs, fine-tune on private data, run offline. Pricing here reflects the cheapest API host.
目前我們把 xAI 的 Grok 4 Fast (Reasoning) 排在第一,主要原因是它支援圖像輸入、定價公開,且在視覺模型中擁有最佳的上下文 / 成本比。排名根據實時模型 metadata 自動重算 —— 詳細規則見上方「我們的篩選邏輯」。
Gemini 2.0 Flash-Lite(Google)是榜單中價格最低的,輸入每百萬 token $0.075,輸出每百萬 token $0.300。其它入選項的價格依次往上遞增。
每一項都來自我們 use-case-rules 設定中的一條程式化規則:先用硬過濾條件(例如工具呼叫必選、上下文 ≥ 100K)篩選,再用一組結合能力、上下文視窗與價格的數值打分。我們從不手動調整排序,但會手動迭代規則。底層模型資料每日從我們歸一化後的 canonical 目錄同步。
底層模型資料每天刷新一次,資料有變更時靜態頁面會重新建置。下方「最近更新」標註的就是最近一次建置日期。
最近更新:
Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.