AI 모델 인텔리전스

Phi 4 Multimodal

nvidia/phi-4-multimodal-instruct

제공: NVIDIA · 출시 2025-07-26

⚠ 이 모델은 커뮤니티 파인튜닝 / 파생본으로, 벤더 공식 릴리스가 아닙니다.

미공개
입력 / 1M 토큰
미공개
출력 / 1M 토큰
128K
컨텍스트 창
16K
최대 출력

Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.

기능

도구 호출추론구조화 출력첨부오픈 웨이트? 온도 제어
모달리티: 입력 text · 출력 text

Model fit scores

0–100 · higher is better

These scores reward declared capabilities, context size, price and provider availability — they are not benchmark results. Use them as a directional signal alongside your own evaluation.

Coding3
  • Tool calling0/40
  • Structured output0/20
  • Reasoning0/10
  • Context window (100K → 1M)2/20
  • Provider availability1/10
Agents11
  • Tool calling0/35
  • Structured output0/25
  • Reasoning0/15
  • Output token limit10/15
  • Provider availability1/10
JSON / structured output0
  • Structured output / JSON mode0/50
  • Tool calling0/20
  • Temperature control0/10
  • Price-friendly for high-volume0/20
Cost efficiency0
  • Has published price0/100
Long context35
  • Context window (100K → 2M)35/90
  • Has published price for full window0/10
Production-readiness30
  • Number of independent providers5/40
  • Has published per-token price0/20
  • Context window ≥ 8K15/15
  • No data inconsistencies across providers10/10
  • Official model (not derivative)0/15

Cost Efficiency Index

Open full calculator →

Estimated cost using the recommended provider's headline rate. Each scenario fixes average input/output tokens — the assumptions are shown in the third column.

This model has no published per-token price, so we can't compute a cost estimate. See the provider's official pricing page for current rates.

1곳 제공사에서 이용 가능

제공자제공자 모델 ID입력 / 1M출력 / 1M컨텍스트출시일
Nvidia
nvidia
microsoft/phi-4-multimodal-instructUnknownUnknown128K2025-07-26

Frequently asked questions

How much does Phi 4 Multimodal cost?

Phi 4 Multimodal does not have a publicly published per-token price in our data source. This usually means it is gated behind enterprise sales or invite access. Check NVIDIA's official pricing page for the most current rates.

What is the context window of Phi 4 Multimodal?

Phi 4 Multimodal has a context window of 128K tokens, with a max output of 16K tokens per reply. This is the total combined size of prompt + completion.

Does Phi 4 Multimodal support tool calling?

No. Phi 4 Multimodal does not support tool calling (function calling). If your workflow requires it, look at the /capabilities/tool-calling list for alternatives.

Does Phi 4 Multimodal support structured output / JSON mode?

No. Phi 4 Multimodal does not support structured output / JSON-schema-constrained decoding. If your workflow requires it, look at the /capabilities/structured-output list for alternatives.

Can Phi 4 Multimodal accept image input?

No. Phi 4 Multimodal only accepts text as input. If you need image input, see our /capabilities/vision list for current vision-capable models.

Is Phi 4 Multimodal open-weight?

No. Phi 4 Multimodal is a proprietary model — only NVIDIA (and any approved hosting partners) can serve it. The pricing above reflects the cheapest API access.

What are the best alternatives to Phi 4 Multimodal?

If Phi 4 Multimodal doesn't fit, consider Nemotron 3 Super, nemotron-3-nano-30b-a3b, Nemotron 3 Ultra 550B A55B. Each one targets the same use case — see the Related links below for direct head-to-head pages.

Where does this data come from?

All numbers are normalised into a single canonical model record and reconciled with each provider's official documentation. We re-pull daily and write any changes (price, context, capability) to the /changelog page.

More NVIDIA models

마지막 업데이트:

Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.