Step TTS 2
stepfun-ai/step-tts-2Von stepfun-ai · Familie: step · veröffentlicht 2026-03-01
Prices in USD per 1M tokens. Unknown means the provider does not publish per-token pricing.
Fähigkeiten
Model fit scores
0–100 · higher is betterThese scores reward declared capabilities, context size, price and provider availability — they are not benchmark results. Use them as a directional signal alongside your own evaluation.
Coding1
- Tool calling0/40
- Structured output0/20
- Reasoning0/10
- Context window (100K → 1M)0/20
- Provider availability1/10
Agents1
- Tool calling0/35
- Structured output0/25
- Reasoning0/15
- Output token limit0/15
- Provider availability1/10
JSON / structured output0
- Structured output / JSON mode0/50
- Tool calling0/20
- Temperature control0/10
- Price-friendly for high-volume0/20
Cost efficiency0
- Has published price0/100
Long context0
- Context ≥ 100K0/100
Production-readiness30
- Number of independent providers5/40
- Has published per-token price0/20
- Context window ≥ 8K0/15
- No data inconsistencies across providers10/10
- Official model (not derivative)15/15
Cost Efficiency Index
Open full calculator →Estimated cost using the recommended provider's headline rate. Each scenario fixes average input/output tokens — the assumptions are shown in the third column.
This model has no published per-token price, so we can't compute a cost estimate. See the provider's official pricing page for current rates.
Bei 1 Anbietern verfügbar
| Anbieter | Anbieter-Modell-ID | Eingabe / 1M | Ausgabe / 1M | Kontext | Veröffentlicht |
|---|---|---|---|---|---|
| StepFun AI stepfun-ai | step-tts-2 | Unknown | Unknown | Unknown | 2026-03-01 |
Frequently asked questions
How much does Step TTS 2 cost?
Step TTS 2 does not have a publicly published per-token price in our data source. This usually means it is gated behind enterprise sales or invite access. Check stepfun-ai's official pricing page for the most current rates.
Does Step TTS 2 support tool calling?
No. Step TTS 2 does not support tool calling (function calling). If your workflow requires it, look at the /capabilities/tool-calling list for alternatives.
Does Step TTS 2 support structured output / JSON mode?
Support for structured output / JSON-schema-constrained decoding is not reported for Step TTS 2 in our data source. Verify with stepfun-ai's official documentation before relying on it in production.
Can Step TTS 2 accept image input?
No. Step TTS 2 only accepts text as input. If you need image input, see our /capabilities/vision list for current vision-capable models.
Is Step TTS 2 open-weight?
No. Step TTS 2 is a proprietary model — only stepfun-ai (and any approved hosting partners) can serve it. The pricing above reflects the cheapest API access.
What are the best alternatives to Step TTS 2?
If Step TTS 2 doesn't fit, consider Step 2 (16K), Step 3.5 Flash, StepAudio 2.5 ASR. Each one targets the same use case — see the Related links below for direct head-to-head pages.
Where does this data come from?
All numbers are normalised into a single canonical model record and reconciled with each provider's official documentation. We re-pull daily and write any changes (price, context, capability) to the /changelog page.
Explore more
More stepfun-ai models
- Step 2 (16K)$5.21 in / $16.44 out
- Step 3.5 Flash$0.10 in / $0.30 out
- StepAudio 2.5 ASRUnknown pricing
- StepAudio 2.5 TTSUnknown pricing
- Step 3.5 Flash 2603$0.10 in / $0.30 out
Zuletzt aktualisiert:
Pricing and capabilities are refreshed daily and reconciled against each provider's official documentation. Always verify critical production decisions with the provider directly.