Question 1

How much does StepAudio 2.5 TTS cost?

Accepted Answer

StepAudio 2.5 TTS has no published per-token price in our data source. This usually means access is gated behind enterprise sales or an invite. Check StepFun's official pricing page for the most current rates.

Question 2

Does StepAudio 2.5 TTS support tool calling?

Accepted Answer

No. StepAudio 2.5 TTS does not support tool calling (function calling). If your workflow requires it, look at the /capabilities/tool-calling list for alternatives.

Question 3

Does StepAudio 2.5 TTS support structured output / JSON mode?

Accepted Answer

Support for structured output / JSON-schema-constrained decoding is not reported for StepAudio 2.5 TTS in our data source. Verify with StepFun's official documentation before relying on it in production.

Question 4

Can StepAudio 2.5 TTS accept image input?

Accepted Answer

No. StepAudio 2.5 TTS only accepts text as input. If you need image input, see our /capabilities/vision list for current vision-capable models.

Question 5

Is StepAudio 2.5 TTS open-weight?

Accepted Answer

No. StepAudio 2.5 TTS is a proprietary model: only StepFun (and any approved hosting partners) can serve it. No public price is listed in our data source (see Question 1), so confirm API access terms with StepFun directly.

Question 6

What are the best alternatives to StepAudio 2.5 TTS?

Accepted Answer

If StepAudio 2.5 TTS doesn't fit, consider Step 3.5 Flash, Step 3.7 Flash, or Step 3.5 Flash 2603. Each one targets the same use case; see the Related links below for direct head-to-head pages.

Question 7

Where does this data come from?

Accepted Answer

All numbers are normalised into a single canonical model record and reconciled with each provider's official documentation. We re-pull daily and write any changes (price, context, capability) to the /changelog page.
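
As a rough illustration of the daily re-pull described above, the sketch below compares yesterday's canonical record with today's and emits one changelog entry per changed field. The `ModelRecord` fields, the `diff_records` helper, and the sample values are all hypothetical; the answer does not describe the actual schema or pipeline.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass(frozen=True)
class ModelRecord:
    """Hypothetical canonical record; field names are assumptions."""
    model_id: str
    input_price_per_mtok: Optional[float]  # None when no public price exists
    context_tokens: Optional[int]
    supports_tool_calling: Optional[bool]


def diff_records(old: ModelRecord, new: ModelRecord) -> list:
    """Return one changelog entry for each field whose value changed."""
    old_fields = asdict(old)
    return [
        {"model_id": new.model_id, "field": name,
         "old": old_fields[name], "new": value}
        for name, value in asdict(new).items()
        if name != "model_id" and old_fields[name] != value
    ]


# Placeholder values only; these are not real figures for any model.
yesterday = ModelRecord("example-model", None, 8192, False)
today = ModelRecord("example-model", 1.5, 8192, False)

for entry in diff_records(yesterday, today):
    print(entry)
```

Keeping unknown values as `None` rather than `0` lets the diff distinguish "no price published" from "free", which matters for models without a public price.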
