TTS


TTS is OpenAI's text-to-speech model. It converts text into natural-sounding spoken audio for a variety of voice applications. No per-1M input or output price is listed for it (N/A).
Spec
Canonical ID: openai-tts
Type: Text to Speech
Status: Active
Creator: OpenAI
Providers
Input Modalities: Text
Output Modalities: Audio

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Audio
Not supported: Text, Image, Video, Embedding

Capabilities (0/13)
Supported: none
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill
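
The capability matrix above can be encoded as a small lookup so that a caller rejects unsupported requests before sending them. The following is a minimal sketch only: the names `SUPPORTED_INPUTS`, `SUPPORTED_OUTPUTS`, and `is_supported` are hypothetical and not part of any provider API, and the sets simply mirror the listing above (text in, audio out).

```python
# Hypothetical lookup mirroring the capability listing above.
# Only Text input and Audio output are marked as supported.
SUPPORTED_INPUTS = frozenset({"text"})
SUPPORTED_OUTPUTS = frozenset({"audio"})


def is_supported(input_modality: str, output_modality: str) -> bool:
    """Return True if the model accepts the input and can produce the output."""
    return (
        input_modality.lower() in SUPPORTED_INPUTS
        and output_modality.lower() in SUPPORTED_OUTPUTS
    )
```

For example, `is_supported("text", "audio")` is true, while any request with image, audio, video, or PDF input is rejected.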

Pricing by Provider

Provider          Model ID          Tier      Audio In ($ / 1M)
Azure AI Foundry  speech/azure-tts  Standard  $0.015
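
As a rough illustration of how the listed rate turns into a bill, the sketch below multiplies a usage count by the $0.015 per 1M price shown above. The page does not say what the "1M" unit is (characters, tokens, or something else), so `units` is deliberately generic, and the function name `estimate_cost_usd` is an assumption made for this example.

```python
from decimal import Decimal

# Price copied from the pricing row above (Azure AI Foundry, Standard, Audio In).
# The billing unit behind "1M" is not stated on the page; treat it as an assumption.
PRICE_PER_MILLION_USD = Decimal("0.015")


def estimate_cost_usd(
    units: int, price_per_million: Decimal = PRICE_PER_MILLION_USD
) -> Decimal:
    """Return the estimated cost in USD for `units` billed units."""
    if units < 0:
        raise ValueError("units must be non-negative")
    # Decimal avoids binary floating-point rounding on small currency amounts.
    return (Decimal(units) / Decimal(1_000_000)) * price_per_million
```

Under these assumptions, 2,500,000 billed units would cost 2.5 × $0.015 = $0.0375.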

Versions

Version               Released  Context  Input / 1M  Output / 1M  Status
Step TTS 2            -         -        -           -            Available
StyleTTS 2            -         -        -           -            Available
Inworld TTS 1.5 Max   -         -        -           -            Available
Inworld TTS 1.5 Mini  -         -        -           -            Available
Inworld TTS 1 Max     -         -        -           -            Available
TTS-1                 -         -        $0.000      $0.000       Available
TTS-1 HD              -         -        $0.000      $0.000       Available
TTS                   -         -        $0.000      $0.000       Current
Inworld TTS 1         -         -        -           -            Available
LMNT                  -         -        -           -            Available
Microsoft TTS         -         -        -           -            Available

Model IDs