TTS 1 is OpenAI's text to speech model, available from 3 providers, starting at $15.00 / 1M input. A text-to-speech model optimized for realtime speech synthesis, converting written text into natural-sounding spoken audio.
Specifications
Canonical IDopenai-tts-1
TypeText to Speech
StatusActive
CreatorOpenAIOpenAI
Providers
Input ModalitiesText
Output ModalitiesAudio
Release Date · 3 years ago
Benchmarks
Elo Rating
1091
#160

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Audio In
$ / 1K chars
Azure AI Foundry logo
Azure AI Foundry
azure/tts-1
N/A$0.015
OpenAI logo
OpenAI
tts-1
N/A$0.015
Vercel AI Gateway logo
Vercel AI Gateway
openai/tts-1
$15.00N/A

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Inworld Realtime TTS 2Available
Step TTS 2Available
StyleTTS 2Available
TTS HD 2.5Available
TTS 1$15.00Current
TTS 1 HD$30.00Available
Inworld Realtime TTS 1.5 MaxAvailable
Inworld Realtime TTS 1.5 MiniAvailable
Inworld TTS 1 MaxAvailable
Inworld TTS 1.5 MaxAvailable
Inworld TTS 1.5 MiniAvailable

Model IDs

azure/tts-1
openai-tts-1
openai/tts-1
tts-1
tts-1-1106