TTS 1 HD is OpenAI's text to speech model, available from 3 providers, starting at $30.00 / 1M input. A high-quality text-to-speech model optimized for audio fidelity, producing natural-sounding speech for non-realtime use cases.
Specifications
Canonical IDopenai-tts-1-hd
TypeText to Speech
StatusActive
CreatorOpenAIOpenAI
Providers
Input ModalitiesText
Output ModalitiesAudio
Release Date · 3 years ago
Benchmarks
Elo Rating
1099
#152

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Audio In
$ / 1K chars
Azure AI Foundry logo
Azure AI Foundry
azure/tts-1-hd
N/A$0.030
OpenAI logo
OpenAI
tts-1-hd
N/A$0.030
Vercel AI Gateway logo
Vercel AI Gateway
openai/tts-1-hd
$30.00N/A

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Inworld Realtime TTS 2Available
Step TTS 2Available
StyleTTS 2Available
TTS HD 2.5Available
TTS 1 HD$30.00Current
TTS 1$15.00Available
Inworld Realtime TTS 1.5 MaxAvailable
Inworld Realtime TTS 1.5 MiniAvailable
Inworld TTS 1 MaxAvailable
Inworld TTS 1.5 MaxAvailable
Inworld TTS 1.5 MiniAvailable

Model IDs

azure/tts-1-hd
openai-tts-1-hd
openai/tts-1-hd
tts-1-hd
tts-1-hd-1106