Seed-TTS 2.0 is ByteDance's text to speech model. A text-to-speech model from ByteDance Seed offering high-quality, expressive speech synthesis as the second generation of the Seed-TTS series.
Specifications
Canonical IDbytedance-seed-tts-2
TypeText to Speech
StatusActive
CreatorByteDanceByteDance
Input ModalitiesText
Output ModalitiesAudio
Benchmarks
Elo Rating
1091
#139

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Seed 2 Lite262K$0.250$2.00Available
Seed 2 Mini262K$0.100$0.400Available
Seed-TTS 2.0Current
Seed 2 CodeAvailable
Seed 2 ProAvailable
Seed 1.6262K$0.250$2.00Available
Seed 1.6 Flash262K$0.075$0.300Available
Seed 1.8256K$0.250$2.00Available

Model IDs