MiniMax logo

Speech-02-HD


Speech-02-HD is MiniMax logoMiniMax's text to speech model, starting at $N/A / 1M input and $N/A / 1M output. The high-definition tier of MiniMax's second-generation text-to-speech model, designed for premium-quality voice output.
Spec
Canonical IDminimax-speech-2-hd
TypeText to Speech
StatusActive
CreatorMiniMaxMiniMax
Providers
Input ModalitiesText
Output ModalitiesAudio
Elo Rating
1121
#88

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Audio In
$ / 1M
MiniMax logo
MiniMax
speech-02-hd
$0.100

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Speech 2.8 HDAvailable
Speech 2.8 TurboAvailable
Speech 2.6 HD$0.000$0.000Available
Speech 2.6 Turbo$0.000$0.000Available
Speech-02-HD$0.000$0.000Current
Speech-02-Turbo$0.000$0.000Available
Fish Speech 1.5Available

Model IDs