Gemini 3.1 Flash TTS is Google's text to speech model. A text-to-speech model in the Gemini 3.1 generation delivering high-quality voice synthesis for multimodal applications.
Specifications
Canonical IDgoogle-gemini-3-1-tts
TypeText to Speech
StatusActive
CreatorGoogleGoogle
Input ModalitiesText
Output ModalitiesAudio
Benchmarks
Elo Rating
1210
#32

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Gemini 3.5 Flash1.0M$1.50$9.00Available
Gemini 3.1 Flash Lite1.0M$0.250$1.50Available
Gemini 3.1 Flash Lite Preview1.0M$0.250$1.50Available
Gemini 3.1 Pro Preview Custom Tools1.0M$2.00$12.00Available
Gemini 3.1 Pro Preview1.0M$2.00$12.00Available
Gemini 3 Flash1.0M$0.500$3.00Available
Gemini 3 Flash Preview1.0M$0.500$3.00Available
Gemini 3 Pro Preview1.0M$2.00$12.00Deprecated
Gemini 3 Pro Image66K$2.00$12.00Available
Gemini 3.1 Flash TTSCurrent
Gemini 3 Pro$2.00$12.00Available

Model IDs