
Gemini 3.1 Flash TTS


Gemini 3.1 Flash TTS is Google's text-to-speech model in the Gemini 3.1 generation, delivering high-quality voice synthesis for multimodal applications.
Spec

Canonical ID: google-gemini-3-1-tts
Type: Text to Speech
Status: Active
Creator: Google
Input Modalities: Text
Output Modalities: Audio
Elo Rating: 1203 (#34)

Capabilities

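Input (1/5)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
PDF: not supported

Output (1/5)
Text: not supported
Image: not supported
Audio: supported
Video: not supported
Embedding: not supported

Capabilities (0/13)
Reasoning: not supported
Adaptive Reasoning: not supported
Function Calling: not supported
Parallel Function Calling: not supported
Structured Outputs: not supported
Native JSON Schema: not supported
Web Search: not supported
URL Context: not supported
Computer Use: not supported
Code Execution: not supported
File Search: not supported
Prompt Caching: not supported
Assistant Prefill: not supported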

Versions

Version                              Released  Context  Input / 1M  Output / 1M  Status
Gemini 3.1 Flash Lite Preview        -         1.0M     $0.250      $1.50        Available
Gemini 3.1 Pro Preview Custom Tools  -         1.0M     $2.00       $12.00       Available
Gemini 3.1 Pro Preview               -         1.0M     $2.00       $12.00       Available
Gemini 3 Flash                       -         1.0M     $0.500      $3.00        Available
Gemini 3 Flash Preview               -         1.0M     $0.500      $3.00        Available
Gemini 3 Pro Preview                 -         1.0M     $2.00       $12.00       Deprecated
Gemini 3 Pro Image                   -         66K      $2.00       $12.00       Available
Gemini 3.1 Flash TTS                 -         -        -           -            Current
Gemini 3 Deep Think                  -         -        -           -            Available
Gemini 3 Flash Live                  -         -        $0.500      $3.00        Available
Gemini 3 Flash Live Live             -         -        $0.750      $4.50        Available
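
As a rough illustration of how the per-1M prices in the Versions table translate into a per-request cost, here is a minimal sketch. It assumes the "/ 1M" columns mean US dollars per one million tokens, which the page does not state explicitly. The `PRICES` mapping and the `estimate_cost` helper are hypothetical names introduced for this example only. No price is listed for Gemini 3.1 Flash TTS itself, so it is left out.

```python
from decimal import Decimal

# Prices copied from the Versions table (input, output).
# Assumption: the unit is USD per one million tokens.
PRICES = {
    "Gemini 3.1 Flash Lite Preview": (Decimal("0.250"), Decimal("1.50")),
    "Gemini 3.1 Pro Preview": (Decimal("2.00"), Decimal("12.00")),
    "Gemini 3 Flash": (Decimal("0.500"), Decimal("3.00")),
}


def estimate_cost(version, input_tokens, output_tokens):
    """Estimate the USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES[version]
    million = Decimal(1_000_000)
    # Scale each token count by its per-million price, then sum.
    return (input_price * input_tokens + output_price * output_tokens) / million


if __name__ == "__main__":
    # Example: 200,000 input tokens and 10,000 output tokens on Gemini 3 Flash.
    print(estimate_cost("Gemini 3 Flash", 200_000, 10_000))
```

Decimal arithmetic is used here so that small per-token amounts do not pick up binary floating-point rounding error; plain floats would also work for a rough estimate.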

Model IDs