Chirp is Google's text to speech model. Google's large-scale automatic speech recognition model supporting a wide range of languages with high transcription accuracy.
Specifications
Canonical IDgoogle-chirp
TypeText to Speech
StatusActive
CreatorGoogleGoogle
Providers
Input ModalitiesText
Output ModalitiesAudio

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Audio In
$ / 1M
Google Vertex AI logo
Google Vertex AI
vertex_ai/chirp
$0.030

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Chirp 3Available
Chirp 3: HDAvailable
Chirp 2Available
ChirpCurrent

Model IDs