Standard is Amazon's text to speech model. Amazon's standard text-to-speech audio model providing broad voice coverage for basic speech synthesis workloads.
Specifications
Canonical IDamazon-standard
TypeText to Speech
StatusActive
CreatorAmazonAmazon
Providers
Input ModalitiesText
Output ModalitiesAudio

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Audio In
$ / 1M
Other/AWS Polly
aws_polly/standard
$0.0040

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
StandardCurrent
GenerativeAvailable
Instance SegmentationAvailable
Long FormAvailable
NeuralAvailable
TabTransformer ClassificationAvailable
TabTransformer RegressionAvailable
XGBoost ClassificationAvailable
XGBoost RegressionAvailable

Model IDs