Standard is Amazon's standard text-to-speech model, providing broad voice coverage for basic speech synthesis workloads.
Specifications
Canonical ID: amazon-standard
Type: Text to Speech
Status: Active
Creator: Amazon
Providers: AWS Polly
Input Modalities: Text
Output Modalities: Audio

Capabilities

Input (1 of 5 supported)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1 of 5 supported)
Supported: Audio
Not supported: Text, Image, Video, Embedding

Capabilities (0 of 13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Prices in US Dollars ($).

Provider     Model ID              Audio In ($ / 1K chars)
AWS Polly    aws_polly/standard    $0.0040
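
The listed rate can be turned into a rough cost estimate. The sketch below assumes billing is a straight multiple of the character count at $0.0040 per 1K characters, with no rounding, minimum charge, or free tier; the function name and constant are illustrative and not part of any provider API.

```python
from decimal import Decimal

# Listed rate for aws_polly/standard: $0.0040 per 1,000 characters.
PRICE_PER_1K_CHARS_USD = Decimal("0.0040")


def estimate_cost_usd(num_chars: int) -> Decimal:
    """Estimate synthesis cost in USD for a given character count.

    Assumption: cost scales linearly with characters, with no rounding,
    minimum charge, or free-tier allowance applied.
    """
    if num_chars < 0:
        raise ValueError("num_chars must be non-negative")
    return Decimal(num_chars) / Decimal(1000) * PRICE_PER_1K_CHARS_USD


if __name__ == "__main__":
    sample = "Hello from a standard text-to-speech voice."
    print(f"{len(sample)} chars -> ${estimate_cost_usd(len(sample))}")
```

At this rate, one million characters comes to $4.00.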

Versions

Version                         Status
Standard                        Current
Generative                      Available
Instance Segmentation           Available
Long Form                       Available
Neural                          Available
TabTransformer Classification   Available
TabTransformer Regression       Available
XGBoost Classification          Available
XGBoost Regression              Available

Model IDs

amazon-standard
aws_polly/standard
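
For orientation, here is a minimal sketch of requesting speech from this model through the AWS SDK for Python. It assumes the catalog ID `aws_polly/standard` corresponds to Amazon Polly's `standard` engine, that `boto3` is installed, and that AWS credentials and a region are already configured. The voice `Joanna`, the MP3 output format, the output path, and both helper function names are illustrative choices, not taken from this page.

```python
def build_request(text: str, voice_id: str = "Joanna",
                  output_format: str = "mp3") -> dict:
    """Build keyword arguments for Polly's synthesize_speech call.

    Assumption: the catalog's "standard" model maps to Engine="standard".
    """
    return {
        "Engine": "standard",
        "Text": text,
        "VoiceId": voice_id,
        "OutputFormat": output_format,
    }


def synthesize(text: str, out_path: str = "speech.mp3") -> str:
    """Synthesize text to an audio file and return the file path."""
    import boto3  # imported lazily so build_request works without the SDK

    polly = boto3.client("polly")
    response = polly.synthesize_speech(**build_request(text))
    # AudioStream is a streaming body; read it fully and write to disk.
    with open(out_path, "wb") as f:
        f.write(response["AudioStream"].read())
    return out_path
```

Calling `synthesize("Hello, world.")` would write the audio to `speech.mp3` in the working directory, provided the assumptions above hold.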