Long Form is Amazon's text-to-speech (TTS) model, optimized for generating long-form spoken audio content.
Specifications
Canonical ID: amazon-long-form
Type: Text to Speech
Status: Active
Creator: Amazon
Providers
Input Modalities: Text
Output Modalities: Audio

Capabilities

Input (1/5 supported)
Text: yes
Image: no
Audio: no
Video: no
PDF: no

Output (1/5 supported)
Text: no
Image: no
Audio: yes
Video: no
Embedding: no

Capabilities (0/13 supported)
Reasoning: no
Adaptive Reasoning: no
Function Calling: no
Parallel Function Calling: no
Structured Outputs: no
Native JSON Schema: no
Web Search: no
URL Context: no
Computer Use: no
Code Execution: no
File Search: no
Prompt Caching: no
Assistant Prefill: no

Pricing by Provider

Provider | Model ID | Standard: Audio In ($ / 1M)
Other/AWS Polly | aws_polly/long-form | $0.100
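
The listed rate can be turned into a rough cost estimate. The sketch below is illustrative only: it assumes the $0.100 figure applies per 1M billed units, and the listing does not say what a unit is (characters, tokens, or something else). The function name `estimate_cost` is hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Rate copied from the listing above: $0.100 per 1M units (Standard tier).
# ASSUMPTION: the billing unit is not stated in the listing.
RATE_PER_MILLION = Decimal("0.100")


def estimate_cost(units: int, rate_per_million: Decimal = RATE_PER_MILLION) -> Decimal:
    """Return the estimated cost in USD for `units` billed units."""
    if units < 0:
        raise ValueError("units must be non-negative")
    # Scale the per-million rate down to the requested number of units.
    return Decimal(units) * rate_per_million / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: estimate the cost of 2.5 million billed units.
    print(estimate_cost(2_500_000))
```

`Decimal` is used instead of `float` so that small per-unit rates do not pick up binary rounding error when summed across many requests.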

Versions

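Version | Released | Context | Input / 1M | Output / 1M | Status
Long Form | - | - | - | - | Current
Generative | - | - | - | - | Available
Instance Segmentation | - | - | - | - | Available
Neural | - | - | - | - | Available
Standard | - | - | - | - | Available
TabTransformer Classification | - | - | - | - | Available
TabTransformer Regression | - | - | - | - | Available
XGBoost Classification | - | - | - | - | Available
XGBoost Regression | - | - | - | - | Available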

Model IDs