Long Form is Amazon's text to speech model. Amazon's TTS audio speech model optimized for generating long-form spoken audio content.
Specifications
Canonical IDamazon-long-form
TypeText to Speech
StatusActive
CreatorAmazonAmazon
Providers
Input ModalitiesText
Output ModalitiesAudio

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Audio In
$ / 1K chars
AWS Polly
aws_polly/long-form
$0.100

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Long FormCurrent
GenerativeAvailable
Instance SegmentationAvailable
NeuralAvailable
StandardAvailable
TabTransformer ClassificationAvailable
TabTransformer RegressionAvailable
XGBoost ClassificationAvailable
XGBoost RegressionAvailable

Model IDs

amazon-long-form
aws_polly/long-form