Amazon logo

Long Form


Long Form is Amazon logoAmazon's text to speech model, starting at $N/A / 1M input and $N/A / 1M output. Amazon's TTS audio speech model optimized for generating long-form spoken audio content.
Spec
Canonical IDamazon-long-form
TypeText to Speech
StatusActive
CreatorAmazonAmazon
Providers
Input ModalitiesText
Output ModalitiesAudio

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Audio In
$ / 1M
Other/AWS Polly
aws_polly/long-form
$0.100

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Long Form$0.000$0.000Current
Generative$0.000$0.000Available
Instance SegmentationAvailable
Neural$0.000$0.000Available
Standard$0.000$0.000Available
TabTransformer ClassificationAvailable
TabTransformer RegressionAvailable
XGBoost ClassificationAvailable
XGBoost RegressionAvailable

Model IDs