Nova 2 Multimodal Embeddings is Amazon's embedding model with a 8K context window, starting at $0.135 / 1M input. Amazon Nova 2 embedding model that encodes both text and image inputs into a shared vector space for multimodal retrieval.
Specifications
Canonical IDamazon-nova-2-multimodal-embeddings
TypeEmbedding
StatusActive
CreatorAmazonAmazon
Providers
Context Window8K tokens
Input ModalitiesAudioImageTextVideo
Output ModalitiesEmbedding
Embedding Dimensions3072
Release Date · 7 months ago

Capabilities

Input4/5
Text
Image
Audio
Video
PDF·
Output1/5
Text·
Image·
Audio·
Video·
Embedding
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Text Embedding 52K$0.025Available
Embed 4128K$0.120$0.470Available
Embed 4 Img$0.470Available
Embed 4 Txt$0.120Available
Text Embedding 42K$0.100Deprecated
Voyage 432K$0.060Available
Voyage 4 Large32K$0.120Available
Voyage 4 Lite32K$0.020Available
Voyage 3.532K$0.060Available
Voyage 3.5 Lite32K$0.020Available
Nova 2 Multimodal Embeddings8K$0.135Current

Model IDs