Embed 3 Multilingual Image is Cohere's embedding model with a 512 context window, starting at $0.100 / 1M input. A multilingual image embedding model that generates vector representations for cross-lingual visual search and retrieval tasks.
Specifications
Canonical IDcohere-embed-3-multilingual-image
TypeEmbedding
StatusActive
CreatorCohereCohere
Providers
Context Window512 tokens
Input ModalitiesImage
Output ModalitiesEmbedding
Embedding Dimensions1024

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio·
Video·
Embedding
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Text Embedding 52K$0.025Available
Embed 4128K$0.120$0.470Available
Embed 4 Img$0.470Available
Embed 4 Txt$0.120Available
Text Embedding 42K$0.100Deprecated
Voyage 432K$0.060Available
Voyage 4 Large32K$0.120Available
Voyage 4 Lite32K$0.020Available
Voyage 3.532K$0.060Available
Voyage 3.5 Lite32K$0.020Available
Embed 3 Multilingual Image512$0.100Current

Model IDs