Voyage Multimodal 3 is Voyage's embedding model with a 32K context window, available from 2 providers, starting at $0.12 / 1M input and $0.12 / 1M output. Voyage AI's multimodal embedding model supporting joint text and image retrieval for cross-modal AI applications.
Capabilities
Input1/5
Text✓
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image·
Audio·
Video·
Embedding✓
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·
Pricing by Provider
US Dollar ($)
Per 1M tokens
Cost Calculator
US Dollar ($)
Preset:
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| Text Embedding 5 | 2K | $0.025 | — | Available | |
| Embed 4 | 128K | $0.120 | $0.470 | Available | |
| Embed 4 Img | — | — | $0.470 | — | Available |
| Embed 4 Txt | — | — | $0.120 | — | Available |
| Text Embedding 4 | — | 2K | $0.100 | — | Deprecated |
| Voyage 4 | — | 32K | $0.060 | — | Available |
| Voyage 4 Large | — | 32K | $0.120 | — | Available |
| Voyage 4 Lite | — | 32K | $0.020 | — | Available |
| Voyage 3.5 | 32K | $0.060 | — | Available | |
| Voyage 3.5 Lite | 32K | $0.020 | — | Available | |
| Voyage Multimodal 3 | — | 32K | $0.120 | $0.120 | Current |