Gemini Embedding 2 is Google's embedding model with an 8K context window and up to 128 output tokens, available from 3 providers, starting at $0.200 per 1M input tokens. It is Google's first fully multimodal embedding model, capable of mapping text, images, video, audio, and PDFs into a unified vector space.
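Because all modalities land in one vector space, embeddings from different input types can in principle be compared directly. The sketch below illustrates that idea with cosine similarity, a common choice for comparing embeddings; it is not taken from the model's documentation. The function name and the short 4-dimensional vectors are made up for illustration and are not real model output.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical placeholder vectors standing in for real embeddings
# (assumption: real embeddings have far more dimensions).
text_vec = [0.1, 0.3, 0.5, 0.1]     # e.g. a text caption
image_vec = [0.1, 0.25, 0.55, 0.05]  # e.g. an image matching the caption
audio_vec = [0.9, -0.2, 0.0, 0.1]    # e.g. an unrelated audio clip

print(cosine_similarity(text_vec, image_vec))
print(cosine_similarity(text_vec, audio_vec))
```

With these placeholder values the text and image vectors score higher than the text and audio vectors, which is the pattern one would hope to see for related versus unrelated content.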
Capabilities
Input 5/5
Text ✓
Image ✓
Audio ✓
Video ✓
PDF ✓
Output 1/5
Text ·
Image ·
Audio ·
Video ·
Embedding ✓
Capabilities 0/13
Reasoning ·
Adaptive Reasoning ·
Function Calling ·
Parallel Function Calling ·
Structured Outputs ·
Native JSON Schema ·
Web Search ·
URL Context ·
Computer Use ·
Code Execution ·
File Search ·
Prompt Caching ·
Assistant Prefill ·
Pricing by Provider
| Provider | Standard Input $ / 1M | Standard Audio In $ / 1M | Batch Input $ / 1M | Batch Audio In $ / 1M |
|---|---|---|---|---|
| Google Gemini | $0.200 | $6.50 | $0.100 | $3.25 |
| Google Vertex AI | $0.200 | $6.50 | $0.100 | $3.25 |
| Vercel AI Gateway | $0.200 | N/A | — | — |
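The rates in the table above can be turned into a rough cost estimate. The following is a minimal sketch, assuming billing is linear in token count and that regular input tokens and audio input tokens are metered separately; the function and dictionary names are hypothetical, and a missing or "N/A" price is treated as "not offered", which is an interpretation of the table rather than a documented rule.

```python
# USD per 1M tokens, copied from the pricing table.
# None marks a rate listed as N/A; a missing tier marks one with no price shown.
RATES = {
    "Google Gemini": {
        "standard": {"input": 0.200, "audio": 6.50},
        "batch": {"input": 0.100, "audio": 3.25},
    },
    "Google Vertex AI": {
        "standard": {"input": 0.200, "audio": 6.50},
        "batch": {"input": 0.100, "audio": 3.25},
    },
    "Vercel AI Gateway": {
        "standard": {"input": 0.200, "audio": None},
    },
}


def estimate_cost(provider, input_tokens, audio_tokens=0, tier="standard"):
    """Estimate cost in USD, assuming simple linear per-token pricing."""
    tiers = RATES[provider]
    if tier not in tiers:
        raise ValueError(f"{provider} lists no {tier} pricing")
    rates = tiers[tier]
    cost = input_tokens / 1_000_000 * rates["input"]
    if audio_tokens:
        if rates["audio"] is None:
            raise ValueError(f"{provider} lists no audio input pricing")
        cost += audio_tokens / 1_000_000 * rates["audio"]
    return cost


# Example: 5M input tokens on Google Gemini, standard vs. batch.
print(estimate_cost("Google Gemini", 5_000_000))
print(estimate_cost("Google Gemini", 5_000_000, tier="batch"))
```

At the listed rates, batch processing costs half as much as standard for both regular and audio input on the two Google providers.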
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| Text Embedding 5 | — | 2K | $0.025 | — | Available |
| Embed 4 | — | 128K | $0.120 | $0.470 | Available |
| Embed 4 Img | — | — | $0.470 | — | Available |
| Embed 4 Txt | — | — | $0.120 | — | Available |
| Text Embedding 4 | — | 2K | $0.100 | — | Deprecated |
| Voyage 4 | — | 32K | $0.060 | — | Available |
| Voyage 4 Large | — | 32K | $0.120 | — | Available |
| Voyage 4 Lite | — | 32K | $0.020 | — | Available |
| Voyage 3.5 | — | 32K | $0.060 | — | Available |
| Voyage 3.5 Lite | — | 32K | $0.020 | — | Available |
| Gemini Embedding 2 | — | 8K | $0.200 | — | Current |