Name: Mercury 2
Brand: Inception

Mercury 2 is Inception's language model with a 128K context window and up to 50K output tokens, available from 2 providers, starting at $0.250 / 1M input and $0.750 / 1M output. A fast reasoning diffusion LLM from Inception Labs that produces and refines multiple tokens in parallel, combining reasoning capability with diffusion-based speed.

Specifications
Canonical ID	`inception-mercury-2`
Type	Language
Status	Active
Creator	Inception
Providers	OpenRouter Vercel AI Gateway
Context Window	128K tokens
Max Output	50K tokens
Input Modalities	Text
Output Modalities	Text
Reasoning Efforts	default
Release Date	2026-03-04 · 3 months ago

Benchmarks
Intelligence Index	32.8 #115
Coding Index	30.6 #102
GPQA	0.8 #121
HLE	0.2 #82
IFBench	0.7 #56
Time to First Token	2.96s #424
SciCode	0.4 #112
LCR	0.4 #178
TerminalBench Hard	0.3 #104
TAU2	0.7 #124
Output TPS	718.8 #1

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities4/13

Reasoning✓

Adaptive Reasoning·

Function Calling✓

Parallel Function Calling·

Structured Outputs✓

Native JSON Schema✓

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M	Cache Read $ / 1M
OpenRouter inception/mercury-2	$0.250	$0.750	$0.025
Vercel AI Gateway inception/mercury-2	$0.250	$0.750	$0.025

Cost Calculator

Preset:

Input tokens

Output tokens

Reasoning tokens

Number of calls

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Mercury 2	2026-03-04	128K	$0.250	$0.750	Current
Mercury Coder Small Beta	2025-02-26	32K	$0.250	$1.00	Available
Mercury	—	—	—	—	Available
Mercury Coder	—	—	—	—	Available

Mercury 2

Capabilities

Pricing by Provider

Cost Calculator

Versions

Model IDs