Mercury 2 is Inception's language model with a 128K context window and up to 50K output tokens, available from 2 providers, starting at $0.250 / 1M input and $0.750 / 1M output. A fast reasoning diffusion LLM from Inception Labs that produces and refines multiple tokens in parallel, combining reasoning capability with diffusion-based speed.
Specifications
Canonical IDinception-mercury-2
TypeLanguage
StatusActive
CreatorInceptionInception
Providers
Context Window128K tokens
Max Output50K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 3 months ago
Benchmarks
Intelligence Index
32.8
#117
Coding Index
30.6
#104
GPQA
0.8
#123
HLE
0.2
#84
IFBench
0.7
#56
Time to First Token
2.68s
#429
SciCode
0.4
#114
LCR
0.4
#180
TerminalBench Hard
0.3
#106
TAU2
0.7
#127
Output TPS

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
OpenRouter logo
OpenRouter
inception/mercury-2
$0.250$0.750$0.025
Vercel AI Gateway logo
Vercel AI Gateway
inception/mercury-2
$0.250$0.750$0.025

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Mercury 2128K$0.250$0.750Current
Mercury Coder Small Beta32K$0.250$1.00Available
MercuryAvailable
Mercury CoderAvailable

Model IDs