Mercury 2 is Inception's language model with a 128K context window and up to 50K output tokens, available from 2 providers, starting at $0.250 / 1M input and $0.750 / 1M output. A fast reasoning diffusion LLM from Inception Labs that produces and refines multiple tokens in parallel, combining reasoning capability with diffusion-based speed.
Specifications
Canonical IDinception-mercury-2
TypeLanguage
StatusActive
CreatorInceptionInception
Providers
Context Window128K tokens
Max Output50K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 3 months ago
Benchmarks
Intelligence Index
32.8
#115
Coding Index
30.6
#102
GPQA
0.8
#121
HLE
0.2
#82
IFBench
0.7
#56
Time to First Token
2.96s
#424
SciCode
0.4
#112
LCR
0.4
#178
TerminalBench Hard
0.3
#104
TAU2
0.7
#124
Output TPS

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
OpenRouter logo
OpenRouter
inception/mercury-2
$0.250$0.750$0.025
Vercel AI Gateway logo
Vercel AI Gateway
inception/mercury-2
$0.250$0.750$0.025

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Mercury 2128K$0.250$0.750Current
Mercury Coder Small Beta32K$0.250$1.00Available
MercuryAvailable
Mercury CoderAvailable

Model IDs