Mercury 2 is Inception's language model with a 128K context window and up to 50K output tokens, available from 3 providers, starting at $0.25 / 1M input and $0.75 / 1M output. A fast reasoning diffusion LLM from Inception Labs that produces and refines multiple tokens in parallel, combining reasoning capability with diffusion-based speed.
Specifications
Canonical IDinception-mercury-2
TypeLanguage
StatusActive
CreatorInceptionInception
Providers
Context Window128K tokens
Max Output50K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 4 months ago
Benchmarks
Intelligence Index
25.3
#118
Coding Index
30.6
#108
GPQA
0.8
#127
HLE
0.2
#88
IFBench
0.7
#60
Time to First Token
3.16s
#432
SciCode
0.4
#118
LCR
0.4
#185
TerminalBench Hard
0.3
#110
TAU2
0.7
#132
Output TPS

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities5/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Inception logo
Inception
inception/mercury-2
$0.25$0.75$0.025
OpenRouter logo
OpenRouter
inception/mercury-2
$0.25$0.75$0.025
Vercel AI Gateway logo
Vercel AI Gateway
inception/mercury-2
$0.25$0.75$0.025

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Mercury 2128K$0.250$0.750Current
Mercury 2 Edit32K$0.250$0.750Available
Mercury Coder Small Beta32K$0.250$1.00Available
MercuryAvailable
Mercury CoderAvailable

Model IDs

inception-mercury-2
inception/mercury-2
mercury-2