# Inception: Mercury Coder

Model ID: `inception/mercury-coder`
Mercury Coder is the first diffusion large language model (dLLM). Built on a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed-optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed lets developers stay in the flow while coding, with rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post here](https://www.inceptionlabs.ai/blog/introducing-mercury).
## Pricing per 1M Tokens

| Type | Price |
| --- | --- |
| Input (Prompt) | $0.25 |
| Output (Completion) | $0.75 |
| Cache Read | $0.02 |
| Cache Write | Free |
| Image | N/A |
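For concreteness, here is a minimal sketch of how these rates translate into a per-request cost. The token counts are made-up examples, and it assumes cached prompt tokens bill at the cache-read rate in place of the normal input rate; check your provider's billing docs for the exact semantics.

```python
# Estimate the dollar cost of one request from the per-1M-token rates above.
# Rates come from the pricing table; the token counts are hypothetical.

INPUT_RATE = 0.25 / 1_000_000       # $ per input (prompt) token
OUTPUT_RATE = 0.75 / 1_000_000      # $ per output (completion) token
CACHE_READ_RATE = 0.02 / 1_000_000  # $ per cached prompt token read

def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Return the estimated cost in dollars for a single request."""
    # Assumption: cached tokens are billed at the cache-read rate instead of
    # the full input rate.
    billed_input = input_tokens - cached_tokens
    return (
        billed_input * INPUT_RATE
        + output_tokens * OUTPUT_RATE
        + cached_tokens * CACHE_READ_RATE
    )

# Example: a 4,000-token prompt (1,000 of it cached) and a 1,500-token completion.
print(f"${request_cost(4_000, 1_500, cached_tokens=1_000):.6f}")  # $0.001895
```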
## Specifications

| Spec | Value |
| --- | --- |
| Context Length | 128K |
| Max Output Tokens | 32K |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Other |
| Instruct Type | N/A |
| Top Provider Context | 128K |
| Top Provider Max Output | 32K |
| Moderated | No |
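As a usage illustration, here is a minimal sketch of calling the model through an OpenAI-compatible chat completions endpoint. The OpenRouter base URL and the `OPENROUTER_API_KEY` environment variable are assumptions, not part of this page; the model ID and the 32K output cap come from the listing above.

```python
# Minimal sketch: calling Mercury Coder via an OpenAI-compatible gateway.
# Base URL and API-key variable are assumptions; the model ID and the
# 32K max-output cap come from the specifications table above.
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",  # assumed OpenAI-compatible endpoint
    api_key=os.environ["OPENROUTER_API_KEY"],
)

response = client.chat.completions.create(
    model="inception/mercury-coder",
    messages=[
        {"role": "user", "content": "Write a Python function that reverses a linked list."},
    ],
    max_tokens=4096,  # must stay at or below the model's 32K output cap
)
print(response.choices[0].message.content)
```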
Last updated: March 23, 2026
First tracked: March 23, 2026