mercury
Provider: Inception, Context: 128000, Output Limit: 16384
| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
|---|---|---|---|---|---|
| inception | models-dev | Input: $0.25 | Output: $1.00 | Provider: Inception, Context: 128000, Output Limit: 16384 | |
| openrouter | openrouter | Input: $0.25 | Output: $1.00 | Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here. Context: 128000 |