gpt-5.1-chat
Provider: Azure, Context: 128000, Output Limit: 16384
| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
|---|---|---|---|---|---|
| azure | models-dev | Input: $1.25 | Output: $10.00 | Provider: Azure, Context: 128000, Output Limit: 16384 | |
| zenmux | models-dev | Input: $1.25 | Output: $10.00 | Provider: ZenMux, Context: 128000, Output Limit: 64000 | |
| azurecognitiveservices | models-dev | Input: $1.25 | Output: $10.00 | Provider: Azure Cognitive Services, Context: 128000, Output Limit: 16384 | |
| openrouter | openrouter | Input: $1.25 | Output: $10.00 | GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.1 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation. Context: 128000 |