← Back to all models

GPT-5 nano

gpt-5-nano

GPT-5 nano is a high throughput model that excels at simple instruction or classification tasks.

Available at 13 Providers

Provider Source Input Price ($/1M) Output Price ($/1M) Description Free
vercel vercel Input: $0.05 Output: $0.40 GPT-5 nano is a high throughput model that excels at simple instruction or classification tasks.
poe poe Input: $0.04 Output: $0.36 GPT-5 nano is an extremely fast and cheap model, ideal for text/vision summarization/categorization tasks. Supports native vision and 400k input tokens of context. Provides a 90% chat history cache discount. To instruct the bot to use more reasoning effort, add --reasoning_effort to the end of your message with one of "minimal, "low", "medium", or "high" Use `--web_search true` to enable web search and real-time information access, this is disabled by default.
abacus models-dev Input: $0.05 Output: $0.40 Provider: Abacus, Context: 400000, Output Limit: 128000
azure models-dev Input: $0.05 Output: $0.40 Provider: Azure, Context: 272000, Output Limit: 128000
helicone models-dev Input: $0.05 Output: $0.40 Provider: Helicone, Context: 400000, Output Limit: 128000
opencode models-dev Input: $0.00 Output: $0.00 Provider: OpenCode Zen, Context: 400000, Output Limit: 128000
fastrouter models-dev Input: $0.05 Output: $0.40 Provider: FastRouter, Context: 400000, Output Limit: 128000
openai models-dev Input: $0.05 Output: $0.40 Provider: OpenAI, Context: 400000, Output Limit: 128000
requesty models-dev Input: $0.05 Output: $0.40 Provider: Requesty, Context: 16000, Output Limit: 4000
sapaicore models-dev Input: $0.05 Output: $0.40 Provider: SAP AI Core, Context: 400000, Output Limit: 128000
aihubmix models-dev Input: $0.50 Output: $2.00 Provider: AIHubMix, Context: 128000, Output Limit: 16384
azurecognitiveservices models-dev Input: $0.05 Output: $0.40 Provider: Azure Cognitive Services, Context: 272000, Output Limit: 128000
openrouter openrouter Input: $0.05 Output: $0.40 GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger counterparts, it retains key instruction-following and safety features. It is the successor to GPT-4.1-nano and offers a lightweight option for cost-sensitive or real-time applications. Context: 400000