o4-mini
OpenAI's o4-mini delivers fast, cost-efficient reasoning with exceptional performance for its size, particularly excelling in math (best-performing on AIME benchmarks), coding, and visual tasks.
| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
|---|---|---|---|---|---|
| vercel | vercel | Input: $1.10 | Output: $4.40 | OpenAI's o4-mini delivers fast, cost-efficient reasoning with exceptional performance for its size, particularly excelling in math (best-performing on AIME benchmarks), coding, and visual tasks. | |
| poe | poe | Input: $0.99 | Output: $4.00 | o4-mini provides high intelligence on a variety of tasks and domains, including science, math, and coding, at an affordable price point. This bot uses medium reasoning effort by default; low, medium, and high are selectable. It supports 200k tokens of input context and 100k tokens of output context. To instruct the bot to use more reasoning effort, add --reasoning_effort to the end of your message with one of "low", "medium", or "high". | |
| githubcopilot | models-dev | Input: $0.00 | Output: $0.00 | Provider: GitHub Copilot, Context: 128000, Output Limit: 65536 | |
| abacus | models-dev | Input: $1.10 | Output: $4.40 | Provider: Abacus, Context: 200000, Output Limit: 100000 | |
| githubmodels | models-dev | Input: $0.00 | Output: $0.00 | Provider: GitHub Models, Context: 200000, Output Limit: 100000 | |
| azure | models-dev | Input: $1.10 | Output: $4.40 | Provider: Azure, Context: 200000, Output Limit: 100000 | |
| helicone | models-dev | Input: $1.10 | Output: $4.40 | Provider: Helicone, Context: 200000, Output Limit: 100000 | |
| cloudflareaigateway | models-dev | Input: $1.10 | Output: $4.40 | Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000 | |
| openai | models-dev | Input: $1.10 | Output: $4.40 | Provider: OpenAI, Context: 200000, Output Limit: 100000 | |
| requesty | models-dev | Input: $1.10 | Output: $4.40 | Provider: Requesty, Context: 200000, Output Limit: 100000 | |
| aihubmix | models-dev | Input: $1.50 | Output: $6.00 | Provider: AIHubMix, Context: 200000, Output Limit: 65536 | |
| azurecognitiveservices | models-dev | Input: $1.10 | Output: $4.40 | Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000 | |
| openrouter | openrouter | Input: $1.10 | Output: $4.40 | OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning and coding performance across benchmarks like AIME (99.5% with Python) and SWE-bench, outperforming its predecessor o3-mini and even approaching o3 in some domains. Despite its smaller size, o4-mini exhibits high accuracy in STEM tasks, visual problem solving (e.g., MathVista, MMMU), and code editing. It is especially well-suited for high-throughput scenarios where latency or cost is critical. Thanks to its efficient architecture and refined reinforcement learning training, o4-mini can chain tools, generate structured outputs, and solve multi-step tasks with minimal delay, often in under a minute. Context: 200000 | |
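
To show how the per-million-token prices in the table translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost`, the `PRICES` mapping, and the example token counts are hypothetical and chosen only for illustration. The prices are copied from the table rows for three providers, and actual billing may differ by provider.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table.
# Only three providers are included here to keep the example short.
PRICES = {
    "openai": (1.10, 4.40),
    "poe": (0.99, 4.00),
    "aihubmix": (1.50, 6.00),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated request cost in USD for a provider listed in PRICES."""
    input_price, output_price = PRICES[provider]
    # Prices are quoted per million tokens, so divide the weighted sum by 1e6.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 10_000, 2_000):.4f}")
```

For the hypothetical request above, the estimate is about $0.0198 at the OpenAI list price, $0.0179 on Poe, and $0.0270 on AIHubMix.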