o4-mini

Provider	Source	Input Price ($/1M)	Output Price ($/1M)	Description
vercel	vercel	Input: $1.10	Output: $4.40	OpenAI's o4-mini delivers fast, cost-efficient reasoning with exceptional performance for its size, particularly excelling in math (best-performing on AIME benchmarks), coding, and visual tasks.
poe	poe	Input: $0.99	Output: $4.00	o4-mini provides high intelligence on a variety of tasks and domains, including science, math, and coding, at an affordable price point. This bot uses medium reasoning effort by default, but low, medium, and high are all selectable. It supports 200k tokens of input context and 100k tokens of output context. To instruct the bot to use a different reasoning effort, add --reasoning_effort to the end of your message with one of "low", "medium", or "high".
githubcopilot	models-dev	Input: $0.00	Output: $0.00	Provider: GitHub Copilot, Context: 128000, Output Limit: 65536
abacus	models-dev	Input: $1.10	Output: $4.40	Provider: Abacus, Context: 200000, Output Limit: 100000
githubmodels	models-dev	Input: $0.00	Output: $0.00	Provider: GitHub Models, Context: 200000, Output Limit: 100000
azure	models-dev	Input: $1.10	Output: $4.40	Provider: Azure, Context: 200000, Output Limit: 100000
helicone	models-dev	Input: $1.10	Output: $4.40	Provider: Helicone, Context: 200000, Output Limit: 100000
cloudflareaigateway	models-dev	Input: $1.10	Output: $4.40	Provider: Cloudflare AI Gateway, Context: 200000, Output Limit: 100000
openai	models-dev	Input: $1.10	Output: $4.40	Provider: OpenAI, Context: 200000, Output Limit: 100000
requesty	models-dev	Input: $1.10	Output: $4.40	Provider: Requesty, Context: 200000, Output Limit: 100000
aihubmix	models-dev	Input: $1.50	Output: $6.00	Provider: AIHubMix, Context: 200000, Output Limit: 65536
azurecognitiveservices	models-dev	Input: $1.10	Output: $4.40	Provider: Azure Cognitive Services, Context: 200000, Output Limit: 100000
openrouter	openrouter	Input: $1.10	Output: $4.40	OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning and coding performance across benchmarks like AIME (99.5% with Python) and SWE-bench, outperforming its predecessor o3-mini and even approaching o3 in some domains. Despite its smaller size, o4-mini exhibits high accuracy in STEM tasks, visual problem solving (e.g., MathVista, MMMU), and code editing. It is especially well-suited for high-throughput scenarios where latency or cost is critical. Thanks to its efficient architecture and refined reinforcement learning training, o4-mini can chain tools, generate structured outputs, and solve multi-step tasks with minimal delay—often in under a minute. Context: 200000
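
The poe entry describes selecting reasoning effort by appending a flag to the end of the message. A minimal sketch of that step follows. The helper name `with_reasoning_effort` is hypothetical, and the single space between the flag and its value is an assumption, since the listing does not spell out the exact separator.

```python
# Allowed values, as listed in the poe description.
VALID_EFFORTS = ("low", "medium", "high")


def with_reasoning_effort(message: str, effort: str = "medium") -> str:
    """Append the reasoning-effort flag to the end of a message.

    Assumption: the flag and its value are separated by one space.
    """
    if effort not in VALID_EFFORTS:
        raise ValueError(f"effort must be one of {VALID_EFFORTS}, got {effort!r}")
    # Strip trailing whitespace so the flag is the last thing in the message.
    return f"{message.rstrip()} --reasoning_effort {effort}"


print(with_reasoning_effort("Prove that the square root of 2 is irrational.", "high"))
```

Validating the value before sending avoids a silently ignored flag if a typo such as "hgih" slips in.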

Available at 13 Providers
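
Because prices are quoted per 1M tokens, the cost of a single request is (input tokens × input price + output tokens × output price) / 1,000,000. The sketch below applies that arithmetic to three rows copied from the table. The function name `request_cost` and the token counts are illustrative assumptions, and real bills may differ because of reasoning tokens, caching, or provider fees that the listing does not cover.

```python
# USD per 1M tokens (input, output), copied from the table above.
PRICES = {
    "openai": (1.10, 4.40),
    "poe": (0.99, 4.00),
    "aihubmix": (1.50, 6.00),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 50,000 input tokens and 10,000 output tokens.
for name in PRICES:
    print(f"{name}: ${request_cost(name, 50_000, 10_000):.4f}")
```

For this workload the estimate is about $0.099 at the $1.10 / $4.40 rate, $0.0895 on poe, and $0.135 on aihubmix.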