← Back to all models

GPT-4.1

gpt-4.1

GPT 4.1 is OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.

Available at 14 Providers

Provider Source Input Price ($/1M) Output Price ($/1M) Description Free
vercel vercel Input: $2.00 Output: $8.00 GPT 4.1 is OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.
poe poe Input: $1.80 Output: $7.20 OpenAI’s GPT-4.1 significantly improves on past models in terms of its coding skills, long context (1M tokens), and improved instruction following. Supports native vision, and generally has more intelligence than GPT-4o. Provides a 75% chat history cache discount. Check out the newest version of this bot here: https://poe.com/GPT-5.
githubcopilot models-dev Input: $0.00 Output: $0.00 Provider: GitHub Copilot, Context: 128000, Output Limit: 16384
abacus models-dev Input: $2.00 Output: $8.00 Provider: Abacus, Context: 1047576, Output Limit: 32768
cortecs models-dev Input: $2.35 Output: $9.42 Provider: Cortecs, Context: 1047576, Output Limit: 32768
githubmodels models-dev Input: $0.00 Output: $0.00 Provider: GitHub Models, Context: 128000, Output Limit: 16384
azure models-dev Input: $2.00 Output: $8.00 Provider: Azure, Context: 1047576, Output Limit: 32768
helicone models-dev Input: $2.00 Output: $8.00 Provider: Helicone, Context: 1047576, Output Limit: 32768
fastrouter models-dev Input: $2.00 Output: $8.00 Provider: FastRouter, Context: 1047576, Output Limit: 32768
openai models-dev Input: $2.00 Output: $8.00 Provider: OpenAI, Context: 1047576, Output Limit: 32768
requesty models-dev Input: $2.00 Output: $8.00 Provider: Requesty, Context: 1047576, Output Limit: 32768
aihubmix models-dev Input: $2.00 Output: $8.00 Provider: AIHubMix, Context: 1047576, Output Limit: 32768
azurecognitiveservices models-dev Input: $2.00 Output: $8.00 Provider: Azure Cognitive Services, Context: 1047576, Output Limit: 32768
openrouter openrouter Input: $2.00 Output: $8.00 GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval. Context: 1047576