Qwen3 Max

qwen3-max

Compared with the preview version, the Qwen3 series Max model has received targeted upgrades in agentic programming and tool calling. The official release reaches state-of-the-art (SOTA) performance in its field and is better suited to agents operating in more complex scenarios.

Available at 6 Providers

| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
| --- | --- | --- | --- | --- | --- |
| vercel | vercel | $1.20 | $6.00 | Compared with the preview version, the Qwen3 series Max model has received targeted upgrades in agentic programming and tool calling. The official release reaches state-of-the-art (SOTA) performance in its field and is better suited to agents operating in more complex scenarios. | |
| poe | poe | - | - | Qwen3-Max is a major update to the Qwen3 series, delivering significant improvements in reasoning, instruction following, and multilingual support. It provides higher accuracy in complex tasks like coding and math, along with reduced hallucinations and better performance on open-ended questions. This model is served by Alibaba Cloud Int. from Singapore. | |
| alibaba | models-dev | $1.20 | $6.00 | Provider: Alibaba, Context: 262144, Output Limit: 65536 | |
| alibabacn | models-dev | $0.86 | $3.44 | Provider: Alibaba (China), Context: 262144, Output Limit: 65536 | |
| iflowcn | models-dev | $0.00 | $0.00 | Provider: iFlow, Context: 256000, Output Limit: 32000 | |
| openrouter | openrouter | $1.20 | $6.00 | Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling, though it does not include a dedicated “thinking” mode. Context: 256000 | |
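
The per-million-token prices listed above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not an official calculator: it assumes the listed prices are flat USD rates per 1M tokens with no tiering, caching discounts, or provider fees, and the function name `estimate_cost` and the token counts in the usage loop are hypothetical. poe is left out because no price is listed for it.

```python
from decimal import Decimal

# Prices copied from the provider listing, in USD per 1M tokens:
# (input price, output price). poe is omitted: no price is listed.
PRICES_PER_MILLION = {
    "vercel": (Decimal("1.20"), Decimal("6.00")),
    "alibaba": (Decimal("1.20"), Decimal("6.00")),
    "alibabacn": (Decimal("0.86"), Decimal("3.44")),
    "iflowcn": (Decimal("0.00"), Decimal("0.00")),
    "openrouter": (Decimal("1.20"), Decimal("6.00")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat per-token pricing; real bills may differ.
    """
    input_price, output_price = PRICES_PER_MILLION[provider]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # Illustrative workload: 200k input tokens and 20k output tokens.
    for provider in PRICES_PER_MILLION:
        cost = estimate_cost(provider, 200_000, 20_000)
        print(f"{provider:<12} ${cost:.4f}")
```

`Decimal` is used instead of floats so that the listed prices are represented exactly and small per-token amounts do not pick up binary rounding noise.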