Qwen3 Max

qwen3-max

The Qwen 3 series Max model has undergone specialized upgrades in agent programming and tool invocation compared to the preview version. The officially released model this time has achieved state-of-the-art (SOTA) performance in its field and is better suited to meet the demands of agents operating in more complex scenarios.

Provider	Source	Input Price ($/1M)	Output Price ($/1M)	Description
vercel	vercel	Input: $1.20	Output: $6.00	The Qwen 3 series Max model has undergone specialized upgrades in agent programming and tool invocation compared to the preview version. The officially released model this time has achieved state-of-the-art (SOTA) performance in its field and is better suited to meet the demands of agents operating in more complex scenarios.
poe	poe	Input: -	Output: -	Qwen3-Max is a major update to the Qwen3 series, delivering significant improvements in reasoning, instruction following, and multilingual support. It provides higher accuracy in complex tasks like coding and math, along with reduced hallucinations and better performance on open-ended questions. This model is served by Alibaba Cloud Int. from Singapore.
alibaba	models-dev	Input: $1.20	Output: $6.00	Provider: Alibaba, Context: 262144, Output Limit: 65536
alibabacn	models-dev	Input: $0.86	Output: $3.44	Provider: Alibaba (China), Context: 262144, Output Limit: 65536
iflowcn	models-dev	Input: $0.00	Output: $0.00	Provider: iFlow, Context: 256000, Output Limit: 32000
openrouter	openrouter	Input: $1.20	Output: $6.00	Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling, though it does not include a dedicated “thinking” mode. Context: 256000

Qwen3 Max

Available at 6 Providers