
Kimi K2 0905

kimi-k2-0905

Kimi K2 0905 has shown strong performance on agentic tasks thanks to its tool calling, reasoning abilities, and long-context handling. But as a 1-trillion-parameter model, it is also resource-intensive: running it in production requires a highly optimized inference stack to avoid excessive latency.
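
Most of the providers listed below expose the model through an OpenAI-compatible chat completions API. The sketch below is a minimal example, assuming OpenRouter's endpoint and the moonshotai/kimi-k2-0905 model slug; both are assumptions here and should be verified against your provider's documentation.

```python
# Minimal sketch: calling Kimi K2 0905 via an OpenAI-compatible endpoint.
# Base URL and model slug follow OpenRouter's conventions and are assumptions;
# substitute the values documented by whichever provider you use.
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",  # assumed provider endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="moonshotai/kimi-k2-0905",  # assumed slug; check your provider's model list
    messages=[
        {"role": "user", "content": "Summarize the trade-offs of MoE models in two sentences."},
    ],
)
print(response.choices[0].message.content)
```

Switching providers is usually just a matter of changing the base URL, API key, and model slug, since the request and response shapes stay the same.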

Available at 7 Providers

| Provider   | Source     | Input Price ($/1M) | Output Price ($/1M) | Context | Output Limit |
|------------|------------|--------------------|---------------------|---------|--------------|
| vercel     | vercel     | $0.60              | $2.50               | –       | –            |
| together   | together   | $1.00              | $3.00               | –       | –            |
| helicone   | models-dev | $0.50              | $2.00               | 262,144 | 16,384       |
| zenmux     | models-dev | $0.60              | $2.50               | 262,100 | 64,000       |
| iflowcn    | models-dev | $0.00              | $0.00               | 256,000 | 64,000       |
| aihubmix   | models-dev | $0.55              | $2.19               | 262,144 | 262,144      |
| openrouter | openrouter | $0.39              | $1.90               | 262,144 | –            |

OpenRouter's listing adds further detail: Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It supports long-context inference up to 256k tokens, extended from the previous 128k. This update improves agentic coding with higher accuracy and better generalization across scaffolds, and enhances frontend coding with more aesthetic and functional outputs for web, 3D, and related tasks. Kimi K2 is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) benchmarks. The model is trained with a novel stack incorporating the MuonClip optimizer for stable large-scale MoE training.
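
Because every provider above quotes prices per million tokens, the cost of a single request is a weighted sum of its input and output token counts. A small sketch using the OpenRouter rates from the table; the token counts are illustrative, not measured.

```python
# Estimate per-request cost from per-million-token rates.
# Rates below are OpenRouter's from the table ($0.39 in / $1.90 out);
# the example token counts are made up for illustration.
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    return (input_tokens / 1_000_000) * input_price_per_m \
         + (output_tokens / 1_000_000) * output_price_per_m

# e.g. a 20k-token prompt with a 1k-token completion:
cost = request_cost(20_000, 1_000, 0.39, 1.90)
print(f"${cost:.4f}")  # ~$0.0097
```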