
Kimi K2 0905

kimi-k2-0905

Kimi K2 0905 has shown strong performance on agentic tasks thanks to its tool calling, reasoning abilities, and long-context handling. But as a 1-trillion-parameter model, it is also resource-intensive: running it in production requires a highly optimized inference stack to avoid excessive latency.
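
Most of the providers listed below expose the model through an OpenAI-compatible chat completions API. The sketch below is a minimal example, assuming OpenRouter's endpoint and the moonshotai/kimi-k2-0905 model slug; both are assumptions here and should be verified against your provider's documentation.

```python
# Minimal sketch: calling Kimi K2 0905 via an OpenAI-compatible endpoint.
# Base URL and model slug follow OpenRouter's conventions and are assumptions;
# substitute the values documented by whichever provider you use.
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",  # assumed provider endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="moonshotai/kimi-k2-0905",  # assumed slug; check your provider's model list
    messages=[
        {"role": "user", "content": "Summarize the trade-offs of MoE models in two sentences."},
    ],
)
print(response.choices[0].message.content)
```

Switching providers is usually just a matter of changing the base URL, API key, and model slug, since the request and response shapes stay the same.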

Available at 7 Providers

| Provider   | Source     | Input Price ($/1M) | Output Price ($/1M) | Context | Output Limit |
|------------|------------|--------------------|---------------------|---------|--------------|
| vercel     | vercel     | $0.60              | $2.50               | –       | –            |
| together   | together   | $1.00              | $3.00               | –       | –            |
| helicone   | models-dev | $0.50              | $2.00               | 262,144 | 16,384       |
| zenmux     | models-dev | $0.60              | $2.50               | 262,100 | 64,000       |
| iflowcn    | models-dev | $0.00              | $0.00               | 256,000 | 64,000       |
| aihubmix   | models-dev | $0.55              | $2.19               | 262,144 | 262,144      |
| openrouter | openrouter | $0.39              | $1.90               | 262,144 | –            |

OpenRouter's listing adds further detail: Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It supports long-context inference up to 256k tokens, extended from the previous 128k. This update improves agentic coding with higher accuracy and better generalization across scaffolds, and enhances frontend coding with more aesthetic and functional outputs for web, 3D, and related tasks. Kimi K2 is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. It excels across coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) benchmarks. The model is trained with a novel stack incorporating the MuonClip optimizer for stable large-scale MoE training.
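
Because every provider above quotes prices per million tokens, the cost of a single request is a weighted sum of its input and output token counts. A small sketch using the OpenRouter rates from the table; the token counts are illustrative, not measured.

```python
# Estimate per-request cost from per-million-token rates.
# Rates below are OpenRouter's from the table ($0.39 in / $1.90 out);
# the example token counts are made up for illustration.
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    return (input_tokens / 1_000_000) * input_price_per_m \
         + (output_tokens / 1_000_000) * output_price_per_m

# e.g. a 20k-token prompt with a 1k-token completion:
cost = request_cost(20_000, 1_000, 0.39, 1.90)
print(f"${cost:.4f}")  # ~$0.0097
```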