← Back to all models

zai-glm-4.6-cs

zai-glm-4.6-cs

World’s fastest inference for ZAI GLM 4.6 with Cerebras. ZAI GLM 4.6 is a high‑performance AI model designed for advanced reasoning, superior coding, and effective tool use. It supports structured outputs, parallel tool calling, and real‑time streaming responses. Optimized for agentic coding and automation tasks, the model delivers strong real‑world performance with a context window of up to 131K tokens and output up to 40K tokens. For more information see: https://inference-docs.cerebras.ai/models/zai-glm-46 Context Limit: 131k

Available at 1 Provider

Provider Source Input Price ($/1M) Output Price ($/1M) Description Free
poe poe Input: $19,000.00 Output: - World’s fastest inference for ZAI GLM 4.6 with Cerebras. ZAI GLM 4.6 is a high‑performance AI model designed for advanced reasoning, superior coding, and effective tool use. It supports structured outputs, parallel tool calling, and real‑time streaming responses. Optimized for agentic coding and automation tasks, the model delivers strong real‑world performance with a context window of up to 131K tokens and output up to 40K tokens. For more information see: https://inference-docs.cerebras.ai/models/zai-glm-46 Context Limit: 131k