
GLM-4.6V-Flash

glm-4.6v-flash

GLM-4.6V-Flash is intended for local deployment and low-latency applications. The GLM-4.6V series is Z.ai’s iteration of its multimodal large language models. GLM-4.6V scales its context window to 128k tokens in training and achieves SoTA performance in visual understanding among models of similar parameter scale.

Available at 4 Providers

| Provider | Source | Input Price ($/1M) | Output Price ($/1M) | Description | Free |
| --- | --- | --- | --- | --- | --- |
| vercel | vercel | $0.00 | $0.00 | For local deployment and low-latency applications. The GLM-4.6V series is Z.ai’s iteration of its multimodal large language models. GLM-4.6V scales its context window to 128k tokens in training and achieves SoTA performance in visual understanding among models of similar parameter scale. | |
| zenmux | models-dev | $0.00 | $0.00 | Provider: ZenMux, Context: 200000, Output Limit: 64000 | |
| zhipuai | models-dev | $0.00 | $0.00 | Provider: Zhipu AI, Context: 128000, Output Limit: 32768 | |
| zai | zai | $0.00 | $0.00 | - | |
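
The context and output limits in the listing differ between providers, so it can help to check which ones fit a given request before routing to them. The following is a minimal sketch, not an official client: the `ProviderListing` type and `providers_fitting` helper are hypothetical names introduced here. It assumes the context figure covers prompt and output tokens together, and that providers with no published limits (vercel, zai) cannot be checked and are skipped.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ProviderListing:
    """One row of the provider table (hypothetical structure for illustration)."""
    provider: str
    input_price: float                  # USD per 1M input tokens
    output_price: float                 # USD per 1M output tokens
    context: Optional[int] = None       # None where the listing gives no figure
    output_limit: Optional[int] = None  # None where the listing gives no figure


# Values copied from the listing; missing limits stay None rather than guessed.
LISTINGS = [
    ProviderListing("vercel", 0.0, 0.0),
    ProviderListing("zenmux", 0.0, 0.0, context=200_000, output_limit=64_000),
    ProviderListing("zhipuai", 0.0, 0.0, context=128_000, output_limit=32_768),
    ProviderListing("zai", 0.0, 0.0),
]


def providers_fitting(prompt_tokens: int, max_output_tokens: int) -> List[str]:
    """Return providers whose listed limits can hold the request.

    Assumption: the context window is shared by prompt and output tokens.
    Providers without listed limits are skipped because they cannot be verified.
    """
    fitting = []
    for row in LISTINGS:
        if row.context is None or row.output_limit is None:
            continue
        within_context = prompt_tokens + max_output_tokens <= row.context
        within_output = max_output_tokens <= row.output_limit
        if within_context and within_output:
            fitting.append(row.provider)
    return fitting


if __name__ == "__main__":
    print(providers_fitting(100_000, 20_000))
    print(providers_fitting(150_000, 40_000))
```

Under these assumptions, a request for 100,000 prompt tokens plus 20,000 output tokens fits both zenmux and zhipuai, while 150,000 plus 40,000 fits only zenmux.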