GLM-4.6V-Flash

glm-4.6v-flash

For local deployment and low-latency applications. GLM-4.6V series are Z.ai’s iterations in a multimodal large language model. GLM-4.6V scales its context window to 128k tokens in training, and achieves SoTA performance in visual understanding among models of similar parameter scales.

Provider	Source	Input Price ($/1M)	Output Price ($/1M)	Description
vercel	vercel	Input: $0.00	Output: $0.00	For local deployment and low-latency applications. GLM-4.6V series are Z.ai’s iterations in a multimodal large language model. GLM-4.6V scales its context window to 128k tokens in training, and achieves SoTA performance in visual understanding among models of similar parameter scales.
zenmux	models-dev	Input: $0.00	Output: $0.00	Provider: ZenMux, Context: 200000, Output Limit: 64000
zhipuai	models-dev	Input: $0.00	Output: $0.00	Provider: Zhipu AI, Context: 128000, Output Limit: 32768
zai	zai	Input: $0.00	Output: $0.00	-

GLM-4.6V-Flash

Available at 4 Providers