glm-4.6v-flash
GLM-4.6V-Flash is intended for local deployment and low-latency applications. The GLM-4.6V series is Z.ai’s iteration of its multimodal large language models. GLM-4.6V scales its context window to 128k tokens in training and achieves state-of-the-art (SoTA) performance in visual understanding among models of a similar parameter scale.
| Provider | Source | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Description | Free |
|---|---|---|---|---|---|
| vercel | vercel | $0.00 | $0.00 | GLM-4.6V-Flash is intended for local deployment and low-latency applications. The GLM-4.6V series is Z.ai’s iteration of its multimodal large language models. GLM-4.6V scales its context window to 128k tokens in training and achieves state-of-the-art (SoTA) performance in visual understanding among models of a similar parameter scale. | |
| zenmux | models-dev | $0.00 | $0.00 | Provider: ZenMux, Context: 200000, Output Limit: 64000 | |
| zhipuai | models-dev | $0.00 | $0.00 | Provider: Zhipu AI, Context: 128000, Output Limit: 32768 | |
| zai | zai | $0.00 | $0.00 | - | |
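
The listed limits and per-million-token prices can be combined into a quick pre-flight check. The following is a minimal sketch, not any provider's API: the `Listing` class and the `cost_usd` and `fits` helpers are hypothetical names, the figures are copied from the table above, and it assumes that prompt tokens plus requested output tokens must fit within the context window, which individual providers may count differently.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Listing:
    provider: str
    input_price: float            # USD per 1M input tokens
    output_price: float           # USD per 1M output tokens
    context: Optional[int]        # context window in tokens, None if not listed
    output_limit: Optional[int]   # max output tokens, None if not listed


# Values copied from the table; vercel and zai list no limits.
LISTINGS = {
    "vercel": Listing("vercel", 0.0, 0.0, None, None),
    "zenmux": Listing("zenmux", 0.0, 0.0, 200_000, 64_000),
    "zhipuai": Listing("zhipuai", 0.0, 0.0, 128_000, 32_768),
    "zai": Listing("zai", 0.0, 0.0, None, None),
}


def cost_usd(listing: Listing, input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost from per-million-token prices."""
    total = input_tokens * listing.input_price + output_tokens * listing.output_price
    return total / 1_000_000


def fits(listing: Listing, input_tokens: int, output_tokens: int) -> Optional[bool]:
    """True/False if limits are listed, None if they are unknown.

    Assumption: input + output must fit in the context window.
    """
    if listing.context is None or listing.output_limit is None:
        return None
    within_output = output_tokens <= listing.output_limit
    within_context = input_tokens + output_tokens <= listing.context
    return within_output and within_context


if __name__ == "__main__":
    for name, listing in LISTINGS.items():
        print(name, cost_usd(listing, 150_000, 40_000), fits(listing, 150_000, 40_000))
```

Under these assumptions, a request with 150,000 prompt tokens and 40,000 output tokens fits the zenmux listing but not the zhipuai one, because it exceeds the 32,768-token output limit.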