| Model | Input | Cached input | Output | Unit |
|---|---|---|---|---|
|
GLM 5.2
Z.ai
|
CNY 8 | CNY 2 | CNY 28 | per 1M tokens |
|
GLM 5.1
Z.ai
|
CNY 6 | CNY 1.3 | CNY 24 | per 1M tokens |
|
GLM 5 Turbo
Z.ai
|
CNY 5 | CNY 1.2 | CNY 22 | per 1M tokens |
|
GLM 5
Z.ai
|
CNY 4 | CNY 1 | CNY 18 | per 1M tokens |
|
GLM 5V Turbo
Z.ai
|
CNY 5 | CNY 1.2 | CNY 22 | per 1M tokens |
|
GLM 4.7
Z.ai
|
CNY 3 | CNY 0.6 | CNY 14 | per 1M tokens |
|
GLM 4.6V
Z.ai
|
CNY 1 | CNY 0.2 | CNY 3 | per 1M tokens |
|
GLM 4.5V
This model
Z.ai
|
CNY 2 | CNY 0.4 | CNY 6 | per 1M tokens |
|
GLM 4.5 Air X
Z.ai
|
CNY 0.8 | CNY 0.16 | CNY 6 | per 1M tokens |
|
GLM 4.6V FlashX
Z.ai
|
CNY 0.15 | CNY 0.03 | CNY 1.5 | per 1M tokens |
|
GLM 4.7 FlashX
Z.ai
|
CNY 0.5 | CNY 0.1 | CNY 3 | per 1M tokens |
|
GLM 4.7 Flash
Z.ai
|
CNY 0 | - | CNY 0 | per 1M tokens |
GLM 4.5V
Overview
GLM-4.5V is a 106B parameter (12B active) MoE-based visual reasoning model supporting image, video, document, and GUI task understanding. It achieves open-source SOTA performance across 42 visual multimodal benchmarks and features a switchable Thinking Mode for balancing response speed against deep reasoning depth.
Pricing
Compare GLM 4.5V with other models listed in the same vendor pricing tiers and context lengths.
Standard
| Model | Input | Cached input | Output | Unit |
|---|---|---|---|---|
|
GLM 5.1
Z.ai
|
CNY 8 | CNY 2 | CNY 28 | per 1M tokens |
|
GLM 5 Turbo
Z.ai
|
CNY 7 | CNY 1.8 | CNY 26 | per 1M tokens |
|
GLM 5
Z.ai
|
CNY 6 | CNY 1.5 | CNY 22 | per 1M tokens |
|
GLM 5V Turbo
Z.ai
|
CNY 7 | CNY 1.8 | CNY 26 | per 1M tokens |
|
GLM 4.7
Z.ai
|
CNY 4 | CNY 0.8 | CNY 16 | per 1M tokens |
|
GLM 4.6V
Z.ai
|
CNY 2 | CNY 0.4 | CNY 6 | per 1M tokens |
|
GLM 4.5V
This model
Z.ai
|
CNY 4 | CNY 0.8 | CNY 12 | per 1M tokens |
|
GLM 4.5 Air X
Z.ai
|
CNY 1.2 | CNY 0.24 | CNY 8 | per 1M tokens |
|
GLM 4.6V FlashX
Z.ai
|
CNY 0.3 | CNY 0.03 | CNY 3 | per 1M tokens |
About Z.ai
Z.ai (formerly Zhipu AI) is a Chinese AI company developing large language models (GLM series), combining reasoning, coding, and agent capabilities, and offering open models and APIs.
