| Model | Provider | Input | Cached input | Output | Unit |
|---|---|---|---|---|---|
| Deepseek V4 Pro | DeepSeek | $1.74 | $0.2 | $3.48 | per 1M tokens |
| Gemma 4 31B IT NVFP4 | NVIDIA | $0.28 | - | $0.86 | per 1M tokens |
| GLM 5 (this model) | Z.ai | $1 | - | $3.2 | per 1M tokens |
| GLM 5.1 | Z.ai | $1.4 | $0.26 | $4.4 | per 1M tokens |
| Kimi K2.6 | Moonshot AI | $1.2 | $0.2 | $4.5 | per 1M tokens |
| Kimi K2.7 Code | Moonshot AI | $0.95 | $0.19 | $4 | per 1M tokens |
| LFM2 24B A2B | Liquid AI | $0.03 | - | $0.12 | per 1M tokens |
| Llama 3.3 | Meta Platforms | $1.04 | - | $1.04 | per 1M tokens |
| MiniMax M2.5 | MiniMax | $0.3 | $0.06 | $1.2 | per 1M tokens |
| MiniMax M2.7 | MiniMax | $0.3 | $0.06 | $1.2 | per 1M tokens |
| MiniMax M3 | MiniMax | $0.3 | $0.06 | $1.2 | per 1M tokens |
| | | $0.6 | $0.2 | $3.6 | per 1M tokens |
| Qwen 3.5 9B | Alibaba | $0.17 | - | $0.25 | per 1M tokens |
| Qwen 3.6 Plus | Alibaba | $0.5 | - | $3 | per 1M tokens |
| Qwen 3.7 Max | Alibaba | $1.25 | $0.13 | $3.75 | per 1M tokens |
| Qwen3 235B A22B | Alibaba | $0.2 | - | $0.6 | per 1M tokens |
| Qwen3.5 397B A17B | Alibaba | $0.6 | $0.35 | $3.6 | per 1M tokens |
| Qwen3.7-Plus | Alibaba | $0.32 | - | $1.28 | per 1M tokens |

GLM 5
Overview
GLM 5 is Z.ai's 744B-parameter open Mixture-of-Experts flagship model for agentic engineering. It delivers frontier-level coding, reasoning, and long-horizon agent performance while remaining Apache 2.0 licensed and optimized for Chinese and global hardware.
Pricing
Compare GLM 5 with other models listed at the same vendor, across pricing tiers and context lengths.
Standard
Batch
| Model | Provider | Input | Cached input | Output | Unit |
|---|---|---|---|---|---|
| Deepseek V4 Pro | DeepSeek | $0.87 | $0.2 | $1.74 | per 1M tokens |
| Gemma 4 31B IT NVFP4 | NVIDIA | $0.28 | - | $0.86 | per 1M tokens |
| GLM 5 (this model) | Z.ai | $1 | - | $3.2 | per 1M tokens |
| GLM 5.1 | Z.ai | $0.7 | $0.26 | $2.2 | per 1M tokens |
| Kimi K2.6 | Moonshot AI | $1.2 | $0.2 | $4.5 | per 1M tokens |
| Kimi K2.7 Code | Moonshot AI | $0.95 | $0.19 | $4 | per 1M tokens |
| LFM2 24B A2B | Liquid AI | $0.01 | - | $0.06 | per 1M tokens |
| Llama 3.3 | Meta Platforms | $0.52 | - | $0.52 | per 1M tokens |
| MiniMax M2.5 | MiniMax | $0.3 | $0.06 | $1.2 | per 1M tokens |
| MiniMax M2.7 | MiniMax | $0.15 | $0.06 | $0.6 | per 1M tokens |
| MiniMax M3 | MiniMax | $0.3 | $0.06 | $1.2 | per 1M tokens |
| | | $0.6 | $0.2 | $3.6 | per 1M tokens |
| Qwen 3.5 35B A3B | Alibaba | $0.6 | $0.35 | $3.6 | per 1M tokens |
| Qwen 3.5 9B | Alibaba | $0.17 | - | $0.25 | per 1M tokens |
| Qwen 3.6 Plus | Alibaba | $0.5 | - | $3 | per 1M tokens |
| Qwen3 235B A22B | Alibaba | $0.1 | - | $0.3 | per 1M tokens |
| Qwen3.7-Plus | Alibaba | $1.25 | $0.13 | $3.75 | per 1M tokens |

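
Because every price above is quoted per 1M tokens, the cost of a single request is simple arithmetic. The sketch below is illustrative only: the function name `request_cost` and its parameters are hypothetical, and it assumes that cached input tokens are billed at the cached-input rate in place of the regular input rate, and that a `-` in the cached column means no discount applies. Check the vendor's billing rules before relying on either assumption.

```python
def request_cost(input_tokens, output_tokens, input_price, output_price,
                 cached_tokens=0, cached_price=None):
    """Estimate the cost of one request from per-1M-token prices.

    cached_tokens is the portion of input_tokens served from cache.
    Assumption: cached tokens are billed at cached_price instead of
    input_price; if no cached price is listed ("-"), pass None and
    they are billed at the full input price.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    if cached_price is None:
        cached_price = input_price

    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * input_price
             + cached_tokens * cached_price
             + output_tokens * output_price)
    return total / 1_000_000  # prices are per 1M tokens


# GLM 5 as listed: $1 input, no cached price, $3.2 output.
glm5 = request_cost(200_000, 50_000, input_price=1.0, output_price=3.2)
print(f"GLM 5: ${glm5:.2f}")

# MiniMax M2.5 as listed: $0.3 input, $0.06 cached input, $1.2 output.
m25 = request_cost(1_000_000, 0, input_price=0.3, output_price=1.2,
                   cached_tokens=500_000, cached_price=0.06)
print(f"MiniMax M2.5: ${m25:.2f}")
```

For the GLM 5 call, 200,000 input tokens cost $0.20 and 50,000 output tokens cost $0.16, for a total of $0.36. The same function works for the CNY tables, since it is unit-agnostic.
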
Standard
| Model | Provider | Input | Cached input | Output | Unit |
|---|---|---|---|---|---|
| GLM 5.2 | Z.ai | CNY 8 | CNY 2 | CNY 28 | per 1M tokens |
| GLM 5.1 | Z.ai | CNY 6 | CNY 1.3 | CNY 24 | per 1M tokens |
| GLM 5 Turbo | Z.ai | CNY 5 | CNY 1.2 | CNY 22 | per 1M tokens |
| GLM 5 (this model) | Z.ai | CNY 4 | CNY 1 | CNY 18 | per 1M tokens |
| GLM 5V Turbo | Z.ai | CNY 5 | CNY 1.2 | CNY 22 | per 1M tokens |
| GLM 4.7 | Z.ai | CNY 3 | CNY 0.6 | CNY 14 | per 1M tokens |
| GLM 4.6V | Z.ai | CNY 1 | CNY 0.2 | CNY 3 | per 1M tokens |
| GLM 4.5V | Z.ai | CNY 2 | CNY 0.4 | CNY 6 | per 1M tokens |
| GLM 4.5 Air X | Z.ai | CNY 0.8 | CNY 0.16 | CNY 6 | per 1M tokens |
| GLM 4.6V FlashX | Z.ai | CNY 0.15 | CNY 0.03 | CNY 1.5 | per 1M tokens |
| GLM 4.7 FlashX | Z.ai | CNY 0.5 | CNY 0.1 | CNY 3 | per 1M tokens |
| GLM 4.7 Flash | Z.ai | CNY 0 | - | CNY 0 | per 1M tokens |

| Model | Provider | Input | Cached input | Output | Unit |
|---|---|---|---|---|---|
| GLM 5.1 | Z.ai | CNY 8 | CNY 2 | CNY 28 | per 1M tokens |
| GLM 5 Turbo | Z.ai | CNY 7 | CNY 1.8 | CNY 26 | per 1M tokens |
| GLM 5 (this model) | Z.ai | CNY 6 | CNY 1.5 | CNY 22 | per 1M tokens |
| GLM 5V Turbo | Z.ai | CNY 7 | CNY 1.8 | CNY 26 | per 1M tokens |
| GLM 4.7 | Z.ai | CNY 4 | CNY 0.8 | CNY 16 | per 1M tokens |
| GLM 4.6V | Z.ai | CNY 2 | CNY 0.4 | CNY 6 | per 1M tokens |
| GLM 4.5V | Z.ai | CNY 4 | CNY 0.8 | CNY 12 | per 1M tokens |
| GLM 4.5 Air X | Z.ai | CNY 1.2 | CNY 0.24 | CNY 8 | per 1M tokens |
| GLM 4.6V FlashX | Z.ai | CNY 0.3 | CNY 0.03 | CNY 3 | per 1M tokens |

About Z.ai
Z.ai (formerly Zhipu AI) is a Chinese AI company developing large language models (GLM series), combining reasoning, coding, and agent capabilities, and offering open models and APIs.
Benchmark scores
How GLM 5 ranks on tracked AI benchmarks. Click any benchmark to see its full leaderboard.
Tools using GLM 5
- Oct 29, 2025, @Z.ai: I have been using z.ai for two weeks for web development and it's mind-blowing. What amazes me more is its capacity to understand my existing code with very little context and suggest, or just write, lots of improvements. I ran some tests to compare it with ChatGPT, Claude, etc., but the results were not even close. I keep pushing the limits, but it doesn't even blink.
