GLM-4.5 Flash is
Zhipu AI's language model with a 128K context window and up to 32K output tokens. A fast, lightweight tier of the GLM-4.5 MoE family designed for low-latency agent and tool-use applications.
Capabilities
Input1/5
✓
·
·
·
·
Output1/5
✓
·
·
·
·
Capabilities1/13
·
·
✓
·
·
·
·
·
·
·
·
·
·
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| GLM-4.6V-Flash | 128K | — | — | Available | |
| GLM-4.5 Flash | — | 128K | — | — | Current |
Other models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| GLM 5V Turbo | — | 203K | $1.20 | $4.00 | |
| GLM-5 Turbo | — | 203K | $1.20 | $4.00 | |
| GLM-5 Code | — | — | 200K | $1.20 | $5.00 |
| GLM-5 MaaS | — | — | 200K | $1.00 | $3.20 |
| GLM 4.7 FlashX | — | 200K | $0.060 | $0.400 | |
| GLM-4.7 FP8 | — | — | 203K | $0.400 | $2.00 |
| GLM-4.7 MaaS | — | — | 200K | — | — |
| GLM-4.5 X | — | — | 128K | $2.20 | $8.90 |
| GLM-4.5 Air FP8 | Air | — | 128K | $0.200 | $1.10 |
| GLM-4.5 AirX | Airx | — | 128K | $1.10 | $4.50 |