GLM-4.6V-Flash is
Zhipu AI's language model with a 128K context window and up to 24K output tokens. A low-latency multimodal variant of GLM-4.6 optimized for local deployment with vision understanding and 128K context support.
Capabilities
Input2/5
·
✓
·
·
✓
Output1/5
✓
·
·
·
·
Capabilities3/13
✓
·
✓
·
·
·
·
·
·
·
·
✓
·
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| GLM-4.6V-Flash | 128K | — | — | Current | |
| GLM-4.6V-Flash | 128K | — | — | Available | |
| GLM-4.5 Flash | — | 128K | — | — | Available |
Other models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| GLM 5V Turbo | — | 203K | $1.20 | $4.00 | |
| GLM-5 Turbo | — | 203K | $1.20 | $4.00 | |
| GLM-5 Code | — | — | 200K | $1.20 | $5.00 |
| GLM-5 MaaS | — | — | 200K | $1.00 | $3.20 |
| GLM 4.7 FlashX | — | 200K | $0.060 | $0.400 | |
| GLM-4.7 FP8 | — | — | 203K | $0.400 | $2.00 |
| GLM-4.7 MaaS | — | — | 200K | — | — |
| GLM-4.5 X | — | — | 128K | $2.20 | $8.90 |
| GLM-4.5 Air FP8 | Air | — | 128K | $0.200 | $1.10 |
| GLM-4.5 AirX | Airx | — | 128K | $1.10 | $4.50 |