GLM-4.5 Air FP8 is
Zhipu AI's language model with a 128K context window, starting at $0.200 / 1M input and $1.10 / 1M output. An FP8-quantized version of GLM-4.5 Air, delivering efficient inference for agent-centric tasks with reduced memory footprint.
Capabilities
Input1/5
✓
·
·
·
·
Output1/5
✓
·
·
·
·
Capabilities3/13
·
·
✓
✓
✓
·
·
·
·
·
·
·
·
Pricing by Provider
| Provider | Standard | |
|---|---|---|
| Input $ / 1M | Output $ / 1M | |
Together AI | $0.200 | $1.10 |
Cost Calculator
Preset:
Compares every provider & tier in USD
Other models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| GLM 5V Turbo | — | 203K | $1.20 | $4.00 | |
| GLM-5 Turbo | — | 203K | $1.20 | $4.00 | |
| GLM-5 Code | — | — | 200K | $1.20 | $5.00 |
| GLM-5 MaaS | — | — | 200K | $1.00 | $3.20 |
| GLM 4.7 FlashX | — | 200K | $0.060 | $0.400 | |
| GLM-4.7 FP8 | — | — | 203K | $0.400 | $2.00 |
| GLM-4.7 MaaS | — | — | 200K | — | — |
| GLM-4.6V-Flash | Flash | 128K | — | — | |
| GLM-4.6V-Flash | Flash | 128K | — | — | |
| GLM-4.5 AirX | Airx | — | 128K | $1.10 | $4.50 |