Zhipu AI logo

GLM-4.5 Air FP8


GLM-4.5 Air FP8 is Zhipu AI's language model with a 128K context window, starting at $0.200 / 1M input and $1.10 / 1M output. An FP8-quantized version of the GLM-4.5 Air MoE model, optimized for memory-efficient deployment while preserving agentic reasoning capabilities.
Specifications
Canonical IDzhipu-glm-4-5-air-fp8
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window128K tokens
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities3/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Together AI logo
Together AI
together_ai/zai-org/GLM-4.5-Air-FP8
$0.200$1.10

Cost Calculator

Preset:
Compares every provider & tier in USD

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
GLM-5V Turbo203K$1.20$4.00
GLM-5 Turbo203K$1.20$4.00
GLM-5.1 Non-Reasoning
GLM-5 Non-Reasoning
GLM-5 Code200K$1.20$5.00
GLM-4.6V FlashFlash128K
GLM-4 32B128K$0.100$0.100
GLM-4.7 FlashX200K$0.060$0.400
GLM-4.7 Non-Reasoning
GLM-4.6 Reasoning

Model IDs