
GLM-4.5 Flash


GLM-4.5 Flash is Zhipu AI's language model with a 128K-token context window and up to 32K output tokens. It is a fast, lightweight tier of the GLM-4.5 MoE family, designed for low-latency agent and tool-use applications.
Spec
Canonical ID: zhipu-glm-4-5-flash
Type: Language
Status: Active
Creator: Zhipu AI
Context Window: 128K tokens
Max Output: 32K tokens
Input Modalities: Text
Output Modalities: Text
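
The two limits in the spec above interact when sizing a request. The following is a minimal sketch, not an official client: it assumes that "128K" and "32K" mean 128,000 and 32,000 tokens and that output tokens count against the context window, neither of which this page confirms. The function name `clamp_output_budget` is hypothetical.

```python
# Limits read from the spec above. The exact token counts are an assumption:
# "128K" is taken as 128,000 and "32K" as 32,000.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_000


def clamp_output_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return the largest output budget that fits both limits.

    Assumes prompt and output tokens share the same context window.
    Raises ValueError if the prompt leaves no room for any output.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt fills or exceeds the context window")
    # The budget is capped by the request, the model's output limit,
    # and whatever room the prompt leaves in the window.
    return min(requested_output, MAX_OUTPUT, remaining)
```

Under these assumptions, a 120,000-token prompt leaves room for at most 8,000 output tokens, even though the model's stated output limit is higher.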

Capabilities

Input (1 of 5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1 of 5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (1 of 13)
Supported: Function Calling
Not supported: Reasoning, Adaptive Reasoning, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
GLM-4.6V-Flash | - | 128K | - | - | Available
GLM-4.5 Flash | - | 128K | - | - | Current

Other models

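Model | Tier | Released | Context | Input / 1M | Output / 1M
GLM 5V Turbo | - | - | 203K | $1.20 | $4.00
GLM-5 Turbo | - | - | 203K | $1.20 | $4.00
GLM-5 Code | - | - | 200K | $1.20 | $5.00
GLM-5 MaaS | - | - | 200K | $1.00 | $3.20
GLM 4.7 FlashX | - | - | 200K | $0.060 | $0.400
GLM-4.7 FP8 | - | - | 203K | $0.400 | $2.00
GLM-4.7 MaaS | - | - | 200K | - | -
GLM-4.5 X | - | - | 128K | $2.20 | $8.90
GLM-4.5 Air FP8 | Air | - | 128K | $0.200 | $1.10
GLM-4.5 AirX | Airx | - | 128K | $1.10 | $4.50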
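
Prices in this list are quoted per one million tokens, so the cost of a single request is each token count times its rate, divided by 1,000,000. The helper below is an illustrative sketch with a hypothetical name (`estimate_cost`). It assumes input and output tokens are billed separately at the listed rates, with no caching discounts or minimum charges, which this page does not state.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are listed per 1M tokens


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_per_m: str, output_per_m: str) -> Decimal:
    """Estimate the USD cost of one request from per-1M-token prices.

    Prices are passed as strings (e.g. "0.200") so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_per_m)
    output_cost = Decimal(output_tokens) * Decimal(output_per_m)
    return (input_cost + output_cost) / TOKENS_PER_UNIT


# Example with the GLM-4.5 Air FP8 row ($0.200 input, $1.10 output):
# 50,000 input tokens and 2,000 output tokens.
cost = estimate_cost(50_000, 2_000, "0.200", "1.10")
```

For that example the estimate comes to $0.0122: $0.0100 for input plus $0.0022 for output.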

Model IDs