
GLM-4.5 Flash


GLM-4.5 Flash is a fast, lightweight language model in the GLM-4.5 series from Zhipu AI (Z AI), with a 128K-token context window and up to 32K output tokens. It is optimized for low-latency agentic and tool-use applications.
Specifications
Canonical ID: zhipu-glm-4-5-flash
Type: Language
Status: Active
Creator: Zhipu AI
Context Window: 128K tokens
Max Output: 32K tokens
Input Modalities: Text
Output Modalities: Text
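
The two token limits above interact: a long prompt can leave less room for output than the 32K cap allows. The sketch below illustrates that arithmetic under two assumptions not stated on this page: that "128K" and "32K" mean 128,000 and 32,000 tokens (they may instead mean 131,072 and 32,768), and that output tokens count against the context window. The names `ModelLimits`, `GLM_4_5_FLASH`, and `max_completion_tokens` are hypothetical and not part of any Zhipu AI SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelLimits:
    """Token limits for a model (hypothetical helper type)."""
    context_window: int
    max_output: int


# Assumption: "128K" and "32K" are read as 128,000 and 32,000 tokens.
GLM_4_5_FLASH = ModelLimits(context_window=128_000, max_output=32_000)


def max_completion_tokens(prompt_tokens: int,
                          limits: ModelLimits = GLM_4_5_FLASH) -> int:
    """Return the largest output budget that fits both limits.

    Assumption: prompt and output tokens share the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = limits.context_window - prompt_tokens
    # Capped by the output limit, and never negative.
    return max(0, min(limits.max_output, remaining))
```

With these assumed values, a 1,000-token prompt leaves the full 32,000-token output budget, while a 120,000-token prompt leaves only 8,000.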

Capabilities

Input (1/5)
Text: ✓
Image: ·
Audio: ·
Video: ·
PDF: ·

Output (1/5)
Text: ✓
Image: ·
Audio: ·
Video: ·
Embedding: ·

Capabilities (1/13)
Reasoning: ·
Adaptive Reasoning: ·
Function Calling: ✓
Parallel Function Calling: ·
Structured Outputs: ·
Native JSON Schema: ·
Web Search: ·
URL Context: ·
Computer Use: ·
Code Execution: ·
File Search: ·
Prompt Caching: ·
Assistant Prefill: ·

Versions

Version                      Released  Context  Input / 1M  Output / 1M  Status
GLM-4.6V Flash                         128K     -           -            Available
GLM-4.5 Flash                -         128K     -           -            Current
GLM-4.7 Flash Non-Reasoning  -         -        -           -            Available

Other models

Model                  Tier  Released  Context  Input / 1M  Output / 1M
GLM-5V Turbo           -               203K     $1.20       $4.00
GLM-5 Turbo            -               203K     $1.20       $4.00
GLM-5.1 Non-Reasoning  -     -         -        -           -
GLM-5 Non-Reasoning    -     -         -        -           -
GLM-5 Code             -     -         200K     $1.20       $5.00
GLM-4 32B              -               128K     $0.100      $0.100
GLM-4.7 FlashX         -               200K     $0.060      $0.400
GLM-4.7 Non-Reasoning  -     -         -        -           -
GLM-4.6 Reasoning      -     -         -        -           -
GLM-4.6V Reasoning     -     -         -        -           -

Model IDs