Name: GLM-4.5 Flash
Brand: Zhipu AI

GLM-4.5 Flash is a language model from Zhipu AI (Z AI) with a 128K-token context window and up to 32K output tokens. It is a fast, lightweight variant of the GLM-4.5 series, optimized for low-latency agentic and tool-use applications.

Specifications
Canonical ID	`zhipu-glm-4-5-flash`
Type	Language
Status	Active
Creator	Zhipu AI
Context Window	128K tokens
Max Output	32K tokens
Input Modalities	Text
Output Modalities	Text
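
The context and output limits above can be turned into a simple budget check. The following is a minimal sketch, not an official client. It assumes that "128K" and "32K" mean 128,000 and 32,000 tokens, and that prompt and completion share the same context window. Neither is confirmed by this page. The function name `max_completion_tokens` is hypothetical.

```python
# Assumption: "128K" / "32K" are read as round thousands; the provider may
# instead mean 131,072 / 32,768. Adjust the constants if so.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_000


def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW,
                          max_output: int = MAX_OUTPUT) -> int:
    """Return how many output tokens can still be requested.

    Assumes prompt and completion tokens share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    # Never exceed the output cap, never go below zero.
    return max(0, min(max_output, remaining))
```

For a short prompt the output cap is the binding limit. For a prompt close to the context window, the remaining space is.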

Capabilities

Input (1/5)
Text	✓
Image	·
Audio	·
Video	·
PDF	·

Output (1/5)
Text	✓
Image	·
Audio	·
Video	·
Embedding	·

Capabilities (1/13)
Reasoning	·
Adaptive Reasoning	·
Function Calling	✓
Parallel Function Calling	·
Structured Outputs	·
Native JSON Schema	·
Web Search	·
URL Context	·
Computer Use	·
Code Execution	·
File Search	·
Prompt Caching	·
Assistant Prefill	·
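
The capability matrix above can be encoded as plain data so that application code can gate features on it. This is an illustrative sketch only. It assumes that ✓ means supported and · means not supported. The names `CAPABILITIES`, `supports`, and `summary` are hypothetical and are not part of any published API.

```python
# Assumption: "✓" = supported (True), "·" = not supported (False).
CAPABILITIES = {
    "input": {"Text": True, "Image": False, "Audio": False,
              "Video": False, "PDF": False},
    "output": {"Text": True, "Image": False, "Audio": False,
               "Video": False, "Embedding": False},
    "capabilities": {
        "Reasoning": False, "Adaptive Reasoning": False,
        "Function Calling": True, "Parallel Function Calling": False,
        "Structured Outputs": False, "Native JSON Schema": False,
        "Web Search": False, "URL Context": False, "Computer Use": False,
        "Code Execution": False, "File Search": False,
        "Prompt Caching": False, "Assistant Prefill": False,
    },
}


def supports(group: str, name: str) -> bool:
    """True if the named feature is marked as supported in the given group."""
    return CAPABILITIES[group].get(name, False)


def summary(group: str) -> str:
    """Return the 'supported/total' count shown in the group heading."""
    flags = CAPABILITIES[group]
    return f"{sum(flags.values())}/{len(flags)}"
```

A caller could, for example, check `supports("capabilities", "Function Calling")` before attaching tool definitions to a request.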

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
GLM-4.6V Flash	2025-09-30	128K	—	—	Available
GLM-4.5 Flash	—	128K	—	—	Current
GLM-4.7 Flash Non-Reasoning	—	—	—	—	Available

Other Models

Model	Tier	Released	Context	Input / 1M	Output / 1M
GLM-5.2	—	2026-06-16	1.0M	$0.930	$3.00
GLM-5.2 Fast	—	2026-06-16	1.0M	$3.00	$10.25
GLM-5V Turbo	—	2026-04-01	203K	$1.20	$4.00
GLM-5 Turbo	—	2026-03-15	262K	$1.20	$4.00
GLM-5.1 Non-Reasoning	—	—	—	—	—
GLM-5 Non-Reasoning	—	—	—	—	—
GLM-5 Code	—	—	200K	$1.20	$5.00
GLM-5.1 Fast	—	—	203K	$2.80	$8.80
GLM-5.1 NVFP4 MTP	—	—	203K	$1.40	$4.40
GLM-4.7 FlashX	—	2026-01-19	200K	$0.060	$0.400

Model IDs

zai/glm-4.5-flash

zhipu-glm-4-5-flash

GLM-4.5 Flash

Capabilities API: GET /api/v1/models/zhipu-glm-4-5-flash

Versions API: GET /api/v1/models?family=glm

Other Models API: GET /api/v1/models/zhipu-glm-4-5-flash/similar

Model IDs API: GET /api/v1/models/zhipu-glm-4-5-flash
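
The endpoint paths listed above can be assembled into request URLs as follows. This is a minimal sketch. The page does not state the API host, so `BASE_URL` is a placeholder, and the helper names are hypothetical. The response format is not documented here, so the sketch stops at building URLs.

```python
from urllib.parse import quote, urlencode, urljoin

# Assumption: placeholder host; replace with the real catalog host.
BASE_URL = "https://example.com"


def model_url(model_id: str, base: str = BASE_URL) -> str:
    """URL for a single model record (capabilities and model IDs)."""
    return urljoin(base, f"/api/v1/models/{quote(model_id, safe='')}")


def similar_url(model_id: str, base: str = BASE_URL) -> str:
    """URL for the 'Other Models' (similar models) listing."""
    return model_url(model_id, base) + "/similar"


def family_url(family: str, base: str = BASE_URL) -> str:
    """URL for the 'Versions' listing, filtered by model family."""
    return urljoin(base, "/api/v1/models") + "?" + urlencode({"family": family})
```

Each resulting string can be passed to any HTTP client with a GET request, for instance `urllib.request.urlopen`.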
