GLM-4.7 Flash is Zhipu AI's language model with a 203K context window and up to 131K output tokens, available from 3 providers, starting at $0.06 / 1M input and $0.4 / 1M output. A lightweight 30B-A3B MoE model from Z AI that balances strong performance with efficiency, optimized for fast inference and agentic tasks.
Specifications
Canonical IDzhipu-glm-4-7-flash
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window203K tokens
Max Output131K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Parameters31.2B
HuggingFace Likes1,708
HuggingFace Downloads (30d)682,370
HuggingFace Downloads (all-time)4,269,790
Release Date · 5 months ago
Benchmarks
Intelligence Index
22.9
#141
GPQA
0.6
#278
HLE
0.1
#203
IFBench
0.6
#104
Time to First Token
0.91s
#327
SciCode
0.3
#201
LCR
0.3
#197
TerminalBench Hard
0.2
#134
TAU2
1.0
#4
Output TPS
99.7
#123

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities5/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatchFlexPriority
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Amazon Bedrock logo
Amazon Bedrock
zai.glm-4.7-flash
$0.07$0.4N/A$0.035$0.2$0.035$0.2$0.122$0.7
OpenRouter logo
OpenRouter
z-ai/glm-4.7-flash
$0.06$0.4$0.01
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.7-flash
$0.07$0.4N/A

Cost Calculator

US Dollar ($)
Preset:

Model IDs

accounts/fireworks/models/glm-4p7-flash
cloudflare/@cf/zai-org/glm-4.7-flash
glm-4-7-flash
glm-4-7-flash-non-reasoning
openrouter/z-ai/glm-4.7-flash
z-ai/glm-4.7-flash
zai-org/glm-4.7-flash
zai-org/GLM-4.7-Flash
zai.glm-4.7-flash
zai/glm-4.7-flash
zhipu-glm-4-7-flash