GLM-4.7 Flash is Zhipu AI's language model with a 203K context window and up to 131K output tokens, available from 3 providers, starting at $0.060 / 1M input and $0.400 / 1M output. A lightweight 30B-A3B MoE model from Z AI that balances strong performance with efficiency, optimized for fast inference and agentic tasks.
Specifications
Canonical IDzhipu-glm-4-7-flash
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window203K tokens
Max Output131K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Parameters31.2B
HuggingFace Likes1,708
HuggingFace Downloads (30d)682,370
HuggingFace Downloads (all-time)4,269,790
Release Date · 4 months ago
Benchmarks
Intelligence Index
30.1
#131
Coding Index
25.9
#127
GPQA
0.6
#261
HLE
0.1
#186
IFBench
0.6
#91
Time to First Token
0.88s
#309
SciCode
0.3
#184
LCR
0.3
#181
TerminalBench Hard
0.2
#122
TAU2
1.0
#3
Output TPS
80.0
#143

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatchFlexPriority
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
z-ai/glm-4.7-flash
$0.060$0.400$0.010
Amazon Bedrock logo
Amazon Bedrock
zai.glm-4.7-flash
$0.070$0.400N/A$0.035$0.200$0.035$0.200$0.122$0.700
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.7-flash
$0.070$0.400N/A

Cost Calculator

Preset:

Model IDs