GLM-4.7 Flash is Zhipu AI's language model with a 203K context window and up to 131K output tokens, available from 4 providers, starting at $0.060 / 1M input and $0.400 / 1M output. A lightweight 30B-A3B MoE model from Z AI that balances strong performance with efficiency, optimized for fast inference and agentic tasks.
Specifications
Canonical IDzhipu-glm-4-7-flash
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window203K tokens
Max Output131K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Parameters31.2B
HuggingFace Likes1,708
HuggingFace Downloads (30d)682,370
HuggingFace Downloads (all-time)4,269,790
Release Date · 5 months ago
Benchmarks
Intelligence Index
30.1
#140
Coding Index
25.9
#136
GPQA
0.6
#271
HLE
0.1
#196
IFBench
0.6
#100
Time to First Token
0.99s
#332
SciCode
0.3
#194
LCR
0.3
#191
TerminalBench Hard
0.2
#131
TAU2
1.0
#3
Output TPS
77.8
#159

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatchFlexPriority
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Amazon Bedrock logo
Amazon Bedrock
zai.glm-4.7-flash
$0.070$0.400N/A$0.035$0.200$0.035$0.200$0.122$0.700
Hugging Face logo
Hugging Face
novita:zai-org/glm-4.7-flash
$0.070$0.400N/A
OpenRouter logo
OpenRouter
openrouter/z-ai/glm-4.7-flash
$0.070$0.400N/A
OpenRouter logo
OpenRouter
z-ai/glm-4.7-flash
$0.060$0.400$0.010
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.7-flash
$0.070$0.400N/A

Cost Calculator

Preset:

Model IDs