GLM-4.7 Flash is Zhipu AI's language model with a 203K context window and up to 131K output tokens, available from 4 providers, starting at $0.060 / 1M input and $0.400 / 1M output. A lightweight 30B-A3B MoE model from Z AI that balances strong performance with efficiency, optimized for fast inference and agentic tasks.
Specifications
Canonical IDzhipu-glm-4-7-flash
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window203K tokens
Max Output131K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Parameters31.2B
HuggingFace Likes1,708
HuggingFace Downloads (30d)682,370
HuggingFace Downloads (all-time)4,269,790
Release Date · 4 months ago
Benchmarks
Intelligence Index
30.1
#137
Coding Index
25.9
#133
GPQA
0.6
#267
HLE
0.1
#192
IFBench
0.6
#96
Time to First Token
0.85s
#311
SciCode
0.3
#190
LCR
0.3
#187
TerminalBench Hard
0.2
#128
TAU2
1.0
#3
Output TPS
93.9
#131

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatchFlexPriority
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
z-ai/glm-4.7-flash
$0.060$0.400$0.010
Amazon Bedrock logo
Amazon Bedrock
zai.glm-4.7-flash
$0.070$0.400N/A$0.035$0.200$0.035$0.200$0.122$0.700
Hugging Face logo
Hugging Face
novita:zai-org/glm-4.7-flash
$0.070$0.400N/A
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.7-flash
$0.070$0.400N/A
View Amazon Bedrock

Cost Calculator

Preset:

Model IDs