Glm 4.7 Flash Pricing & Specs | AI Models

Glm 4.7 Flash is a text model from OpenRouter with a context window of 200K tokens and max output of 32K tokens. Pricing starts at 0.07 per million input tokens and 0.40 per million output tokens (cheapest at AWS Bedrock).

Capabilities

✓ Vision✓ Function Calling✓ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output

Specifications

Model Key	`openrouter/z-ai/glm-4.7-flash`
Provider	OpenRouter
Provider ID	openrouter
Mode	Text
Canonical Name	glm-4.7-flash
Context Window	200K tokens
Max Output	32K tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000070	0.070
Output Tokens	0.000400	0.400

Benchmarks

Intelligence Index	30.1#53
Coding Index	25.9#50
GPQA	0.6#109
HLE	0.1#68
IFBench	0.6#36
Time to First Token	0.91s#168
SciCode	0.3#77
LCR	0.3#72
TerminalBench Hard	0.2#46
TAU2	1.0#1

Price Comparison by Provider

Compare prices for Glm 4.7 Flash across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
AWS Bedrock	zai.glm-4.7-flash	0.070	0.400
OpenRouter	openrouter/z-ai/glm-4.7-flash	0.070	0.400

All Variants

All available versions, regions, and API endpoints for Glm 4.7 Flash.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
zai.glm-4.7-flash	AWS Bedrock	Text	0.070	0.400	200K	128K	no	yes
openrouter/z-ai/glm-4.7-flash	OpenRouter	Text	0.070	0.400	200K	32K	yes	yes

← Back to All Models