Glm 4.7 Flash

Glm 4.7 Flash is a text model from OpenRouter with a context window of 200K tokens and a maximum output of 32K tokens. Pricing starts at $0.07 per million input tokens and $0.40 per million output tokens (cheapest at AWS Bedrock).

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: openrouter/z-ai/glm-4.7-flash
Provider: OpenRouter
Provider ID: openrouter
Mode: Text
Canonical Name: glm-4.7-flash
Context Window: 200K tokens
Max Output: 32K tokens
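
The two limits above can be turned into a quick pre-flight check before sending a request. The following is a minimal sketch, not an official client: the function name `fits_limits` is hypothetical, "200K" and "32K" are read as 200,000 and 32,000 tokens, and it assumes that prompt and completion tokens share the same context window, which this page does not state.

```python
CONTEXT_WINDOW = 200_000  # tokens; "200K" read as 200,000 (assumption)
MAX_OUTPUT = 32_000       # tokens; "32K" read as 32,000 (assumption)


def fits_limits(prompt_tokens: int, max_new_tokens: int) -> bool:
    """Return True if a request stays within both listed limits."""
    if prompt_tokens < 0 or max_new_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # The requested completion length may not exceed the output cap.
    if max_new_tokens > MAX_OUTPUT:
        return False
    # Assumption: prompt and completion share one context window.
    return prompt_tokens + max_new_tokens <= CONTEXT_WINDOW
```

For example, `fits_limits(150_000, 32_000)` passes, while a 190,000-token prompt with the same output budget does not, because the total would exceed the window.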

Pricing

Type            Per 1K Tokens, $   Per 1M Tokens, $
Input Tokens    0.000070           0.070
Output Tokens   0.000400           0.400
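
To see how the per-token prices translate into the cost of a single request, a small estimator can help. This is a sketch under the assumption that billing is strictly linear in token counts, with no caching discounts or minimum charges; the helper name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the pricing table.
INPUT_PRICE_PER_M = Decimal("0.070")
OUTPUT_PRICE_PER_M = Decimal("0.400")


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming linear billing."""
    million = Decimal(1_000_000)
    total = INPUT_PRICE_PER_M * input_tokens + OUTPUT_PRICE_PER_M * output_tokens
    return total / million
```

With these prices, a request with 10,000 input tokens and 2,000 output tokens comes to $0.0007 + $0.0008 = $0.0015. `Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise.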

Benchmarks

Intelligence Index: 30.1 (#53)
Coding Index: 25.9 (#50)
GPQA: 0.6 (#109)
HLE: 0.1 (#68)
IFBench: 0.6 (#36)
Time to First Token: 0.91s (#168)
SciCode: 0.3 (#77)
LCR: 0.3 (#72)
TerminalBench Hard: 0.2 (#46)
TAU2: 1.0 (#1)

Price Comparison by Provider

Compare prices for Glm 4.7 Flash across different providers. The same model may be available through multiple providers at different price points.

Provider      Model Key                       Input Price, $   Output Price, $
AWS Bedrock   zai.glm-4.7-flash               0.070            0.400
OpenRouter    openrouter/z-ai/glm-4.7-flash   0.070            0.400

All Variants

All available versions, regions, and API endpoints for Glm 4.7 Flash.

Model Key                       Provider      Mode   Input Price, $   Output Price, $   Context   Max Output   Vision   Functions
zai.glm-4.7-flash               AWS Bedrock   Text   0.070            0.400             200K      128K         no       yes
openrouter/z-ai/glm-4.7-flash   OpenRouter    Text   0.070            0.400             200K      32K          yes      yes
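
Because the two variants differ in maximum output and vision support, choosing between them depends on the workload. The sketch below encodes the rows of the variants table and filters them by requirement. The `Variant` class and `pick_variants` function are hypothetical names, "K" is read as 1,000 tokens, and ranking by the sum of input and output price is an arbitrary heuristic rather than anything this page prescribes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    model_key: str
    provider: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    context: int         # tokens
    max_output: int      # tokens
    vision: bool
    functions: bool


# Rows copied from the variants table ("K" read as 1,000: an assumption).
VARIANTS = [
    Variant("zai.glm-4.7-flash", "AWS Bedrock",
            0.070, 0.400, 200_000, 128_000, False, True),
    Variant("openrouter/z-ai/glm-4.7-flash", "OpenRouter",
            0.070, 0.400, 200_000, 32_000, True, True),
]


def pick_variants(need_vision: bool = False, min_output: int = 0) -> list[Variant]:
    """Return variants meeting the requirements, cheapest first."""
    matches = [
        v for v in VARIANTS
        if (v.vision or not need_vision) and v.max_output >= min_output
    ]
    # Heuristic ranking: combined price, then model key to break ties.
    return sorted(matches, key=lambda v: (v.input_price + v.output_price, v.model_key))
```

Under these assumptions, a job that needs more than 32K output tokens is left with only the AWS Bedrock variant, a job that needs vision is left with only the OpenRouter variant, and a job that needs both has no match.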