GLM-4.6V is Zhipu AI's language model with a 131K context window and up to 33K output tokens, available from 4 providers, starting at $0.3 / 1M input and $0.9 / 1M output. A multimodal MoE vision-language model from Z AI in the GLM-V family, supporting vision, file input, and scalable reinforcement learning-based reasoning.
Specifications
Canonical IDzhipu-glm-4-6v
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window131K tokens
Max Output33K tokens
Input ModalitiesImagePDFTextVideo
Output ModalitiesText
Reasoning Effortsdefault
Parameters108B
HuggingFace Likes390
HuggingFace Downloads (30d)6,998
HuggingFace Downloads (all-time)405,792
Release Date · 6 months ago
Benchmarks
Intelligence Index
11.0
#264
Math Index
26.3
#186
MMLU-Pro
0.8
#162
GPQA
0.6
#287
HLE
0.0
#430
LiveCodeBench
0.4
#162
IFBench
0.3
#351
Time to First Token
1.31s
#380
SciCode
0.3
#276
AIME 2025
0.3
#186
LCR
0.1
#290
TerminalBench Hard
0.0
#294
TAU2
0.3
#233
Output TPS
58.8
#196

Capabilities

Input4/5
Text
Image
Audio·
Video
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities6/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Hugging Face logo
Hugging Face
novita:zai-org/glm-4.6v
$0.3$0.9N/A
Novita logo
Novita
novita/zai-org/glm-4.6v
$0.3$0.9$0.055
OpenRouter logo
OpenRouter
z-ai/glm-4.6v
$0.3$0.9$0.055
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.6v
$0.3$0.9$0.05

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GLM-4.6V131K$0.300$0.900Current
GLM-4.5V131K$0.600$1.20Available

Model IDs

glm-4-6v
glm-4-6v-reasoning
novita/zai-org/glm-4.6v
z-ai/glm-4.6v
zai-org/glm-4.6v
zai-org/GLM-4.6V
zai/glm-4.6v
zhipu-glm-4-6v