GLM-4.6V is Zhipu AI's language model with a 131K context window and up to 33K output tokens, available from 4 providers, starting at $0.3 / 1M input and $0.9 / 1M output. A multimodal MoE vision-language model from Z AI in the GLM-V family, supporting vision, file input, and scalable reinforcement learning-based reasoning.
Specifications
Canonical IDzhipu-glm-4-6v
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window131K tokens
Max Output33K tokens
Input ModalitiesImagePDFTextVideo
Output ModalitiesText
Reasoning Effortsdefault
Parameters108B
HuggingFace Likes390
HuggingFace Downloads (30d)6,998
HuggingFace Downloads (all-time)405,792
Release Date · 6 months ago
Benchmarks
Intelligence Index
17.1
#265
Coding Index
11.1
#285
Math Index
26.3
#186
MMLU-Pro
0.8
#162
GPQA
0.6
#285
HLE
0.0
#428
LiveCodeBench
0.4
#162
IFBench
0.3
#349
Time to First Token
1.55s
#396
SciCode
0.3
#274
AIME 2025
0.3
#186
LCR
0.1
#288
TerminalBench Hard
0.0
#292
TAU2
0.3
#231
Output TPS
68.0
#171

Capabilities

Input4/5
Text
Image
Audio·
Video
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities6/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Hugging Face logo
Hugging Face
novita:zai-org/glm-4.6v
$0.3$0.9N/A
Novita logo
Novita
novita/zai-org/glm-4.6v
$0.3$0.9$0.055
OpenRouter logo
OpenRouter
z-ai/glm-4.6v
$0.3$0.9$0.055
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.6v
$0.3$0.9$0.05

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GLM-4.6V131K$0.300$0.900Current
GLM-4.5V131K$0.600$1.20Available

Model IDs

glm-4-6v
glm-4-6v-reasoning
novita/zai-org/glm-4.6v
z-ai/glm-4.6v
zai-org/glm-4.6v
zai-org/GLM-4.6V
zai/glm-4.6v
zhipu-glm-4-6v