GLM-4.6V is Zhipu AI's language model with a 131K context window and up to 33K output tokens, available from 4 providers, starting at $0.300 / 1M input and $0.900 / 1M output. A multimodal MoE vision-language model from Z AI in the GLM-V family, supporting vision, file input, and scalable reinforcement learning-based reasoning.
Specifications
Canonical IDzhipu-glm-4-6v
TypeLanguage
StatusActive
CreatorZhipu AIZhipu AI
Providers
Context Window131K tokens
Max Output33K tokens
Input ModalitiesImagePdfTextVideo
Output ModalitiesText
Reasoning Effortsdefault
HuggingFace Likes390
HuggingFace Downloads (30d)6,998
HuggingFace Downloads (all-time)405,792
Release Date · 6 months ago
Benchmarks
Intelligence Index
17.1
#256
Coding Index
11.1
#278
Math Index
26.3
#186
MMLU-Pro
0.8
#162
GPQA
0.6
#278
HLE
0.0
#418
LiveCodeBench
0.4
#162
IFBench
0.3
#339
Time to First Token
1.32s
#375
SciCode
0.3
#267
AIME 2025
0.3
#186
LCR
0.1
#281
TerminalBench Hard
0.0
#284
TAU2
0.3
#222
Output TPS
42.3
#238

Capabilities

Input4/5
Text
Image
Audio·
Video
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities6/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Hugging Face logo
Hugging Face
novita:zai-org/glm-4.6v
$0.300$0.900N/A
Novita logo
Novita
novita/zai-org/glm-4.6v
$0.300$0.900$0.055
OpenRouter logo
OpenRouter
z-ai/glm-4.6v
$0.300$0.900$0.050
Vercel AI Gateway logo
Vercel AI Gateway
zai/glm-4.6v
$0.300$0.900$0.050

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GLM-4.6V131K$0.300$0.900Current
GLM-4.5V131K$0.600$1.20Available

Model IDs