Alibaba logo

QVQ Max


QVQ Max is Alibaba logoAlibaba's language model with a 131K context window and up to 8K output tokens, starting at $1.20 / 1M input and $4.80 / 1M output. The top-tier variant of Alibaba's QvQ multimodal model series, designed for demanding visual reasoning and image-text understanding tasks.
Spec
Canonical IDalibaba-qvq-max
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window131K tokens
Max Output8K tokens
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qvq-max
$1.20$4.80$0.600$2.40

Cost Calculator

Preset:
Compares every provider & tier in USD

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
QVQ 72B Preview33K$1.72$5.16
QVQ PlusPlus131K$0.287$0.717

Model IDs