Alibaba logo

QVQ 72B Preview


QVQ 72B Preview is Alibaba logoAlibaba's language model with a 33K context window and up to 16K output tokens, starting at $1.72 / 1M input and $5.16 / 1M output. A 72B multimodal reasoning model from Alibaba's QvQ series, offering image-text-to-text capabilities with a focus on visual question answering and complex reasoning.
Spec
Canonical IDalibaba-qvq-72b-preview
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window33K tokens
Max Output16K tokens
Input ModalitiesText
Output ModalitiesText
Parameters72B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qvq-72b-preview
$1.72$5.16$0.861$2.58

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
QVQ 72B Preview33K$1.72$5.16Current
QVQ Max131K$1.20$4.80Available
QVQ Plus131K$0.287$0.717Available

Model IDs