Qwen2.5 72B Instruct is a 72-billion-parameter, instruction-tuned language model from Alibaba's Qwen2.5 series, suited to natural language understanding, summarization, and dialogue. It has a 131K-token context window, generates up to 16K output tokens, and is available from 8 providers, with prices starting at $0.120 per 1M input tokens and $0.300 per 1M output tokens.
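As a rough illustration of what the starting prices mean for a single request, here is a minimal sketch. It assumes billing is linear in token count at the quoted lowest-provider rates; the function name and the example workload are hypothetical, and individual providers may charge more.

```python
# Starting prices quoted in the listing (USD per 1M tokens).
INPUT_USD_PER_M = 0.120
OUTPUT_USD_PER_M = 0.300


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 100,000 prompt tokens and 10,000 completion tokens.
    print(f"${estimate_cost_usd(100_000, 10_000):.4f}")  # prints $0.0150
```

At these rates, output tokens cost 2.5 times as much as input tokens, so long completions dominate the bill sooner than long prompts do.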
Specifications
Canonical ID: alibaba-qwen2-5-72b-instruct
Type: Language
Status: Active
Creator: Alibaba
Providers: 8
Context Window: 131K tokens
Max Output: 16K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 72B
HuggingFace Likes: 927
HuggingFace Downloads (30d): 457,915
HuggingFace Downloads (all-time): 5,817,981
Release Date: 2 years ago
Knowledge Cutoff: not listed
Benchmarks
Intelligence Index: 15.6 (#280)
Coding Index: 11.9 (#270)
Math Index: 14.0 (#213)
MMLU-Pro: 0.7 (#190)
GPQA: 0.5 (#319)
HLE: 0.0 (#362)
LiveCodeBench: 0.3 (#223)
AIME: 0.2 (#98)
IFBench: 0.4 (#258)
Time to First Token: 1.15s (#349)
SciCode: 0.3 (#273)
MATH-500: 0.9 (#81)
AIME 2025: 0.1 (#213)
LCR: 0.2 (#245)
TerminalBench Hard: 0.0 (#249)
TAU2: 0.3 (#203)
Output TPS: 55.1 (#202)

Capabilities

Input (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, PDF
Output (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding
Capabilities (4/13 supported)
Supported: Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema
Not supported: Reasoning, Adaptive Reasoning, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Dolphin 2.9.2 Qwen2 72B | | 131K | $0.900 | $0.900 | Available
DeepSeek R1 Distill Qwen 32B | | 131K | $0.150 | $0.150 | Available
DeepSeek R1 Distill Qwen 14B | | 131K | $0.070 | $0.070 | Available
DeepSeek R1 Distill Qwen 1.5B | | 131K | $0.090 | $0.090 | Available
Cogito V1 Preview Qwen 14B | | 131K | $0.200 | $0.200 | Available
Cogito V1 Preview Qwen 32B | | 131K | $0.900 | $0.900 | Available
DeepSeek R1 Distill Qwen 7B | | 131K | $0.072 | $0.144 | Available
QwQ 32B | | 131K | $0.150 | $0.200 | Available
Qwen2.5 Coder 32B Instruct | | 131K | $0.050 | $0.100 | Available
Qwen2.5 7B Instruct | | 131K | $0.040 | $0.070 | Available
Qwen2.5 72B Instruct | | 131K | $0.120 | $0.300 | Current
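
Because input and output prices differ for several of the listed versions, a single blended figure can make them easier to compare. The sketch below ranks the rows above by a weighted price. The 75% input share is an assumed workload mix, not something the listing states, and the helper names are hypothetical. It compares price only and says nothing about model quality.

```python
# (name, input USD per 1M tokens, output USD per 1M tokens), copied from the table above.
VERSIONS = [
    ("Dolphin 2.9.2 Qwen2 72B", 0.900, 0.900),
    ("DeepSeek R1 Distill Qwen 32B", 0.150, 0.150),
    ("DeepSeek R1 Distill Qwen 14B", 0.070, 0.070),
    ("DeepSeek R1 Distill Qwen 1.5B", 0.090, 0.090),
    ("Cogito V1 Preview Qwen 14B", 0.200, 0.200),
    ("Cogito V1 Preview Qwen 32B", 0.900, 0.900),
    ("DeepSeek R1 Distill Qwen 7B", 0.072, 0.144),
    ("QwQ 32B", 0.150, 0.200),
    ("Qwen2.5 Coder 32B Instruct", 0.050, 0.100),
    ("Qwen2.5 7B Instruct", 0.040, 0.070),
    ("Qwen2.5 72B Instruct", 0.120, 0.300),
]


def blended_price(input_price: float, output_price: float, input_share: float = 0.75) -> float:
    """Weighted USD price per 1M tokens for a workload with the given input share."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_price + (1.0 - input_share) * output_price


def rank_by_blended_price(versions, input_share: float = 0.75):
    """Return the versions sorted from cheapest to most expensive blended price."""
    return sorted(versions, key=lambda v: blended_price(v[1], v[2], input_share))


if __name__ == "__main__":
    for name, inp, out in rank_by_blended_price(VERSIONS):
        print(f"{name}: ${blended_price(inp, out):.4f} per 1M blended tokens")
```

Changing `input_share` shifts the ranking toward versions with cheaper output (lower share) or cheaper input (higher share).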

Model IDs