Qwen3 VL 8B Instruct is Alibaba's language model with a 256K context window and up to 33K output tokens, available from 5 providers, starting at $0.080 / 1M input and $0.200 / 1M output. An instruction-tuned 8B vision-language model from the Qwen3 series, optimized for conversational multimodal tasks involving text and image inputs.
Specifications
Canonical IDalibaba-qwen3-vl-8b-instruct
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window256K tokens
Max Output33K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters8B
HuggingFace Likes874
HuggingFace Downloads (30d)3,765,920
HuggingFace Downloads (all-time)23,111,974
Release Date · 7 months ago
Benchmarks
Intelligence Index
14.3
#307
Coding Index
7.3
#323
Math Index
27.3
#182
MMLU-Pro
0.7
#217
GPQA
0.4
#344
HLE
0.0
#437
LiveCodeBench
0.3
#194
IFBench
0.3
#299
Time to First Token
0.95s
#319
SciCode
0.2
#351
AIME 2025
0.3
#182
LCR
0.2
#263
TerminalBench Hard
0.0
#285
TAU2
0.3
#225
Output TPS

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Novita logo
Novita
novita/qwen/qwen3-vl-8b-instruct
$0.080$0.500
OpenRouter logo
OpenRouter
qwen/qwen3-vl-8b-instruct
$0.080$0.500
Alibaba Qwen logo
Alibaba Qwen
qwen3-vl-8b-instruct
$0.180$0.700$0.090$0.350
Hugging Face logo
Hugging Face
together_ai:Qwen/Qwen3-VL-8B-Instruct
$0.180$0.680
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct
$0.200$0.200

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen3 VL 32B Instruct262K$0.104$0.416Available
Qwen3 VL 8B Instruct256K$0.080$0.200Current

Model IDs