Qwen3 VL 8B Thinking is Alibaba's language model with a 256K context window and up to 33K output tokens, available from 2 providers, starting at $0.117 / 1M input and $1.36 / 1M output. A reasoning-optimized 8B multimodal language model in the Qwen3-VL series, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences.
Specifications
Canonical IDalibaba-qwen3-vl-8b-thinking
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window256K tokens
Max Output33K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Parameters8B
HuggingFace Likes210
HuggingFace Downloads (30d)254,861
HuggingFace Downloads (all-time)2,113,655
Release Date · 8 months ago

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qwen3-vl-8b-thinking
$0.18$2.10$0.09$1.05
OpenRouter logo
OpenRouter
qwen/qwen3-vl-8b-thinking
$0.117$1.36

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3.7 Plus1.0M$0.320$1.28Available
Qwen3.7 Max1.0M$1.25$3.75Available
Qwen3.6 Max Preview262K$1.04$6.24Available
Qwen3.6 27B262K$0.288$3.17Available
Qwen3.6 35B A3B262K$0.140$1.00Available
Qwen3.6 Plus1.0M$0.325$1.95Available
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Next 80B A3B128K$0.140$1.20Available
Qwen3 VL 8B Thinking256K$0.117$1.36Current
Qwen3 Max262K$0.359$1.43Available

Model IDs

alibaba-qwen3-vl-8b-thinking
qwen/qwen3-vl-8b-thinking
qwen3-vl-8b-thinking