Alibaba logo

Qwen3 VL 8B


Qwen3 VL 8B is Alibaba logoAlibaba's language model with a 33K context window and up to 8K output tokens, starting at $0.150 / 1M input and $0.550 / 1M output. An 8B vision-language model from the Qwen3 series, providing capable multimodal understanding for image and text tasks at a mid-range parameter scale.
Spec
Canonical IDalibaba-qwen3-vl-8b
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window33K tokens
Max Output8K tokens
Input ModalitiesImage
Output ModalitiesText
Parameters8B

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Other/Llamagate
llamagate/qwen3-vl-8b
$0.150$0.550

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Next 80B A3B128K$0.150$1.20Available
Qwen3 VL 235B A22B128K$0.530$2.66Available
Qwen3 VL 8B Thinking131K$0.117$1.36Available
Qwen3 VL 235B A22B Instruct131K$0.400$1.60Available
Qwen3 VL 235B A22B Thinking131K$0.400$4.00Available
Qwen3 Coder Plus1.0M$0.650$3.25Available
Qwen3 Max262K$0.359$1.43Available
Qwen3 Max Preview262K$1.20$6.00Available
Qwen3 VL 8B33K$0.150$0.550Current

Model IDs