Alibaba logo

Qwen2 VL 2B Instruct


Qwen2 VL 2B Instruct is Alibaba logoAlibaba's language model with a 33K context window, starting at $0.100 / 1M input and $0.100 / 1M output. A compact 2-billion-parameter vision-language model from the Qwen2-VL series, enabling multimodal image and text understanding in lightweight deployments.
Spec
Canonical IDalibaba-qwen2-vl-2b-instruct
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window33K tokens
Input ModalitiesText
Output ModalitiesText
Parameters2B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Fireworks AI logo
Fireworks AI
accounts/fireworks/models/qwen2-vl-2b-instruct
$0.100$0.100

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen2 VL 2B Instruct33K$0.100$0.100Current
Qwen2 VL 72B Instruct131K$0.130$0.400Available
Qwen2 VL 7B Instruct131K$0.020$0.060Available

Model IDs