Alibaba logo

Qwen2 VL 7B Instruct


Qwen2 VL 7B Instruct is Alibaba logoAlibaba's language model with a 131K context window, available from 2 providers, starting at $0.020 / 1M input and $0.060 / 1M output. A 7-billion-parameter vision-language model from the Qwen2-VL series, designed for practical multimodal tasks including visual question answering and image captioning.
Spec
Canonical IDalibaba-qwen2-vl-7b-instruct
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window131K tokens
Input ModalitiesImage
Output ModalitiesText
Parameters7B

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Nebius logo
Nebius
Qwen/Qwen2-VL-7B-Instruct
$0.020$0.060
Fireworks AI logo
Fireworks AI
accounts/fireworks/models/qwen2-vl-7b-instruct
$0.200$0.200

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen2 VL 7B Instruct131K$0.020$0.060Current
Qwen2 VL 2B Instruct33K$0.100$0.100Available
Qwen2 VL 72B Instruct131K$0.130$0.400Available

Model IDs