Name: Qwen2.5 VL 7B Instruct
Brand: Alibaba

Qwen2.5 VL 7B Instruct is Alibaba's language model with a 128K context window and up to 8K output tokens, starting at $0.2 / 1M input and $0.2 / 1M output. A 7-billion-parameter multimodal vision-language LLM from Alibaba's Qwen2.5-VL series, enabling efficient image-text understanding and generation.

Specifications
Canonical ID	`alibaba-qwen2-5-vl-7b-instruct`
Type	Language
Status	Active
Creator	Alibaba
Providers	Fireworks AI
Context Window	128K tokens
Max Output	8K tokens
Input Modalities	Text
Output Modalities	Text
Parameters	7B

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

US Dollar ($)

Per 1M tokens

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M
Fireworks AI `fireworks_ai/accounts/fireworks/models/qwen2p5-vl-7b-instruct`	$0.2	$0.2

Cost Calculator

US Dollar ($)

Preset:

Input tokens

Output tokens

Number of calls

Cheapest Instances to Run It

Cloud GPU instances that can host Qwen2.5 VL 7B Instruct, ranked by cheapest on-demand price. The model needs about 17 GB of GPU memory at FP16 precision (estimated from its parameter count), so treat the fit as guidance rather than a guarantee.

All clouds

FP16 (full precision)

US Dollar ($)

Instance	Cloud	GPU	VRAM	Price	Cheapest region
g2-standard-4	GCP	nvidia-l4	24 GB	$0.705/hr	us-east4
g6.xlarge	AWS	L4	22 GB	$0.805/hr	us-east-1
g2-standard-8	GCP	nvidia-l4	24 GB	$0.851/hr	us-east4
7 more instances can run Qwen2.5 VL 7B Instruct Unlock the full ranked list and FP8 / INT4 quantization with a CloudPrice subscription.

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Voyage Multimodal 3.5	—	—	—	—	Available
Qwen2.5 VL 72B Instruct	2025-02-01	131K	$0.130	$0.400	Available
Qwen2.5 VL 7B Instruct	—	128K	$0.200	$0.200	Current
Qwen2.5 VL 32B Instruct	—	128K	$0.200	$0.600	Available
Qwen2.5 VL 3B Instruct	—	128K	$0.200	$0.200	Available
Rolm OCR	—	128K	$0.200	$0.200	Available

Model IDs

accounts/fireworks/models/qwen2p5-vl-7b-instruct

alibaba-qwen2-5-vl-7b-instruct

fireworks_ai/accounts/fireworks/models/qwen2p5-vl-7b-instruct

qwen2.5-vl-7b-instruct

Qwen2.5 VL 7B Instruct

CapabilitiesAPIGET/api/v1/models/alibaba-qwen2-5-vl-7b-instruct

Pricing by ProviderAPIGET/api/v1/models/alibaba-qwen2-5-vl-7b-instruct/pricing

Cost CalculatorAPIGET/api/v1/models/alibaba-qwen2-5-vl-7b-instruct/pricing/calculate?input_tokens=1000000&output_tokens=500000

Cheapest Instances to Run ItAPIGET/api/v1/models/alibaba-qwen2-5-vl-7b-instruct/instances

VersionsAPIGET/api/v1/models?family=qwen2_5_vl

Model IDsAPIGET/api/v1/models/alibaba-qwen2-5-vl-7b-instruct

Capabilities

Pricing by Provider

Cost Calculator

Cheapest Instances to Run It

Versions

Model IDs