Name: Qwen3 VL 8B
Brand: Alibaba

Qwen3 VL 8B is Alibaba's language model with a 33K context window and up to 8K output tokens, starting at $0.15 / 1M input and $0.55 / 1M output. An 8B vision-language model from the Qwen3 series, providing capable multimodal understanding for image and text tasks at a mid-range parameter scale.

Specifications
Canonical ID	`alibaba-qwen3-vl-8b`
Type	Language
Status	Active
Creator	Alibaba
Providers	LlamaGate
Context Window	33K tokens
Max Output	8K tokens
Input Modalities	Image
Output Modalities	Text
Parameters	8B

Capabilities

Input1/5

Text·

Image✓

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities2/13

Reasoning·

Adaptive Reasoning·

Function Calling✓

Parallel Function Calling·

Structured Outputs✓

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

US Dollar ($)

Per 1M tokens

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M
LlamaGate `llamagate/qwen3-vl-8b`	$0.15	$0.55

Cost Calculator

US Dollar ($)

Preset:

Input tokens

Output tokens

Number of calls

Cheapest Instances to Run It

Cloud GPU instances that can host Qwen3 VL 8B, ranked by cheapest on-demand price. The model needs about 19 GB of GPU memory at FP16 precision (estimated from its parameter count), so treat the fit as guidance rather than a guarantee.

All clouds

FP16 (full precision)

US Dollar ($)

Instance	Cloud	GPU	VRAM	Price	Cheapest region
g2-standard-4	GCP	nvidia-l4	24 GB	$0.705/hr	us-east4
g6.xlarge	AWS	L4	22 GB	$0.805/hr	us-east-1
g2-standard-8	GCP	nvidia-l4	24 GB	$0.851/hr	us-east4
7 more instances can run Qwen3 VL 8B Unlock the full ranked list and FP8 / INT4 quantization with a CloudPrice subscription.

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Qwen Audio 3 Flash Realtime	—	—	$0.413	$4.13	Available
Qwen Audio 3 Plus Realtime	—	—	$0.688	$5.50	Available
Qwen Audio 3 Plus TTS	—	—	—	—	Available
EAGLE Qwen 2.5 3B Instruct	—	—	—	—	Available
Qwen3.7 Plus	2026-06-03	1.0M	$0.320	$1.28	Available
Qwen3.7 Max	2026-05-21	1.0M	$1.25	$3.75	Available
Qwen3.6 Max Preview	2026-04-27	262K	$1.04	$6.24	Available
Qwen3.6 27B	2026-04-27	262K	$0.150	$0.500	Available
Qwen3.6 35B A3B	2026-04-27	262K	$0.140	$0.450	Available
Qwen3.6 Plus	2026-04-02	1.0M	$0.325	$1.95	Available
Qwen3 VL 8B	—	33K	$0.150	$0.550	Current

Model IDs

alibaba-qwen3-vl-8b

llamagate/qwen3-vl-8b

qwen3-vl-8b-reasoning

Qwen3 VL 8B

CapabilitiesAPIGET/api/v1/models/alibaba-qwen3-vl-8b

Pricing by ProviderAPIGET/api/v1/models/alibaba-qwen3-vl-8b/pricing

Cost CalculatorAPIGET/api/v1/models/alibaba-qwen3-vl-8b/pricing/calculate?input_tokens=1000000&output_tokens=500000

Cheapest Instances to Run ItAPIGET/api/v1/models/alibaba-qwen3-vl-8b/instances

VersionsAPIGET/api/v1/models?family=qwen

Model IDsAPIGET/api/v1/models/alibaba-qwen3-vl-8b

Capabilities

Pricing by Provider

Cost Calculator

Cheapest Instances to Run It

Versions

Model IDs