Qwen3 Vl 8 b Pricing & Specs | AI Models

Qwen3 Vl 8B is a text model from LlamaGate with a context window of 33K tokens and max output of 8K tokens. Pricing starts at 0.15 per million input tokens and 0.55 per million output tokens (cheapest at LlamaGate).

Capabilities

✓ Vision✓ Function Calling✗ Reasoning✓ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output

Specifications

Model Key	`llamagate/qwen3-vl-8b`
Provider	LlamaGate
Provider ID	llamagate
Mode	Text
Canonical Name	qwen-vl-3-8b
Context Window	33K tokens
Max Output	8K tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000150	0.150
Output Tokens	0.000550	0.550

Benchmarks

Intelligence Index	14.3#139
Coding Index	7.3#149
Math Index	27.3#87
MMLU-Pro	0.7#120
GPQA	0.4#158
HLE	0.0#219
LiveCodeBench	0.3#99
IFBench	0.3#124
Time to First Token	1.02s#174
SciCode	0.2#172
AIME 2025	0.3#87
LCR	0.2#120
TerminalBench Hard	0.0#124
TAU2	0.3#86

Price Comparison by Provider

Compare prices for Qwen3 Vl 8B across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
LlamaGate	llamagate/qwen3-vl-8b	0.150	0.550
Fireworks AI	fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct	0.200	0.200

All Variants

All available versions, regions, and API endpoints for Qwen3 Vl 8B.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct	Fireworks AI	Text	0.200	0.200	4K	4K	no	no
llamagate/qwen3-vl-8b	LlamaGate	Text	0.150	0.550	33K	8K	yes	yes

← Back to All Models