Qwen3 Vl 8 b Instruct Pricing & Specs | AI Models

Qwen3 Vl 8B Instruct is a text model from Fireworks AI with a context window of 4K tokens and max output of 4K tokens. Pricing starts at 0.20 per million input tokens and 0.20 per million output tokens (cheapest at LlamaGate).

Capabilities

✗ Vision✗ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output

Specifications

Model Key	`fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct`
Provider	Fireworks AI
Provider ID	fireworks_ai
Mode	Text
Canonical Name	qwen-vl-3-8b
Context Window	4K tokens
Max Output	4K tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000200	0.200
Output Tokens	0.000200	0.200

Benchmarks

Intelligence Index	14.3#139
Coding Index	7.3#149
Math Index	27.3#87
MMLU-Pro	0.7#120
GPQA	0.4#158
HLE	0.0#219
LiveCodeBench	0.3#99
IFBench	0.3#124
Time to First Token	1.02s#174
SciCode	0.2#172
AIME 2025	0.3#87
LCR	0.2#120
TerminalBench Hard	0.0#124
TAU2	0.3#86

Price Comparison by Provider

Compare prices for Qwen3 Vl 8B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
LlamaGate	llamagate/qwen3-vl-8b	0.150	0.550
Fireworks AI	fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct	0.200	0.200

All Variants

All available versions, regions, and API endpoints for Qwen3 Vl 8B Instruct.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct	Fireworks AI	Text	0.200	0.200	4K	4K	no	no
llamagate/qwen3-vl-8b	LlamaGate	Text	0.150	0.550	33K	8K	yes	yes

← Back to All Models