Qwen3 Vl 8B Instruct

Qwen3 Vl 8B Instruct is a text-mode model served by Fireworks AI with a context window of 4K tokens and a max output of 4K tokens. Fireworks AI charges $0.20 per million input tokens and $0.20 per million output tokens. LlamaGate has the lowest input price ($0.15 per million) but a higher output price ($0.55 per million).

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct
Provider: Fireworks AI
Provider ID: fireworks_ai
Mode: Text
Canonical Name: qwen-vl-3-8b
Context Window: 4K tokens
Max Output: 4K tokens

Pricing

Type | Per 1K Tokens, $ | Per 1M Tokens, $
Input Tokens | 0.000200 | 0.200
Output Tokens | 0.000200 | 0.200
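
As a rough illustration of how the per-1M rates translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost` and the example token counts are hypothetical. Only the two prices come from the table above.

```python
# Prices in USD per 1M tokens, taken from the Fireworks AI pricing table.
INPUT_PRICE_PER_M = 0.200
OUTPUT_PRICE_PER_M = 0.200


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a request with 3,000 prompt tokens and 1,000 completion tokens.
cost = estimate_cost(3_000, 1_000)
print(f"${cost:.6f}")
```

Because input and output are priced identically here, the cost depends only on the total token count.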

Benchmarks

Benchmark | Value | Rank
Intelligence Index | 14.3 | #139
Coding Index | 7.3 | #149
Math Index | 27.3 | #87
MMLU-Pro | 0.7 | #120
GPQA | 0.4 | #158
HLE | 0.0 | #219
LiveCodeBench | 0.3 | #99
IFBench | 0.3 | #124
Time to First Token | 1.02s | #174
SciCode | 0.2 | #172
AIME 2025 | 0.3 | #87
LCR | 0.2 | #120
TerminalBench Hard | 0.0 | #124
TAU2 | 0.3 | #86

Price Comparison by Provider

Compare prices for Qwen3 Vl 8B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider | Model Key | Input Price, $ per 1M tokens | Output Price, $ per 1M tokens
LlamaGate | llamagate/qwen3-vl-8b | 0.150 | 0.550
Fireworks AI | fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct | 0.200 | 0.200
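
Neither provider is cheapest for every workload, because LlamaGate is lower on input and Fireworks AI is lower on output. Setting 0.15·i + 0.55·o < 0.20·i + 0.20·o gives i > 7·o, so LlamaGate only wins when a request has more than seven input tokens per output token. The sketch below checks this numerically. The function names are hypothetical, and the prices are the per-1M figures listed above.

```python
# USD per 1M tokens, copied from the provider comparison above.
PRICES = {
    "LlamaGate": {"input": 0.150, "output": 0.550},
    "Fireworks AI": {"input": 0.200, "output": 0.200},
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the given provider."""
    price = PRICES[provider]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Name of the provider with the lowest estimated cost for this token mix."""
    return min(PRICES, key=lambda name: request_cost(name, input_tokens, output_tokens))


# Balanced request (1:1 ratio): the lower output price dominates.
print(cheapest_provider(1_000, 1_000))
# Prompt-heavy request (10:1 ratio, above the 7:1 break-even).
print(cheapest_provider(10_000, 1_000))
```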

All Variants

All available versions, regions, and API endpoints for Qwen3 Vl 8B Instruct.

Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions
fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct | Fireworks AI | Text | 0.200 | 0.200 | 4K | 4K | no | no
llamagate/qwen3-vl-8b | LlamaGate | Text | 0.150 | 0.550 | 33K | 8K | yes | yes
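
Since the two variants differ in context size and in listed vision and function support, choosing one amounts to filtering on requirements. The following is a minimal sketch of that selection. The `find_variants` helper and the field names are hypothetical, and the context and max output values are kept in thousands of tokens exactly as listed, without assuming whether "4K" means 4,000 or 4,096.

```python
# Rows copied from the variants listing. Context and max output are in
# thousands of tokens, as listed (4K -> 4, 33K -> 33).
VARIANTS = [
    {
        "model_key": "fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct",
        "provider": "Fireworks AI",
        "context_k": 4,
        "max_output_k": 4,
        "vision": False,
        "functions": False,
    },
    {
        "model_key": "llamagate/qwen3-vl-8b",
        "provider": "LlamaGate",
        "context_k": 33,
        "max_output_k": 8,
        "vision": True,
        "functions": True,
    },
]


def find_variants(min_context_k=0, need_vision=False, need_functions=False):
    """Return the model keys of variants that satisfy every requirement."""
    return [
        v["model_key"]
        for v in VARIANTS
        if v["context_k"] >= min_context_k
        and (v["vision"] or not need_vision)
        and (v["functions"] or not need_functions)
    ]


# Only the LlamaGate variant is listed with vision support.
print(find_variants(need_vision=True))
```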