Qwen3 Vl 8B
Qwen3 Vl 8B is a text model from LlamaGate with a context window of 33K tokens and max output of 8K tokens. Pricing starts at 0.15 per million input tokens and 0.55 per million output tokens (cheapest at LlamaGate).
Capabilities
✓ Vision✓ Function Calling✗ Reasoning✓ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Specifications
| Model Key | llamagate/qwen3-vl-8b |
| Provider | LlamaGate |
| Provider ID | llamagate |
| Mode | Text |
| Canonical Name | qwen-vl-3-8b |
| Context Window | 33K tokens |
| Max Output | 8K tokens |
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | 0.000150 | 0.150 |
| Output Tokens | 0.000550 | 0.550 |
Price Comparison by Provider
Compare prices for Qwen3 Vl 8B across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price, $ | Output Price, $ |
|---|---|---|---|
| LlamaGate | llamagate/qwen3-vl-8b | 0.150 | 0.550 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct | 0.200 | 0.200 |
All Variants
All available versions, regions, and API endpoints for Qwen3 Vl 8B.
Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct | Text | 0.200 | 0.200 | 4K | 4K | no | no | |
| llamagate/qwen3-vl-8b | LlamaGate | Text | 0.150 | 0.550 | 33K | 8K | yes | yes |