Qwen3 4B Instruct 2507
Qwen3 4B Instruct 2507 is a text model offered by Fireworks AI with a context window of 262K tokens and a maximum output of 262K tokens. Pricing on Fireworks AI is $0.20 per million input tokens and $0.20 per million output tokens; the Lemonade (AMD) variant has no listed price.
Capabilities
✗ Vision, ✗ Function Calling, ✗ Reasoning, ✗ JSON Schema, ✗ System Messages, ✗ Web Search, ✗ Prompt Caching, ✗ Audio Input, ✗ Audio Output
Specifications
| Specification | Value |
|---|---|
| Model Key | fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507 |
| Provider | Fireworks AI |
| Provider ID | fireworks_ai |
| Mode | Text |
| Canonical Name | qwen-3-4b-2507 |
| Context Window | 262K tokens |
| Max Output | 262K tokens |
Pricing
| Type | Per 1K Tokens, $ | Per 1M Tokens, $ |
|---|---|---|
| Input Tokens | 0.000200 | 0.200 |
| Output Tokens | 0.000200 | 0.200 |
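As a rough illustration of how the per-million rates translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost` and the token counts are hypothetical; only the two prices come from the pricing table.

```python
# Per-million-token prices in USD, taken from the pricing table.
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 0.20


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 50,000-token prompt with a 2,000-token reply.
cost = estimate_cost(input_tokens=50_000, output_tokens=2_000)
print(f"${cost:.4f}")  # prints $0.0104
```

Because input and output are priced identically here, the cost depends only on the total token count; the two rates are kept separate so the sketch still works if they diverge.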
Price Comparison by Provider
Compare prices for Qwen3 4B Instruct 2507 across different providers. The same model may be available through multiple providers at different price points.
| Provider | Model Key | Input Price, $ | Output Price, $ |
|---|---|---|---|
| Lemonade (AMD) | lemonade/Qwen3-4B-Instruct-2507-GGUF | N/A | N/A |
| Fireworks AI | fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507 | 0.200 | 0.200 |
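When comparing providers programmatically, the `N/A` entries need explicit handling. The sketch below assumes `N/A` means "no price listed" rather than "free" and skips such rows; the `OFFERS` list and the `cheapest_priced` helper are hypothetical names, with values copied from the comparison table.

```python
from typing import Optional

# (model key, input $ per 1M, output $ per 1M); None stands for "N/A".
OFFERS: list[tuple[str, Optional[float], Optional[float]]] = [
    ("lemonade/Qwen3-4B-Instruct-2507-GGUF", None, None),
    ("fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507", 0.200, 0.200),
]


def cheapest_priced(offers) -> Optional[str]:
    """Return the model key with the lowest combined listed price, or None."""
    priced = [
        (inp + out, key)
        for key, inp, out in offers
        if inp is not None and out is not None  # skip rows without a listed price
    ]
    return min(priced)[1] if priced else None


print(cheapest_priced(OFFERS))
```

Ranking by the sum of input and output price is only one possible choice; a workload dominated by long prompts might weight the input price more heavily.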
All Variants
All available versions, regions, and API endpoints for Qwen3 4B Instruct 2507.
| Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507 | Fireworks AI | Text | 0.200 | 0.200 | 262K | 262K | no | no |
| lemonade/Qwen3-4B-Instruct-2507-GGUF | Lemonade (AMD) | Text | N/A | N/A | 262K | 33K | no | yes |
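The two variants differ in maximum output (262K versus 33K), so a request that fits one may not fit the other. The following sketch checks a request against a variant's limits. It assumes that prompt and output tokens share the context window, and it uses the rounded figures from the table (262K as 262,000 and 33K as 33,000) because exact token counts are not given; `VARIANTS` and `fits` are hypothetical names.

```python
# Rounded limits from the variants table; exact token counts are not given in the source.
VARIANTS = {
    "fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507": {
        "context": 262_000,
        "max_output": 262_000,
    },
    "lemonade/Qwen3-4B-Instruct-2507-GGUF": {
        "context": 262_000,
        "max_output": 33_000,
    },
}


def fits(model_key: str, prompt_tokens: int, requested_output: int) -> bool:
    """Check a request against a variant's limits (assumes a shared context window)."""
    limits = VARIANTS[model_key]
    within_output_cap = requested_output <= limits["max_output"]
    within_context = prompt_tokens + requested_output <= limits["context"]
    return within_output_cap and within_context


fireworks = "fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507"
lemonade = "lemonade/Qwen3-4B-Instruct-2507-GGUF"
print(fits(fireworks, 200_000, 50_000))  # True
print(fits(lemonade, 200_000, 50_000))   # False: exceeds the 33K output cap
```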