Qwen3 235B A22B FP8 Tput is an Alibaba language model with a 40K context window, priced from $0.2 per 1M input tokens and $0.6 per 1M output tokens. It is a throughput-optimized, FP8-quantized variant of the Qwen3 235B A22B MoE model that balances inference speed with large-scale reasoning capability.
Capabilities
Input (1/5)
Text: ✓
Image: ·
Audio: ·
Video: ·
PDF: ·
Output (1/5)
Text: ✓
Image: ·
Audio: ·
Video: ·
Embedding: ·
Capabilities (0/13)
Reasoning: ·
Adaptive Reasoning: ·
Function Calling: ·
Parallel Function Calling: ·
Structured Outputs: ·
Native JSON Schema: ·
Web Search: ·
URL Context: ·
Computer Use: ·
Code Execution: ·
File Search: ·
Prompt Caching: ·
Assistant Prefill: ·
Pricing by Provider
Prices are in US dollars ($) per 1M tokens.
| Tier | Input $ / 1M | Output $ / 1M |
|---|---|---|
| Standard | $0.2 | $0.6 |
Cost Calculator
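The per-token arithmetic behind a cost estimate can be sketched in a few lines of Python. This is a minimal illustration, not the page's own calculator: the prices are the Standard rates listed above, while the function name `estimate_cost` and the sample token counts are hypothetical.

```python
# Standard rates from the pricing table above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.2
OUTPUT_PRICE_PER_M = 0.6


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count by its per-million price, then convert to dollars.
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example request: 30,000 input tokens and 2,000 output tokens (made-up numbers).
    cost = estimate_cost(30_000, 2_000)
    print(f"${cost:.4f}")
```

Because output tokens cost three times as much as input tokens at these rates, long generations affect the total more than long prompts of the same length.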
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| EAGLE Qwen 2.5 3B Instruct | — | — | — | — | Available |
| Qwen3.7 Plus | — | 1.0M | $0.400 | $1.60 | Available |
| Qwen3.7 Max | — | 1.0M | $1.25 | $3.75 | Available |
| Qwen3.6 Max Preview | — | 262K | $1.04 | $6.24 | Available |
| Qwen3.6 27B | — | 262K | $0.289 | $2.40 | Available |
| Qwen3.6 35B A3B | — | 262K | $0.150 | $1.00 | Available |
| Qwen3.6 Plus | — | 1.0M | $0.325 | $1.95 | Available |
| Qwen3 Max Thinking | — | 262K | $0.780 | $3.90 | Available |
| Qwen3 Next 80B A3B | — | 128K | $0.140 | $1.20 | Available |
| Qwen3 Max | — | 262K | $0.359 | $1.43 | Available |
| Qwen3 235B A22B FP8 Tput | — | 40K | $0.200 | $0.600 | Current |