Qwen3 235B A22B FP8 Tput is Alibaba's language model with a 40K context window, starting at $0.200 / 1M input and $0.600 / 1M output. A throughput-optimized FP8-quantized variant of the Qwen3 235B A22B MoE model, balancing inference speed with large-scale reasoning capability.
Specifications
Canonical IDalibaba-qwen3-235b-a22b-fp8-tput
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window40K tokens
Input ModalitiesText
Output ModalitiesText
Parameters235B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Together AI logo
Together AI
together_ai/Qwen/Qwen3-235B-A22B-fp8-tput
$0.200$0.600

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3.7 Max1.0M$2.50$7.50Available
Qwen3.6 Max Preview262K$1.04$6.24Available
Qwen3.6 27B262K$0.300$3.20Available
Qwen3.6 35B A3B262K$0.150$1.00Available
Qwen3.6 Plus1.0M$0.179$1.07Available
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Next 80B A3B128K$0.140$1.20Available
Qwen3 Max262K$0.359$1.43Available
Qwen3 Max Preview262K$1.20$6.00Available
Qwen3 235B A22B FP8 Tput40K$0.200$0.600Current

Model IDs