Qwen3.5 397B A17B FP8 is Alibaba's language model with a 262K context window, starting at $0.6 / 1M input and $3.60 / 1M output. A massive 397B-parameter MoE LLM in the Qwen3.5 series quantized to FP8 precision, with 17B active parameters for efficient large-scale inference.
Specifications
Canonical IDalibaba-qwen3-5-397b-a17b-fp8
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window262K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Parameters397B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Tensormesh
tensormesh/Qwen/Qwen3.5-397B-A17B-FP8
$0.6$3.60

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3.7 Plus1.0M$0.320$1.28Available
Qwen3.7 Max1.0M$1.25$3.75Available
Qwen3.6 Max Preview262K$1.04$6.24Available
Qwen3.6 27B262K$0.150$0.500Available
Qwen3.6 35B A3B262K$0.140$0.450Available
Qwen3.6 Plus1.0M$0.325$1.95Available
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Max262K$0.780$3.90Available
Qwen3 Coder 30B A3B262K$0.150$0.600Available
Qwen3.5 397B A17B FP8262K$0.600$3.60Current

Model IDs

alibaba-qwen3-5-397b-a17b-fp8
tensormesh/Qwen/Qwen3.5-397B-A17B-FP8