Alibaba logo

Qwen3 8B FP8


Qwen3 8B FP8 is Alibaba logoAlibaba's language model with a 128K context window and up to 20K output tokens, starting at $0.035 / 1M input and $0.138 / 1M output. A compact 8B-parameter Qwen3 LLM quantized to FP8 precision, supporting seamless switching between reasoning and non-reasoning modes for efficient deployment.
Spec
Canonical IDalibaba-qwen3-8b-fp8
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window128K tokens
Max Output20K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Parameters8B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Novita logo
Novita
qwen/qwen3-8b-fp8
$0.035$0.138

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
DeepSeek R1 0528 Qwen3 8B128K$0.060$0.090Available
Qwen3 9.23 MaxAvailable
Qwen 7 28 Flash998KAvailable
Qwen 4 28 Plus129KAvailable
Qwen 3 32B128KAvailable
Qwen3.5-Flash1.0M$0.065$0.260Available
Qwen3.5 Plus 2026-02-151.0M$0.260$1.56Available
Qwen 1 25 Plus129KAvailable
Qwen3.5 Max258KAvailable
Qwen3.6 Plus1.0M$0.325$1.95Available
Qwen3 8B FP8128K$0.035$0.138Current

Model IDs