Alibaba logo

Qwen3 1.7B FP8 Draft 131072


Qwen3 1.7B FP8 Draft 131072 is Alibaba logoAlibaba's language model with a 131K context window, starting at $0.100 / 1M input and $0.100 / 1M output. A 1.7-billion-parameter FP8-quantized draft LLM from Alibaba's Qwen3 series with a 131,072-token context window for long-context speculative decoding.
Spec
Canonical IDalibaba-qwen3-1-7b-fp8-draft-131072
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window131K tokens
Input ModalitiesText
Output ModalitiesText
Parameters1.7B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-131072
$0.100$0.100

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Next 80B A3B128K$0.150$1.20Available
Qwen3 VL 235B A22B128K$0.530$2.66Available
Qwen3 VL 8B Thinking131K$0.117$1.36Available
Qwen3 VL 235B A22B Instruct131K$0.400$1.60Available
Qwen3 VL 235B A22B Thinking131K$0.400$4.00Available
Qwen3 Coder Plus1.0M$0.650$3.25Available
Qwen3 Max262K$0.359$1.43Available
Qwen3 Max Preview262K$1.20$6.00Available
Qwen3 1.7B FP8 Draft 131072131K$0.100$0.100Current

Model IDs