Alibaba logo

Flash


Flash is Alibaba logoAlibaba's language model with a 1.0M context window and up to 33K output tokens, starting at $0.050 / 1M input and $0.400 / 1M output. A lightweight, high-speed Alibaba LLM tier designed for low-latency inference on everyday tasks.
Spec
Canonical IDalibaba-flash
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window1.0M tokens
Max Output33K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qwen-flash
$0.050$0.400$0.025$0.200

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Flash1.0M$0.050$0.400Current
Flash US1.0M$0.050$0.400Available
MT Flash16K$0.160$0.490Available

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Coder1.0M$0.300$1.50
Deep Research1.0M$7.74$23.37
Long10.0M$0.072$0.287
MT Lite16K$0.120$0.360
Image MaxMax
Coder PlusPlus131K$0.502$1.00
Math PlusPlus4K$0.574$1.72
Coder TurboTurbo131K$0.287$0.861
Doc TurboTurbo262K$0.087$0.144
Math TurboTurbo4K$0.287$0.861

Model IDs