Alibaba logo

Flash US


Flash US is Alibaba logoAlibaba's language model with a 1.0M context window and up to 33K output tokens, starting at $0.050 / 1M input and $0.400 / 1M output. A US-region-routed variant of Alibaba's Flash-tier LLM for low-latency inference.
Spec
Canonical IDalibaba-flash-us
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window1.0M tokens
Max Output33K tokens
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qwen-flash-us
$0.050$0.400$0.025$0.200

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Flash US1.0M$0.050$0.400Current
Flash1.0M$0.050$0.400Available
MT Flash16K$0.160$0.490Available

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Coder1.0M$0.300$1.50
Deep Research1.0M$7.74$23.37
Long10.0M$0.072$0.287
MT Lite16K$0.120$0.360
Image MaxMax
Coder PlusPlus131K$0.502$1.00
Math PlusPlus4K$0.574$1.72
Coder TurboTurbo131K$0.287$0.861
Doc TurboTurbo262K$0.087$0.144
Math TurboTurbo4K$0.287$0.861

Model IDs