Flash US is Alibaba's language model with a 1.0M context window and up to 33K output tokens, starting at $0.05 / 1M input and $0.4 / 1M output. A US-region-routed variant of Alibaba's Flash-tier LLM for low-latency inference.
Capabilities
Input1/5
Text✓
Image·
Audio·
Video·
PDF·
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·
Pricing by Provider
US Dollar ($)
Per 1M tokens
| Provider | Standard | Batch | ||
|---|---|---|---|---|
| Input $ / 1M | Output $ / 1M | Input $ / 1M | Output $ / 1M | |
| $0.05 | $0.4 | $0.025 | $0.2 | |
Cost Calculator
US Dollar ($)
Preset:
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| Flash US | — | 1.0M | $0.050 | $0.400 | Current |
| Flash | — | 1.0M | $0.050 | $0.400 | Available |
| MT Flash | — | 16K | $0.160 | $0.490 | Available |
Other Models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| HappyHorse 1.0 | — | — | — | — | — |
| Coder | — | — | 1.0M | $0.300 | $1.50 |
| Deep Research | — | — | 1.0M | $7.74 | $23.37 |
| Long | — | — | 10.0M | $0.072 | $0.287 |
| Image Max | Max | — | — | — | — |
| Coder Plus | Plus | — | 131K | $0.502 | $1.00 |
| Math Plus | Plus | — | 4K | $0.574 | $1.72 |
| Coder Turbo | Turbo | — | 131K | $0.287 | $0.861 |
| Doc Turbo | Turbo | — | 262K | $0.087 | $0.144 |
| Math Turbo | Turbo | — | 4K | $0.287 | $0.861 |