Kimi K2 Turbo is Moonshot AI (Kimi)'s language model with a 256K context window and up to 16K output tokens. A high-throughput variant of Kimi K2 delivering up to 100 tokens per second, optimized for speed-critical tool-use and conversational applications.
Specifications
Canonical IDmoonshot-kimi-k2-turbo
TypeLanguage
StatusActive
CreatorMoonshot AI (Kimi)Moonshot AI (Kimi)
Providers
Context Window256K tokens
Max Output16K tokens
Input ModalitiesText
Output ModalitiesText
Release Date · 10 months ago

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Vercel AI Gateway logo
Vercel AI Gateway
moonshotai/kimi-k2-turbo
$1.15$8.00$0.15

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Kimi K2.7 Code High Speed262K$1.90$8.00Available
Kimi K2 Thinking Turbo262K$1.15$8.00Deprecated
Kimi K2 Turbo256KCurrent
Kimi K2.5 Non-ReasoningAvailable
Kimi K2 Instruct V0Available
Kimi K2 Preview262K$0.600$2.50Deprecated
Kimi K2 Turbo Preview262K$1.15$8.00Deprecated
Kimi K2.5 Thinking$0.600$3.00Available
Kimi K2.6 Thinking$0.950$4.00Available
Kimi262K$0.680$3.41Deprecated
Kimi Linear 48B A3B InstructAvailable

Model IDs

moonshot-kimi-k2-turbo
moonshotai/kimi-k2-turbo