Name: Kokoro 1 82M
Brand: Kokoro

Kokoro 1 82M is Kokoro's text to speech model. A compact 82M-parameter text-to-speech model offering lightweight, efficient speech synthesis suitable for low-latency and on-device deployment.

Specifications
Canonical ID	`kokoro-1-82m`
Type	Text to Speech
Status	Active
Creator	Kokoro
Input Modalities	Text
Output Modalities	Audio
Parameters	0.08B

Benchmarks
Elo Rating	1057 #208

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text·

Image·

Audio✓

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Cloud GPU instances that can host Kokoro 1 82M, ranked by cheapest on-demand price. The model needs about 0 GB of GPU memory at FP16 precision (estimated from its parameter count), so treat the fit as guidance rather than a guarantee.

All clouds

FP16 (full precision)

US Dollar ($)

Instance	Cloud	GPU	VRAM	Price	Cheapest region
g6f.large	AWS	L4	3 GB	$0.202/hr	us-east-1
Standard_NV4as_v4	Azure	AMD Radeon Instinct MI25	16 GB	$0.233/hr	westus2
g6f.xlarge	AWS	L4	3 GB	$0.237/hr	us-east-1
7 more instances can run Kokoro 1 82M Unlock the full ranked list and FP8 / INT4 quantization with a CloudPrice subscription.