Name: Inworld Realtime TTS 1.5 Max
Brand: Inworld

Inworld Realtime TTS 1.5 Max is a high-quality multilingual text-to-speech model from Inworld AI. It supports 130+ preset voices across 15 languages, along with voice cloning, word-level timestamps, and streaming.

Specifications
Canonical ID	`inworld-realtime-tts-1-5-max`
Type	Text to Speech
Status	Active
Creator	Inworld
Input Modalities	Text
Output Modalities	Audio

Capabilities

Input (1/5)
Text	✓
Image	·
Audio	·
Video	·
PDF	·

Output (1/5)
Text	·
Image	·
Audio	✓
Video	·
Embedding	·

Capabilities (0/13)
Reasoning	·
Adaptive Reasoning	·
Function Calling	·
Parallel Function Calling	·
Structured Outputs	·
Native JSON Schema	·
Web Search	·
URL Context	·
Computer Use	·
Code Execution	·
File Search	·
Prompt Caching	·
Assistant Prefill	·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Inworld Realtime TTS 1.5 Max	—	—	—	—	Current
Inworld TTS 1.5 Max	—	—	—	—	Available

Other Models

Model	Tier	Released	Context	Input / 1M	Output / 1M
Inworld Realtime TTS 2	—	—	—	—	—
Step TTS 2	—	—	—	—	—
StyleTTS 2	—	—	—	—	—
TTS HD 2.5	—	—	—	—	—
TTS 1	—	2023-11-06	—	$15.00	—
TTS 1 HD	—	2023-11-06	—	$30.00	—
Inworld TTS 1 Max	—	—	—	—	—
Inworld Realtime TTS 1.5 Mini	Mini	—	—	—	—
Inworld TTS 1.5 Mini	Mini	—	—	—	—
Azure Neural	—	—	—	—	—

Model IDs

inworld-ai/realtime-tts-1.5-max

inworld-realtime-tts-1-5-max

Inworld Realtime TTS 1.5 Max
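
The identifiers above can be treated as aliases for one model. The following is a minimal sketch, assuming all three strings resolve to the canonical ID given under Specifications; the `resolve_model_id` helper and its case-insensitive matching are hypothetical and not part of any documented API.

```python
# Hypothetical alias table built only from the identifiers listed on this page.
CANONICAL_ID = "inworld-realtime-tts-1-5-max"

ALIASES = {
    "inworld-ai/realtime-tts-1.5-max": CANONICAL_ID,
    "inworld-realtime-tts-1-5-max": CANONICAL_ID,
    "inworld realtime tts 1.5 max": CANONICAL_ID,  # display name, lowercased
}


def resolve_model_id(name: str) -> str:
    """Return the canonical ID for a known alias, or raise KeyError."""
    # Assumption: matching ignores case and surrounding whitespace.
    key = name.strip().lower()
    return ALIASES[key]
```

An unknown name raises `KeyError`, so callers can decide whether to fall back to a default model or report the error.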

API

Section	Method	Endpoint
Capabilities	GET	`/api/v1/models/inworld-realtime-tts-1-5-max`
Versions	GET	`/api/v1/models?family=tts`
Other Models	GET	`/api/v1/models/inworld-realtime-tts-1-5-max/similar`
Model IDs	GET	`/api/v1/models/inworld-realtime-tts-1-5-max`
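
The endpoint paths above can be assembled into full request URLs. The following is a minimal sketch using only the Python standard library; the host is not given on this page, so the base URL is left as a caller-supplied placeholder, and `build_endpoints` is a hypothetical helper name.

```python
from urllib.parse import quote, urlencode, urljoin

MODEL_ID = "inworld-realtime-tts-1-5-max"


def build_endpoints(base_url: str, model_id: str = MODEL_ID) -> dict[str, str]:
    """Map each page section to the GET URL listed for it."""
    # Percent-encode the ID so it is safe as a single path segment.
    model_path = f"/api/v1/models/{quote(model_id, safe='')}"
    return {
        "capabilities": urljoin(base_url, model_path),
        "versions": urljoin(base_url, "/api/v1/models?" + urlencode({"family": "tts"})),
        "other_models": urljoin(base_url, model_path + "/similar"),
        "model_ids": urljoin(base_url, model_path),
    }


# Placeholder host: replace with the real catalog host before sending requests.
urls = build_endpoints("https://example.invalid")
```

Each URL could then be fetched with a plain GET (for example via `urllib.request.urlopen`). The response format is not documented here, so any parsing code would be a further assumption.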
