CosyVoice 3.5 Plus is Alibaba's language model, starting at $0.22 / 1M input. An enhanced-tier CosyVoice 3.5 TTS model offering higher quality speech synthesis with richer expressiveness.
Specifications
Canonical IDalibaba-cosyvoice-3-5-plus
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatch
Input
$ / 1M
Input
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
cosyvoice-v3.5-plus
$0.22$0.11

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
CosyVoice 3.5 Plus$0.220Current
CosyVoice 3 Plus$0.260Available

Other Models

ModelTierReleasedContextInput / 1MOutput / 1M
CosyVoice 3 FlashFlash$0.130
CosyVoice 3.5 FlashFlash$0.116
CosyVoice 2$0.287

Model IDs

alibaba-cosyvoice-3-5-plus
cosyvoice-v3.5-plus