LLaVA Yi 34B is a 34B-parameter multimodal vision-language model from 01.AI that combines the LLaVA architecture with Yi's base model for visual question answering and image understanding. It has a 4K-token context window, supports up to 4K output tokens, and is priced from $0.900 per 1M input tokens and $0.900 per 1M output tokens.
Specifications
Canonical ID: 01-ai-llava-yi-34b
Type: Language
Status: Active
Creator: 01.AI
Providers: Fireworks AI
Context Window: 4K tokens
Max Output: 4K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 34B
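
The context window and max output limits above can be turned into a simple token budget check. The following is a minimal sketch, not provider behavior: it assumes "4K" means 4096 tokens and that prompt and completion tokens share the same context window. The function name `max_completion_tokens` is hypothetical.

```python
# Assumption: "4K" on this page means 4096 tokens.
CONTEXT_WINDOW = 4096
MAX_OUTPUT = 4096


def max_completion_tokens(
    prompt_tokens: int,
    context_window: int = CONTEXT_WINDOW,
    max_output: int = MAX_OUTPUT,
) -> int:
    """Return how many output tokens can still be requested.

    Assumes the prompt and the completion share one context window,
    which is common but not confirmed for this model.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    # Never exceed the max output limit, never go below zero.
    return max(0, min(remaining, max_output))
```

Under these assumptions, a 1,000-token prompt leaves room for at most 3,096 output tokens, and a prompt longer than the window leaves none.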

Capabilities

Input (1/5)
Text: yes
Image: no
Audio: no
Video: no
PDF: no

Output (1/5)
Text: yes
Image: no
Audio: no
Video: no
Embedding: no

Capabilities (0/13)
Reasoning: no
Adaptive Reasoning: no
Function Calling: no
Parallel Function Calling: no
Structured Outputs: no
Native JSON Schema: no
Web Search: no
URL Context: no
Computer Use: no
Code Execution: no
File Search: no
Prompt Caching: no
Assistant Prefill: no

Pricing by Provider

Provider      Model                                                Standard Input ($ / 1M)  Standard Output ($ / 1M)
Fireworks AI  fireworks_ai/accounts/fireworks/models/llava-yi-34b  $0.900                   $0.900
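
The per-million-token rates above translate into a request cost as follows. This is a minimal sketch that uses the listed Fireworks AI standard prices; the function name `estimate_cost` is hypothetical, and actual billing may differ (rounding, minimums, or tiers are not covered here).

```python
from decimal import Decimal

# Listed standard prices in USD per 1M tokens (from the pricing table).
INPUT_PRICE_PER_M = Decimal("0.900")
OUTPUT_PRICE_PER_M = Decimal("0.900")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_MILLION
```

For example, a request with 3,000 input tokens and 1,000 output tokens comes to $0.0036 at these rates. `Decimal` is used instead of floats so that small per-request amounts add up without binary rounding error.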

Versions

Version        Released  Context  Input / 1M  Output / 1M  Status
LLaVA Yi 34B   -         4K       $0.900      $0.900       Current
FireLLaVA 13B  -         4K       $0.200      $0.200       Available

Model IDs