Name: LLaVA 7B
Brand: Haotian Liu

LLaVA 7B is Haotian Liu's language model with a 4K context window and up to 2K output tokens. A 7B-parameter multimodal LLM combining a vision encoder with a language model for visual question answering and image-text understanding.

Spec
Canonical ID	`haotian-liu-llava-7b`
Type	Language
Status	Active
Creator	Haotian Liu
Context Window	4K tokens
Max Output	2K tokens
Input Modalities	Image
Output Modalities	Text
Parameters	7B

Capabilities

Input1/5

Text·

Image✓

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities1/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs✓

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
LLaVA 1.6 7B Mistral	—	32K	$0.290	$0.290	Available
LLaVA 1.6 Mistral 7B	—	32K	$0.290	$0.290	Available
LLaVA 7B	—	4K	—	—	Current
LLaVA 7B	—	4K	—	—	Available
LLaVA Yi 34B	—	4K	$0.900	$0.900	Available

Model IDs

haotian-liu-llava-7b
litellm/llamagate/llava-7b
llamagate/llava-7b