LLaVA 7B is a 7-billion-parameter multimodal model from the LLaVA family, enabling visual question answering and image-grounded conversation. It has a 4K context window and supports up to 2K output tokens.
Capabilities
Input: 1/5 supported
Output: 1/5 supported
Capabilities: 1/13 supported
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| LLaVA 1.6 7B Mistral | — | 32K | $0.290 | $0.290 | Available |
| LLaVA 1.6 Mistral 7B | — | 32K | $0.290 | $0.290 | Available |
| LLaVA 7B | — | 4K | — | — | Current |
| LLaVA 7B | — | 4K | — | — | Available |
| LLaVA Yi 34B | — | 4K | $0.900 | $0.900 | Available |
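
The per-1M-token prices in the table can be turned into a rough per-request estimate. The sketch below is illustrative only: `PRICES_PER_MILLION` and `estimate_cost` are hypothetical names, the prices are copied from two priced rows of the table, and it assumes billing is linear in token count. How image inputs are counted as tokens is not stated here, so the token counts are taken as given.

```python
# Prices in USD per 1M tokens (input, output), copied from the Versions table.
# Rows without a listed price are omitted.
PRICES_PER_MILLION = {
    "LLaVA 1.6 Mistral 7B": (0.290, 0.290),
    "LLaVA Yi 34B": (0.900, 0.900),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts at the listed rates.
    """
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Example: 3,000 input tokens and 1,000 output tokens.
cost = estimate_cost("LLaVA 1.6 Mistral 7B", 3000, 1000)
print(f"${cost:.6f}")
```

Because the listed input and output rates are equal for these versions, only the total token count matters. Keeping the two rates separate lets the same helper handle versions where they differ.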