Woven City AI Vision Engine is Wovenbytoyota's image to text model. A multimodal LLM that processes text and images/videos simultaneously, deployed on Amazon SageMaker and designed for visual question answering and spatial reasoning tasks.
wovenbytoyota-woven-city-ai-vision-engine |
| Image to Text |
| Active |
| Wovenbytoyota |
| Image |
| Text |
Capabilities
Input1/5
·
✓
·
·
·
Output1/5
✓
·
·
·
·
Capabilities0/13
·
·
·
·
·
·
·
·
·
·
·
·
·