ImageBind is Meta's multimodal embedding model. It binds representations across six modalities (images, text, audio, depth, thermal, and IMU) into a shared embedding space.
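Because all modalities land in one embedding space, an embedding from one modality can be compared directly with embeddings from another. The sketch below illustrates that idea with cosine similarity over toy vectors. It does not call ImageBind: the vectors, the `nearest` helper, and the example labels are made up for illustration, and real embeddings would have far more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest(query, candidates):
    """Return the key of the candidate vector most similar to the query."""
    return max(candidates, key=lambda name: cosine_similarity(query, candidates[name]))


# Toy 3-dimensional vectors standing in for text embeddings (assumed values).
text_embeddings = {
    "a dog barking": [0.9, 0.1, 0.0],
    "rain on a roof": [0.1, 0.9, 0.2],
}

# Hypothetical embedding of an audio clip, placed in the same space.
audio_query = [0.8, 0.2, 0.1]

best_match = nearest(audio_query, text_embeddings)
print(best_match)
```

The same comparison works for any pair of modalities, which is what makes a shared space useful for cross-modal retrieval.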
Capabilities
Input: 1 of 5 supported
Output: 1 of 5 supported
Capabilities: 0 of 13 supported