Name: TFVision YOLOv7
Brand: Google

TFVision YOLOv7 is Google's image to text model. A TFVision-hosted YOLOv7 object detection model offering real-time detection with high accuracy across diverse visual categories.

Specifications
Canonical ID	`google-tfvision-yolov7`
Type	Image to Text
Status	Active
Creator	Google
Input Modalities	Image
Output Modalities	Text

Capabilities

Input1/5

Text·

Image✓

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
TFVision YOLOv7	—	—	—	—	Current
TFVision MoViNet VAR	—	—	—	—	Available
TFVision MoViNet VCN	—	—	—	—	Available

Model IDs

google_vertex_google/publishers/google/models/tfvision-yolov7
google-tfvision-yolov7