GPT-4 Turbo Vision Preview is OpenAI's language model with a 128K context window and up to 4K output tokens, starting at $10.00 / 1M input and $30.00 / 1M output. A preview-stage multimodal LLM combining GPT-4 Turbo's extended context with vision input capabilities for image understanding tasks.
Specifications
Canonical IDopenai-gpt-4-turbo-vision-preview
TypeLanguage
StatusActive
CreatorOpenAIOpenAI
Providers
Context Window128K tokens
Max Output4K tokens
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
azure/gpt-4-turbo-vision-preview
$10.00$30.00

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GPT-5.51.1M$5.00$30.00Available
GPT-5.4 Mini1.1M$0.750$4.50Available
GPT-5.4 Nano1.1M$0.200$1.25Available
GPT-5.41.1M$2.50$15.00Available
GPT-5.3 Codex400K$1.75$14.00Available
GPT-5.2 Codex400K$1.75$14.00Available
GPT-5.2410K$1.75$14.00Available
GPT-5.1410K$1.25$10.00Available
GPT-5.1 Codex400K$1.25$10.00Available
GPT-5.1 Codex Mini400K$0.250$2.00Available
GPT-4 Turbo Vision Preview128K$10.00$30.00Current

Model IDs