Llama 4 Scout 17B 128e Instruct Maas

Google Vertex AIText

Llama 4 Scout 17B 128e Instruct Maas is a text model from Google Vertex AI with a context window of 10.0M tokens and max output of 10.0M tokens. Pricing starts at $0.25 per million input tokens and $0.70 per million output tokens.

Specifications

Model Keyvertex_ai/meta/llama-4-scout-17b-128e-instruct-maas
ProviderGoogle Vertex AI
LiteLLM Providervertex_ai-llama_models
ModeText
Canonical Namellama-scout-4-17b-128e
Context Window10.0M tokens
Max Output10.0M tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000250$0.250
Output Tokens$0.000700$0.700

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Llama 4 Scout 17B 16E InstructAzure AIText$0.200$0.78010.0M16Kyesyes
Llama 4 Scout 17B 16E Instruct FP8Meta LlamaTextN/AN/A10.0M4Knoyes
Llama 4 Scout 17B 16e Instruct MaasGoogle Vertex AIText$0.250$0.70010.0M10.0Mnoyes