Gemini 3.1 Flash-Lite Preview

Gemini 3.1 Flash-Lite Preview is a text model from Google Vertex AI with a context window of 1.0M tokens and max output of 66K tokens. Pricing starts at 0.25 per million input tokens and 1.50 per million output tokens (cheapest at Google Gemini).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keyvertex_ai/gemini-3.1-flash-lite-preview
ProviderGoogle Vertex AI
Provider IDvertex_ai-language-models
ModeText
Canonical Namegemini-flash-lite-3.1
Context Window1.0M tokens
Max Output66K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0002500.250
Output Tokens0.00151.50
Cache Read (Input)0.0000250.025
Reasoning Tokens0.00151.50

Benchmarks

Intelligence Index33.5#34
Coding Index30.1#39
GPQA0.8#27
HLE0.2#31
IFBench0.8#3
Time to First Token5.98s#213
SciCode0.4#26
LCR0.7#21
TerminalBench Hard0.2#38
TAU20.3#78

Price Comparison by Provider

Compare prices for Gemini 3.1 Flash-Lite Preview across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Google Geminigemini/gemini-3.1-flash-lite-preview0.2501.50
Google Vertex AIgemini-3.1-flash-lite-preview0.2501.50

All Variants

All available versions, regions, and API endpoints for Gemini 3.1 Flash-Lite Preview.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
gemini/gemini-3.1-flash-lite-previewGoogle GeminiText0.2501.501.0M66Kyesyes
gemini-3.1-flash-lite-previewGoogle Vertex AIText0.2501.501.0M66Kyesyes
vertex_ai/gemini-3.1-flash-lite-previewGoogle Vertex AIText0.2501.501.0M66Kyesyes