Gemini 3.1 Flash Lite Preview Pricing & Specs | AI Models

Gemini 3.1 Flash-Lite Preview is a text model from Google Vertex AI with a context window of 1.0M tokens and max output of 66K tokens. Pricing starts at 0.25 per million input tokens and 1.50 per million output tokens (cheapest at Google Gemini).

Capabilities

✓ Vision✓ Function Calling✓ Reasoning✓ JSON Schema✓ System Messages✓ Web Search✓ Prompt Caching✓ Audio Input✗ Audio Output

Specifications

Model Key	`vertex_ai/gemini-3.1-flash-lite-preview`
Provider	Google Vertex AI
Provider ID	vertex_ai-language-models
Mode	Text
Canonical Name	gemini-flash-lite-3.1
Context Window	1.0M tokens
Max Output	66K tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000250	0.250
Output Tokens	0.0015	1.50
Cache Read (Input)	0.000025	0.025
Reasoning Tokens	0.0015	1.50

Benchmarks

Intelligence Index	33.5#34
Coding Index	30.1#39
GPQA	0.8#27
HLE	0.2#31
IFBench	0.8#3
Time to First Token	5.98s#213
SciCode	0.4#26
LCR	0.7#21
TerminalBench Hard	0.2#38
TAU2	0.3#78

Price Comparison by Provider

Compare prices for Gemini 3.1 Flash-Lite Preview across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
Google Gemini	gemini/gemini-3.1-flash-lite-preview	0.250	1.50
Google Vertex AI	gemini-3.1-flash-lite-preview	0.250	1.50

All Variants

All available versions, regions, and API endpoints for Gemini 3.1 Flash-Lite Preview.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
gemini/gemini-3.1-flash-lite-preview	Google Gemini	Text	0.250	1.50	1.0M	66K	yes	yes
gemini-3.1-flash-lite-preview	Google Vertex AI	Text	0.250	1.50	1.0M	66K	yes	yes
vertex_ai/gemini-3.1-flash-lite-preview	Google Vertex AI	Text	0.250	1.50	1.0M	66K	yes	yes

← Back to All Models