Gemini 2.5 Flash

ReplicateText

Gemini 2.5 Flash is a text model from Replicate. Pricing starts at $2.50 per million input tokens and $2.50 per million output tokens (cheapest at Google Gemini).

Specifications

Model Keyreplicate/google/gemini-2.5-flash
ProviderReplicate
LiteLLM Providerreplicate
ModeText
Canonical Namegemini-flash-2.5
Context WindowN/A tokens
Max OutputN/A

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.0025$2.50
Output Tokens$0.0025$2.50

Price Comparison by Provider

Compare prices for Gemini 2.5 Flash across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Databricksdatabricks/databricks-gemini-2-5-flash$0.300$2.50
Deepinfradeepinfra/google/gemini-2.5-flash$0.300$2.50
Replicatereplicate/google/gemini-2.5-flash$2.50$2.50
Vercel Ai Gatewayvercel_ai_gateway/google/gemini-2.5-flash$0.300$2.50
Google Vertex AIgemini-2.5-flash-preview-04-17$0.150$0.600
Google Geminigemini/gemini-2.5-flash-preview-04-17$0.150$0.600

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
DeepSeek R1 Distill Llama 8BNscaleText$0.025$0.025N/AN/Anono
DeepSeek R1 Distill Qwen 14BNscaleText$0.070$0.070N/AN/Anono
GPT-5 nanoReplicateText$0.050$0.400N/AN/Anoyes
Granite 3.3 8B InstructReplicateText$0.030$0.250N/AN/Anoyes
Llama 3.1 8B InstructNscaleText$0.030$0.030N/AN/Anono
Llama 3.3 70B Instruct Turbo FreeTogether AITextN/AN/AN/AN/Anoyes
Qwen2.5 Coder 32B InstructNscaleText$0.060$0.200N/AN/Anono
Qwen2.5 Coder 3B InstructNscaleText$0.010$0.030N/AN/Anono
Qwen2.5 Coder 7B InstructNscaleText$0.010$0.030N/AN/Anono
Titan Embed Text V2Vercel Ai GatewayText$0.020N/AN/AN/Anono