Gemma 3 4B Instruct is Google's language model with a 131K context window and up to 16K output tokens, available from 3 providers, starting at $0.04 / 1M input and $0.08 / 1M output. An instruction-tuned 4B Gemma 3 LLM supporting vision-language inputs for efficient multimodal tasks.
Specifications
Canonical IDgoogle-gemma-3-4b-instruct
TypeLanguage
StatusActive
CreatorGoogleGoogle
Providers
Context Window131K tokens
Max Output16K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters4B
HuggingFace Likes1,313
HuggingFace Downloads (30d)2,130,064
HuggingFace Downloads (all-time)17,437,981
Release Date · 1 year ago
Knowledge Cutoff · 2 years ago

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities3/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatchFlexPriority
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Amazon Bedrock logo
Amazon Bedrock
google.gemma-3-4b-it
$0.04$0.08$0.02$0.04$0.02$0.04$0.07$0.14
DeepInfra logo
DeepInfra
deepinfra/google/gemma-3-4b-it
$0.04$0.08
OpenRouter logo
OpenRouter
google/gemma-3-4b-it
$0.05$0.1

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Gemma 3 4B Instruct131K$0.040$0.080Current
Gemma 3 12B Instruct131K$0.050$0.100Available
Rnj 1 Instruct33K$0.150$0.150Available
Gemma EmbeddingAvailable

Model IDs

accounts/fireworks/models/gemma-3-4b-it
deepinfra/google/gemma-3-4b-it
google-gemma-3-4b-instruct
google.gemma-3-4b-it
google/gemma-3-4b-it
google/gemma-3-4b-it:free
huggingface-vlm-gemma-3-4b-instruct
lemonade/Gemma-3-4b-it-GGUF