Gemma 3 4B is Google's language model with a 128K context window and up to 8K output tokens, starting at $0.030 / 1M input and $0.080 / 1M output. A 4-billion-parameter Gemma 3 open-weight LLM balancing multimodal capability with compact size.
Specifications
Canonical IDgoogle-gemma-3-4b
TypeLanguage
StatusActive
CreatorGoogleGoogle
Providers
Context Window128K tokens
Max Output8K tokens
Input ModalitiesImage
Output ModalitiesText
Parameters4B
Benchmarks
Intelligence Index
6.3
#461
Coding Index
2.9
#362
Math Index
12.7
#220
MMLU-Pro
0.4
#294
GPQA
0.3
#419
HLE
0.1
#257
LiveCodeBench
0.1
#291
AIME
0.1
#134
IFBench
0.3
#336
Time to First Token
SciCode
0.1
#407
MATH-500
0.8
#107
AIME 2025
0.1
#220
LCR
0.1
#313
TerminalBench Hard
0.0
#323
TAU2
0.0
#357
Output TPS
0.0
#364

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Other/Llamagate
llamagate/gemma3-4b
$0.030$0.080

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Gemma 4 31BAvailable
Gemma 4 31BAvailable
Gemma 4 26B A4BAvailable
Gemma 4 26B A4BAvailable
Gemma 4 E4BAvailable
Gemma 4 E2BAvailable
Gemma 4 E4BAvailable
Gemma 4 E2BAvailable
Gemma 4Available
Gemma 4 31B IT TurboAvailable
Gemma 3 4B128K$0.030$0.080Current

Model IDs