Gemma 3 4B It GGUF
Gemma 3 4B It GGUF is a text model from
Lemonade (AMD) with a context window of 128K tokens and max output of 8K tokens.
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✓ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Specifications
| Model Key | lemonade/Gemma-3-4b-it-GGUF |
| Provider | |
| Provider ID | lemonade |
| Mode | Text |
| Canonical Name | gemma-3-4b |
| Context Window | 128K tokens |
| Max Output | 8K tokens |
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | N/A | N/A |
| Output Tokens | N/A | N/A |
Price Comparison by Provider
Compare prices for Gemma 3 4B It GGUF across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price, $ | Output Price, $ |
|---|---|---|---|
| LlamaGate | llamagate/gemma3-4b | 0.030 | 0.080 |
| lemonade/Gemma-3-4b-it-GGUF | N/A | N/A | |
| google.gemma-3-4b-it | 0.040 | 0.080 | |
| deepinfra/google/gemma-3-4b-it | 0.040 | 0.080 |
All Variants
All available versions, regions, and API endpoints for Gemma 3 4B It GGUF.
Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| google.gemma-3-4b-it | Text | 0.040 | 0.080 | 128K | 8K | yes | no | |
| deepinfra/google/gemma-3-4b-it | Text | 0.040 | 0.080 | 131K | 131K | no | yes | |
| lemonade/Gemma-3-4b-it-GGUF | Text | N/A | N/A | 128K | 8K | no | yes | |
| llamagate/gemma3-4b | LlamaGate | Text | 0.030 | 0.080 | 128K | 8K | yes | yes |