Llama 3.2 3B Instruct
Llama 3.2 3B Instruct is a text model served by DeepInfra with a context window of 131K tokens and a maximum output of 131K tokens. On DeepInfra it costs $0.02 per million input tokens and $0.02 per million output tokens. Across the providers listed below, the lowest input price is $0.015 per million tokens at Lambda.
Capabilities
✗ Vision · ✓ Function Calling · ✗ Reasoning · ✗ JSON Schema · ✗ System Messages · ✗ Web Search · ✗ Prompt Caching · ✗ Audio Input · ✗ Audio Output
Specifications
| Field | Value |
|---|---|
| Model Key | deepinfra/meta-llama/Llama-3.2-3B-Instruct |
| Provider | DeepInfra |
| Provider ID | deepinfra |
| Mode | Text |
| Canonical Name | llama-3.2-3b |
| Context Window | 131K tokens |
| Max Output | 131K tokens |
Pricing
| Type | Per 1K Tokens, $ | Per 1M Tokens, $ |
|---|---|---|
| Input Tokens | 0.000020 | 0.020 |
| Output Tokens | 0.000020 | 0.020 |
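As a rough illustration of how the two columns relate, the sketch below converts the per-1M price to a per-1K price and estimates the cost of a single request. The prices are copied from the table above; the function names `per_1k` and `estimate_cost` and the token counts in the usage lines are hypothetical, chosen only for this example.

```python
from decimal import Decimal

# Prices copied from the Pricing table above ($ per 1M tokens).
INPUT_PRICE_PER_1M = Decimal("0.020")
OUTPUT_PRICE_PER_1M = Decimal("0.020")


def per_1k(price_per_1m: Decimal) -> Decimal:
    """Convert a per-1M-token price to a per-1K-token price."""
    return price_per_1m / Decimal(1000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost in $ of one request from its token counts."""
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_1M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_1M)
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    print(per_1k(INPUT_PRICE_PER_1M))
    print(estimate_cost(10_000, 2_000))
```

`Decimal` is used instead of `float` so that very small per-token amounts do not pick up binary rounding noise.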
Price Comparison by Provider
Compare prices for Llama 3.2 3B Instruct across different providers. The same model may be available through multiple providers at different price points.
| Provider | Model Key | Input Price, $ per 1M Tokens | Output Price, $ per 1M Tokens |
|---|---|---|---|
| | watsonx/meta-llama/llama-3-2-3b-instruct | 0.150 | 0.150 |
| | vercel_ai_gateway/meta/llama-3.2-3b | 0.150 | 0.150 |
| | together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo | N/A | N/A |
| | snowflake/llama3.2-3b | N/A | N/A |
| | sambanova/Meta-Llama-3.2-3B-Instruct | 0.080 | 0.160 |
| | novita/meta-llama/llama-3.2-3b-instruct | 0.030 | 0.050 |
| | meta.llama3-2-3b-instruct-v1:0 | 0.150 | 0.150 |
| LlamaGate | llamagate/llama-3.2-3b | 0.040 | 0.080 |
| Lambda | lambda_ai/llama3.2-3b-instruct | 0.015 | 0.025 |
| | hyperbolic/meta-llama/Llama-3.2-3B-Instruct | 0.120 | 0.300 |
| | fireworks_ai/accounts/fireworks/models/llama-v3p2-3b | 0.100 | 0.100 |
| DeepInfra | deepinfra/meta-llama/Llama-3.2-3B-Instruct | 0.020 | 0.020 |
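Because input and output prices differ between providers, which one is cheapest depends on the mix of input and output tokens in a workload. The sketch below ranks the model keys from the comparison table for a given mix, assuming the listed prices are per 1M tokens. The helper name `cheapest` and the example token counts are hypothetical; rows listed as N/A are skipped.

```python
from typing import Dict, Optional, Tuple

# (input $, output $) per 1M tokens, copied from the comparison table.
# None marks a price listed as N/A.
PRICES: Dict[str, Tuple[Optional[float], Optional[float]]] = {
    "watsonx/meta-llama/llama-3-2-3b-instruct": (0.150, 0.150),
    "vercel_ai_gateway/meta/llama-3.2-3b": (0.150, 0.150),
    "together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo": (None, None),
    "snowflake/llama3.2-3b": (None, None),
    "sambanova/Meta-Llama-3.2-3B-Instruct": (0.080, 0.160),
    "novita/meta-llama/llama-3.2-3b-instruct": (0.030, 0.050),
    "meta.llama3-2-3b-instruct-v1:0": (0.150, 0.150),
    "llamagate/llama-3.2-3b": (0.040, 0.080),
    "lambda_ai/llama3.2-3b-instruct": (0.015, 0.025),
    "hyperbolic/meta-llama/Llama-3.2-3B-Instruct": (0.120, 0.300),
    "fireworks_ai/accounts/fireworks/models/llama-v3p2-3b": (0.100, 0.100),
    "deepinfra/meta-llama/Llama-3.2-3B-Instruct": (0.020, 0.020),
}


def cheapest(input_tokens: int, output_tokens: int) -> Tuple[str, float]:
    """Return (model key, cost in $) with the lowest cost for this token mix."""
    costs = {}
    for key, (price_in, price_out) in PRICES.items():
        if price_in is None or price_out is None:
            continue  # no published price, cannot be compared
        costs[key] = (input_tokens * price_in + output_tokens * price_out) / 1_000_000
    best = min(costs, key=costs.get)
    return best, costs[best]


if __name__ == "__main__":
    # Hypothetical workloads: one input-heavy, one output-heavy.
    print(cheapest(1_000_000, 100_000))
    print(cheapest(100_000, 1_000_000))
```

With these prices, an input-heavy workload favors the Lambda key, while an output-heavy one favors the DeepInfra key, so the single "cheapest" label depends on usage.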
All Variants
All available versions, regions, and API endpoints for Llama 3.2 3B Instruct.
| Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| eu.meta.llama3-2-3b-instruct-v1:0 | | Text | 0.190 | 0.190 | 128K | 4K | no | yes |
| meta.llama3-2-3b-instruct-v1:0 | | Text | 0.150 | 0.150 | 128K | 4K | no | yes |
| us.meta.llama3-2-3b-instruct-v1:0 | | Text | 0.150 | 0.150 | 128K | 4K | no | yes |
| deepinfra/meta-llama/Llama-3.2-3B-Instruct | DeepInfra | Text | 0.020 | 0.020 | 131K | 131K | no | yes |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b | | Text | 0.100 | 0.100 | 131K | 131K | no | no |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | | Text | 0.100 | 0.100 | 16K | 16K | no | no |
| hyperbolic/meta-llama/Llama-3.2-3B-Instruct | | Text | 0.120 | 0.300 | 33K | 33K | no | yes |
| watsonx/meta-llama/llama-3-2-3b-instruct | | Text | 0.150 | 0.150 | 128K | 128K | no | yes |
| lambda_ai/llama3.2-3b-instruct | Lambda | Text | 0.015 | 0.025 | 131K | 131K | no | yes |
| llamagate/llama-3.2-3b | LlamaGate | Text | 0.040 | 0.080 | 131K | 8K | no | yes |
| novita/meta-llama/llama-3.2-3b-instruct | | Text | 0.030 | 0.050 | 33K | 32K | no | yes |
| sambanova/Meta-Llama-3.2-3B-Instruct | | Text | 0.080 | 0.160 | 4K | 4K | no | no |
| snowflake/llama3.2-3b | | Text | N/A | N/A | 128K | 8K | no | no |
| together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo | | Text | N/A | N/A | N/A | N/A | no | yes |
| vercel_ai_gateway/meta/llama-3.2-3b | | Text | 0.150 | 0.150 | 128K | 8K | no | yes |
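Context window, maximum output, and function-calling support vary widely between variants, so it can help to filter the list by requirements before comparing price. The following is a minimal sketch using values copied from the table above. The names `Variant`, `parse_tokens`, and `matching` are hypothetical, and reading "131K" as 131,000 tokens is an approximation, since the table rounds the exact limits.

```python
from typing import List, NamedTuple, Optional


class Variant(NamedTuple):
    key: str
    context: str      # as printed in the table, e.g. "131K" or "N/A"
    max_output: str
    functions: bool   # True where the Functions column says "yes"


# Rows copied from the All Variants table (price columns omitted).
VARIANTS = [
    Variant("eu.meta.llama3-2-3b-instruct-v1:0", "128K", "4K", True),
    Variant("meta.llama3-2-3b-instruct-v1:0", "128K", "4K", True),
    Variant("us.meta.llama3-2-3b-instruct-v1:0", "128K", "4K", True),
    Variant("deepinfra/meta-llama/Llama-3.2-3B-Instruct", "131K", "131K", True),
    Variant("fireworks_ai/accounts/fireworks/models/llama-v3p2-3b", "131K", "131K", False),
    Variant("fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct", "16K", "16K", False),
    Variant("hyperbolic/meta-llama/Llama-3.2-3B-Instruct", "33K", "33K", True),
    Variant("watsonx/meta-llama/llama-3-2-3b-instruct", "128K", "128K", True),
    Variant("lambda_ai/llama3.2-3b-instruct", "131K", "131K", True),
    Variant("llamagate/llama-3.2-3b", "131K", "8K", True),
    Variant("novita/meta-llama/llama-3.2-3b-instruct", "33K", "32K", True),
    Variant("sambanova/Meta-Llama-3.2-3B-Instruct", "4K", "4K", False),
    Variant("snowflake/llama3.2-3b", "128K", "8K", False),
    Variant("together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo", "N/A", "N/A", True),
    Variant("vercel_ai_gateway/meta/llama-3.2-3b", "128K", "8K", True),
]


def parse_tokens(text: str) -> Optional[int]:
    """Turn '131K' into an approximate token count; 'N/A' gives None."""
    if text == "N/A":
        return None
    if text.endswith("K"):
        return int(text[:-1]) * 1000
    return int(text)


def matching(min_context: int, min_output: int, need_functions: bool) -> List[str]:
    """Return model keys that meet the limits, in table order."""
    keys = []
    for v in VARIANTS:
        ctx, out = parse_tokens(v.context), parse_tokens(v.max_output)
        if ctx is None or out is None:
            continue  # unknown limits cannot be verified, so skip the row
        if ctx >= min_context and out >= min_output and (v.functions or not need_functions):
            keys.append(v.key)
    return keys


if __name__ == "__main__":
    # Hypothetical requirement: long context, long output, function calling.
    for key in matching(128_000, 100_000, need_functions=True):
        print(key)
```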