GPT-oss-120b
ReplicateText
GPT-oss-120b is a text model from Replicate. Pricing starts at $0.18 per million input tokens and $0.72 per million output tokens (cheapest at Lemonade).
Specifications
| Model Key | replicate/openai/gpt-oss-120b |
| Provider | Replicate |
| LiteLLM Provider | replicate |
| Mode | Text |
| Canonical Name | gpt-oss-120b |
| Context Window | N/A tokens |
| Max Output | N/A |
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✓ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | $0.000180 | $0.180 |
| Output Tokens | $0.000720 | $0.720 |
Price Comparison by Provider
Compare prices for GPT-oss-120b across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price | Output Price |
|---|---|---|---|
| Databricks | databricks/databricks-gpt-oss-120b | $0.150 | $0.600 |
| Azure AI | azure_ai/gpt-oss-120b | $0.150 | $0.600 |
| Cerebras | cerebras/gpt-oss-120b | $0.350 | $0.750 |
| Deepinfra | deepinfra/openai/gpt-oss-120b | $0.050 | $0.450 |
| Fireworks AI | fireworks_ai/accounts/fireworks/models/gpt-oss-120b | $0.150 | $0.600 |
| Groq | groq/openai/gpt-oss-120b | $0.150 | $0.600 |
| Novita | novita/openai/gpt-oss-120b | $0.050 | $0.250 |
| OpenRouter | openrouter/openai/gpt-oss-120b | $0.180 | $0.800 |
| Ovhcloud | ovhcloud/gpt-oss-120b | $0.080 | $0.400 |
| Replicate | replicate/openai/gpt-oss-120b | $0.180 | $0.720 |
| SambaNova | sambanova/gpt-oss-120b | $3.00 | $4.50 |
| Together AI | together_ai/openai/gpt-oss-120b | $0.150 | $0.600 |
| Wandb | wandb/openai/gpt-oss-120b | $0.015 | $0.060 |
| Watsonx | watsonx/openai/gpt-oss-120b | $0.150 | $0.600 |
| AWS Bedrock | openai.gpt-oss-120b-1:0 | $0.150 | $0.600 |
| OpenAI | vertex_ai/openai/gpt-oss-120b-maas | $0.150 | $0.600 |
| Lemonade | lemonade/gpt-oss-120b-mxfp-GGUF | N/A | N/A |
| Ollama | ollama/gpt-oss:120b-cloud | N/A | N/A |
Similar Models
Models with similar capabilities and context window size.
Model | Provider | Mode | Input Price | Output Price | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| DeepSeek R1 Distill Llama 8B | Nscale | Text | $0.025 | $0.025 | N/A | N/A | no | no |
| DeepSeek R1 Distill Qwen 14B | Nscale | Text | $0.070 | $0.070 | N/A | N/A | no | no |
| GPT-5 nano | Replicate | Text | $0.050 | $0.400 | N/A | N/A | no | yes |
| Granite 3.3 8B Instruct | Replicate | Text | $0.030 | $0.250 | N/A | N/A | no | yes |
| Llama 3.1 8B Instruct | Nscale | Text | $0.030 | $0.030 | N/A | N/A | no | no |
| Llama 3.3 70B Instruct Turbo Free | Together AI | Text | N/A | N/A | N/A | N/A | no | yes |
| Qwen2.5 Coder 32B Instruct | Nscale | Text | $0.060 | $0.200 | N/A | N/A | no | no |
| Qwen2.5 Coder 3B Instruct | Nscale | Text | $0.010 | $0.030 | N/A | N/A | no | no |
| Qwen2.5 Coder 7B Instruct | Nscale | Text | $0.010 | $0.030 | N/A | N/A | no | no |
| Titan Embed Text V2 | Vercel Ai Gateway | Text | $0.020 | N/A | N/A | N/A | no | no |