GPT-oss-120b

GPT-oss-120b is a text model from Baseten. Pricing starts at 0.10 per million input tokens and 0.50 per million output tokens (cheapest at Lemonade (AMD)).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keybaseten/openai/gpt-oss-120b
ProviderBaseten
Provider IDbaseten
ModeText
Canonical Namegpt-oss-120b
Context WindowN/A tokens
Max OutputN/A

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0001000.100
Output Tokens0.0005000.500

Benchmarks

Intelligence Index33.3#39
Coding Index28.6#45
Math Index93.4#10
MMLU-Pro0.8#45
GPQA0.8#41
HLE0.2#30
LiveCodeBench0.9#5
IFBench0.7#27
Time to First Token0.54s#123
SciCode0.4#42
AIME 20250.9#10
LCR0.5#50
TerminalBench Hard0.2#44
TAU20.7#45

Price Comparison by Provider

Compare prices for GPT-oss-120b across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
IBM watsonx logoIBM watsonxwatsonx/openai/gpt-oss-120b0.1500.600
Weights & Biases logoWeights & Biaseswandb/openai/gpt-oss-120b0.0150.060
Vertex AI (OpenAI) logoVertex AI (OpenAI)vertex_ai/openai/gpt-oss-120b-maas0.1500.600
Together AI logoTogether AItogether_ai/openai/gpt-oss-120b0.1500.600
SambaNova logoSambaNovasambanova/gpt-oss-120b3.004.50
Replicate logoReplicatereplicate/openai/gpt-oss-120b0.1800.720
OVHcloud logoOVHcloudovhcloud/gpt-oss-120b0.0800.400
OpenRouter logoOpenRouteropenrouter/openai/gpt-oss-120b0.1800.800
AWS Bedrock logoAWS Bedrockopenai.gpt-oss-120b-1:00.1500.600
Ollama logoOllamaollama/gpt-oss:120b-cloudN/AN/A
Novita AI logoNovita AInovita/openai/gpt-oss-120b0.0500.250
Lemonade (AMD) logoLemonade (AMD)lemonade/gpt-oss-120b-mxfp-GGUFN/AN/A
Groq logoGroqgroq/openai/gpt-oss-120b0.1500.600
Fireworks AI logoFireworks AIfireworks_ai/accounts/fireworks/models/gpt-oss-120b0.1500.600
DeepInfra logoDeepInfradeepinfra/openai/gpt-oss-120b0.0500.450
Databricks logoDatabricksdatabricks/databricks-gpt-oss-120b0.1500.600
Cerebras logoCerebrascerebras/gpt-oss-120b0.3500.750
AWS Bedrock logoAWS Bedrockbedrock_mantle/openai.gpt-oss-120b0.1500.600
Basetenbaseten/openai/gpt-oss-120b0.1000.500
Azure AI logoAzure AIazure_ai/gpt-oss-120b0.1500.600

All Variants

All available versions, regions, and API endpoints for GPT-oss-120b.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
bedrock_mantle/openai.gpt-oss-120bAWS Bedrock logoAWS BedrockText0.1500.600131K33Knoyes
openai.gpt-oss-120b-1:0AWS Bedrock logoAWS BedrockText0.1500.600128K128Knoyes
azure_ai/gpt-oss-120bAzure AI logoAzure AIText0.1500.600131K131Knoyes
baseten/openai/gpt-oss-120bBasetenText0.1000.500N/AN/Anono
cerebras/gpt-oss-120bCerebras logoCerebrasText0.3500.750131K33Knoyes
databricks/databricks-gpt-oss-120bDatabricks logoDatabricksText0.1500.600131K131Knono
deepinfra/openai/gpt-oss-120bDeepInfra logoDeepInfraText0.0500.450131K131Knoyes
fireworks_ai/accounts/fireworks/models/gpt-oss-120bFireworks AI logoFireworks AIText0.1500.600131K131Knoyes
groq/openai/gpt-oss-120bGroq logoGroqText0.1500.600131K33Knoyes
watsonx/openai/gpt-oss-120bIBM watsonx logoIBM watsonxText0.1500.6008K8Knono
lemonade/gpt-oss-120b-mxfp-GGUFLemonade (AMD) logoLemonade (AMD)TextN/AN/A131K33Knoyes
novita/openai/gpt-oss-120bNovita AI logoNovita AIText0.0500.250131K33Kyesyes
ollama/gpt-oss:120b-cloudOllama logoOllamaTextN/AN/A131K131Knoyes
openrouter/openai/gpt-oss-120bOpenRouter logoOpenRouterText0.1800.800131K33Knoyes
ovhcloud/gpt-oss-120bOVHcloud logoOVHcloudText0.0800.400131K131Knono
replicate/openai/gpt-oss-120bReplicate logoReplicateText0.1800.720N/AN/Anoyes
sambanova/gpt-oss-120bSambaNova logoSambaNovaText3.004.50131K131Knoyes
together_ai/openai/gpt-oss-120bTogether AI logoTogether AIText0.1500.600131K131Knoyes
vertex_ai/openai/gpt-oss-120b-maasVertex AI (OpenAI) logoVertex AI (OpenAI)Text0.1500.600131K33Knono
wandb/openai/gpt-oss-120bWeights & Biases logoWeights & BiasesText0.0150.060131K131Knono