Qwen3 32B

SambaNovaText

Qwen3 32B is a text model from SambaNova with a context window of 8K tokens and max output of 8K tokens. Pricing starts at $0.40 per million input tokens and $0.80 per million output tokens (cheapest at Lambda Ai).

Specifications

Model Keysambanova/Qwen3-32B
ProviderSambaNova
LiteLLM Providersambanova
ModeText
Canonical Nameqwen-3-32b
Context Window8K tokens
Max Output8K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000400$0.400
Output Tokens$0.000800$0.800

Price Comparison by Provider

Compare prices for Qwen3 32B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Gradient Aigradient_ai/alibaba-qwen3-32bN/AN/A
Cerebrascerebras/qwen-3-32b$0.400$0.800
Vercel Ai Gatewayvercel_ai_gateway/alibaba/qwen-3-32b$0.100$0.300
Deepinfradeepinfra/Qwen/Qwen3-32B$0.100$0.280
Fireworks AIfireworks_ai/accounts/fireworks/models/qwen3-32b$0.900$0.900
Groqgroq/qwen/qwen3-32b$0.290$0.590
Ovhcloudovhcloud/Qwen3-32B$0.080$0.230
SambaNovasambanova/Qwen3-32B$0.400$0.800
Lambda Ailambda_ai/qwen3-32b-fp8$0.050$0.100
Novitanovita/qwen/qwen3-32b-fp8$0.100$0.450
AWS Bedrockqwen.qwen3-32b-v1:0$0.150$0.600

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
ALIA 40B Instruct Q8 0PublicaiTextN/AN/A8K4Knoyes
Apertus 70B InstructPublicaiTextN/AN/A8K4Knoyes
Apertus 8B InstructPublicaiTextN/AN/A8K4Knoyes
Gemma SEA LION V4 27B ITPublicaiTextN/AN/A8K4Knoyes
Llama 3 8B Instruct:freeOpenRouterTextN/AN/A8KN/Anono
Llama3OllamaTextN/AN/A8K8Knono
Llama3:70BOllamaTextN/AN/A8K8Knono
Llama3.1OllamaTextN/AN/A8K8Knoyes
Mistral 7B Instruct:freeOpenRouterTextN/AN/A8KN/Anono
Sarvam MSarvamTextN/AN/A8K32Knono