Qwen3 Reranker 8B

Qwen3 Reranker 8B is a rerank model from Novita AI with a context window of 33K tokens and max output of 4K tokens. Pricing starts at 0.05 per million input tokens and 0.05 per million output tokens (cheapest at Fireworks AI).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynovita/qwen/qwen3-reranker-8b
ProviderNovita AI
Provider IDnovita
ModeRerank
Canonical Nameqwen-3-8b
Context Window33K tokens
Max Output4K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0000500.050
Output Tokens0.0000500.050

Benchmarks

Intelligence Index10.6#179
Coding Index7.1#150
Math Index24.3#92
MMLU-Pro0.6#135
GPQA0.5#152
HLE0.0#220
LiveCodeBench0.2#144
AIME0.2#57
IFBench0.3#145
Time to First Token0.98s#165
SciCode0.2#175
MATH-5000.8#63
AIME 20250.2#92
LCR0.0#151
TerminalBench Hard0.0#124
TAU20.2#105

Price Comparison by Provider

Compare prices for Qwen3 Reranker 8B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Novita AInovita/qwen/qwen3-8b-fp80.0350.138
LlamaGatellamagate/qwen3-8b0.0400.140
Fireworks AIfireworks_ai/accounts/fireworks/models/qwen3-reranker-8bN/AN/A

All Variants

All available versions, regions, and API endpoints for Qwen3 Reranker 8B.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
fireworks_ai/accounts/fireworks/models/qwen3-8bFireworks AIText0.2000.20041K41Knono
fireworks_ai/accounts/fireworks/models/qwen3-reranker-8bFireworks AIRerankN/AN/A41K41Knono
llamagate/qwen3-8bLlamaGateText0.0400.14033K8Knoyes
novita/qwen/qwen3-8b-fp8Novita AIText0.0350.138128K20Knono
novita/qwen/qwen3-embedding-8bNovita AIEmbedding0.070N/A33K4Knono
novita/qwen/qwen3-reranker-8bNovita AIRerank0.0500.05033K4Knono