Llama3.2 NV RerankQA 2 1B is a 1B-parameter reranking model fine-tuned by NVIDIA from Llama 3.2 for multilingual and cross-lingual question-answering retrieval, with long-context support.
Specifications
Canonical ID: nvidia-llama3-2-nv-rerankqa-2-1b
Type: Reranking
Status: Active
Creator: NVIDIA
Input Modalities: Text
Parameters: 1B

Capabilities

Input (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, PDF
Output (0/5 supported)
Not supported: Text, Image, Audio, Video, Embedding
Capabilities (0/13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Llama 3.3 70B Instruct | - | 131K | $0.100 | $0.200 | Available
Llama 3.2 3B Instruct | - | 131K | $0.015 | $0.020 | Deprecated
Llama 3.2 1B Instruct | - | 131K | $0.027 | $0.080 | Deprecated
Llama 3.1 405B Instruct | - | 131K | $0.120 | $0.300 | Deprecating
Llama 3.1 70B Instruct | - | 131K | $0.100 | $0.100 | Available
Llama 3.1 8B Instruct | - | 200K | $0.020 | $0.030 | Available
Llama 3.1 70B | - | 128K | $0.600 | $0.600 | Available
Llama 3.1 8B | - | 131K | $0.030 | $0.050 | Available
Llama 3 70B Instruct | - | 131K | $0.120 | $0.300 | Available
Llama 3 8B Instruct | - | 32K | $0.030 | $0.040 | Available
Llama3.2 NV RerankQA 2 1B | - | - | - | - | Current

Model IDs