Allen AI logo

Molmo 7B-D


Molmo 7B-D is Allen AI's language model. A 7B open vision-language model from the Allen Institute for AI, featuring strong visual grounding and pointing capabilities in a mid-size architecture.
Specifications
Canonical IDallenai-molmo-7b-d
TypeLanguage
StatusActive
CreatorAllen AIAllen AI
Input ModalitiesText
Output ModalitiesText
Parameters7B
Benchmarks
Intelligence Index
9.2
#401
Coding Index
1.2
#372
Math Index
0.0
#258
MMLU-Pro
0.4
#304
GPQA
0.2
#431
HLE
0.1
#262
LiveCodeBench
0.0
#316
IFBench
0.2
#379
Time to First Token
SciCode
0.0
#420
AIME 2025
0.0
#258
LCR
0.0
#340
TerminalBench Hard
0.0
#336
TAU2
0.0
#356
Output TPS
0.0
#286

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Model IDs