Name: Rolm OCR
Brand: Rolm

Rolm OCR is Rolm's image to text model with a 128K context window, starting at $0.2 / 1M input and $0.2 / 1M output. An open-source document OCR model built on Qwen2.5-VL-7B-Instruct by Reducto AI, offering faster performance and reduced memory usage as a drop-in alternative to olmOCR.

Specifications
Canonical ID	`rolm-ocr`
Type	Image to Text
Status	Active
Creator	Rolm
Providers	Fireworks AI
Context Window	128K tokens
Input Modalities	Text
Output Modalities	Text

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

US Dollar ($)

Per 1M tokens

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M
Fireworks AI `fireworks_ai/accounts/fireworks/models/rolm-ocr`	$0.2	$0.2

Cost Calculator

US Dollar ($)

Preset:

Input tokens

Output tokens

Number of calls

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Voyage Multimodal 3.5	—	—	—	—	Available
Qwen2.5 VL 72B Instruct	2025-02-01	131K	$0.130	$0.400	Available
Rolm OCR	—	128K	$0.200	$0.200	Current
Qwen2.5 VL 32B Instruct	—	128K	$0.200	$0.600	Available
Qwen2.5 VL 3B Instruct	—	128K	$0.200	$0.200	Available
Qwen2.5 VL 7B Instruct	—	128K	$0.200	$0.200	Available

Model IDs

accounts/fireworks/models/rolm-ocr

fireworks_ai/accounts/fireworks/models/rolm-ocr

rolm-ocr

Rolm OCR

CapabilitiesAPIGET/api/v1/models/rolm-ocr

Pricing by ProviderAPIGET/api/v1/models/rolm-ocr/pricing

Cost CalculatorAPIGET/api/v1/models/rolm-ocr/pricing/calculate?input_tokens=1000000&output_tokens=500000

VersionsAPIGET/api/v1/models?family=qwen2_5_vl

Model IDsAPIGET/api/v1/models/rolm-ocr

Capabilities

Pricing by Provider

Cost Calculator

Versions

Model IDs