Qianfan OCR Fast is Baidu's image to text model with a 66K context window and up to 29K output tokens. A domain-specific multimodal OCR model on Baidu's Qianfan platform, purpose-built for fast and accurate text recognition from images.
Specifications
Canonical IDbaidu-ocr-fast
TypeImage to Text
StatusDeprecated
CreatorBaiduBaidu
Providers
Context Window66K tokens
Max Output29K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 2 months ago
Deprecation Date

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
baidu/qianfan-ocr-fast
$0.68$2.81

Cost Calculator

US Dollar ($)
Preset:

Other Models

ModelTierReleasedContextInput / 1MOutput / 1M
Mistral OCR 4
Mistral OCR 4 Annot
DeepSeek-OCR 2
DeepSeek-OCR
Document OCR
Mistral OCR$2.00$3.00
OCR
Prebuilt Document
Prebuilt Layout
Prebuilt Read

Model IDs

baidu-ocr-fast
baidu/qianfan-ocr-fast