Qianfan OCR Fast is Baidu's image to text model with a 66K context window and up to 29K output tokens, starting at $0.680 / 1M input and $2.81 / 1M output. A domain-specific multimodal OCR model on Baidu's Qianfan platform, purpose-built for fast and accurate text recognition from images.
Specifications
Canonical IDbaidu-ocr-fast
TypeImage to Text
StatusDeprecating
CreatorBaiduBaidu
Providers
Context Window66K tokens
Max Output29K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 1 month ago
Deprecation Date

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
baidu/qianfan-ocr-fast
$0.680$2.81

Cost Calculator

Preset:

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
DeepSeek-OCR 2
DeepSeek-OCR8K$0.030$0.030
DeepSeek-OCR
Document OCR
Mistral OCR$2.00$3.00
OCR
Prebuilt Document
Prebuilt Layout
Prebuilt Read

Model IDs