Qianfan OCR Fast is Baidu's image to text model with a 66K context window and up to 29K output tokens. A domain-specific multimodal OCR model on Baidu's Qianfan platform, purpose-built for fast and accurate text recognition from images.
| Specifications | |
|---|---|
baidu-ocr-fast | |
| Image to Text | |
| Deprecated | |
| 66K tokens | |
| 29K tokens | |
| ImageText | |
| Text | |
| default | |
| · 2 months ago | |
Capabilities
Input2/5
Text✓
Image✓
Audio·
Video·
PDF·
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning✓
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·
Pricing by Provider
US Dollar ($)
Per 1M tokens
| Provider | Standard | |
|---|---|---|
| Input $ / 1M | Output $ / 1M | |
| $0.68 | $2.81 | |
Cost Calculator
US Dollar ($)
Preset:
Other Models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| Mistral OCR 4 | — | — | — | — | — |
| Mistral OCR 4 Annot | — | — | — | — | — |
| DeepSeek-OCR 2 | — | — | — | — | — |
| DeepSeek-OCR | — | — | — | — | — |
| Document OCR | — | — | — | — | — |
| Mistral OCR | — | — | — | $2.00 | $3.00 |
| OCR | — | — | — | — | — |
| Prebuilt Document | — | — | — | — | — |
| Prebuilt Layout | — | — | — | — | — |
| Prebuilt Read | — | — | — | — | — |