GPT-4o Transcribe Audio is OpenAI's language model, starting at $N/A / 1M input and $N/A / 1M output. The audio-input endpoint of GPT-4o Transcribe for submitting audio data to the speech-to-text pipeline.
Capabilities
Input1/5
✓
·
·
·
·
Output1/5
✓
·
·
·
·
Capabilities0/13
·
·
·
·
·
·
·
·
·
·
·
·
·
Pricing by Provider
| Provider | Standard |
|---|---|
| Audio In $ / 1M | |
Azure AI Foundry | $6.00 |
Provider-specific pricing that varies by region.
Azure AI Foundry
13 regions
| Region | Standard |
|---|---|
| Audio In $ / 1M | |
| Global | |
| Global | $6.00 |
| Europe | |
| France Central (Paris)/ francecentral | $6.60 |
| Germany West Central (Frankfurt)/ germanywestcentral | $6.60 |
| Poland Central (Warsaw)/ polandcentral | $6.60 |
| Spain Central (Madrid)/ spaincentral | $6.60 |
| Sweden Central (Gävle)/ swedencentral | $7.26 |
| West Europe (Netherlands)/ westeurope | $6.60 |
| US | |
| East US (Virginia)/ eastus | $6.60 |
| East US 2 (Virginia)/ eastus2 | $6.60 |
| North Central US (Illinois)/ northcentralus | $6.60 |
| South Central US (Texas)/ southcentralus | $6.60 |
| West US (California)/ westus | $6.60 |
| West US 3 (Phoenix)/ westus3 | $6.60 |
Cost Calculator
Preset:
Compares every provider & tier in USD
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| GPT-5.5 | 1.1M | $5.00 | $30.00 | Available | |
| GPT-5.4 Mini | 1.1M | $0.750 | $4.50 | Available | |
| GPT-5.4 Nano | 1.1M | $0.200 | $1.25 | Available | |
| GPT-5.4 | 1.1M | $2.50 | $15.00 | Available | |
| GPT-5.3 Codex | 400K | $1.75 | $14.00 | Available | |
| GPT-5.2 Codex | 400K | $1.75 | $14.00 | Available | |
| GPT-5.2 | 410K | $1.75 | $14.00 | Available | |
| GPT-5.1 | 410K | $1.25 | $10.00 | Available | |
| GPT-5.1 Codex | 400K | $1.25 | $10.00 | Available | |
| GPT-5.1 Codex Mini | 400K | $0.250 | $2.00 | Available | |
| GPT-4o Transcribe Audio | — | — | $0.000 | $0.000 | Current |