Deepgram logo

Base Video


Base Video is Deepgram logoDeepgram's speech to text model, starting at $N/A / 1M input and $N/A / 1M output. A Deepgram ASR model optimized for transcribing video content at base-tier accuracy.
Spec
Canonical IDdeepgram-base-video
TypeSpeech to Text
StatusActive
CreatorDeepgramDeepgram
Providers
Input ModalitiesAudio
Output ModalitiesText

Capabilities

Input1/5
Text·
Image·
Audio
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Audio In
$ / 1M
Deepgram logo
Deepgram
base-video
$0.000208

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Base Video$0.000$0.000Current
Base ConversationalAIAvailable
Base FinanceAvailable
Base GeneralAvailable
Base MeetingAvailable
Base Phonecall$0.000$0.000Available
Base Voicemail$0.000$0.000Available
Deepgram Base$0.000$0.000Available
Deepgram ConversationalAI$0.000$0.000Available
Deepgram Enhanced$0.000$0.000Available
Deepgram Finance$0.000$0.000Available

Model IDs