Universal Sentence Encoder CMLM Base


Universal Sentence Encoder CMLM Base is an embedding model from Google: a base-size Universal Sentence Encoder trained with Conditional Masked Language Modeling (CMLM) that produces sentence embeddings for semantic similarity and retrieval tasks.
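As an illustration of how sentence embeddings are typically used for semantic similarity and retrieval, the sketch below ranks candidate vectors by cosine similarity to a query vector. It does not call the model: the 4-dimensional vectors are made-up placeholders standing in for real embeddings, and the helper names `cosine_similarity` and `rank_by_similarity` are hypothetical, not part of any Google API.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here: a zero vector matches nothing
    return dot / (norm_a * norm_b)


def rank_by_similarity(
    query: Sequence[float], candidates: Sequence[Sequence[float]]
) -> List[Tuple[int, float]]:
    """Return (candidate index, score) pairs, most similar first."""
    scored = [(i, cosine_similarity(query, c)) for i, c in enumerate(candidates)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Placeholder vectors (assumption): real embeddings would come from the model
# and have far more dimensions.
query_vec = [0.1, 0.3, 0.5, 0.1]
doc_vecs = [
    [0.1, 0.3, 0.5, 0.1],  # identical to the query
    [0.9, 0.0, 0.0, 0.1],  # points in a very different direction
    [0.2, 0.2, 0.4, 0.2],  # close to the query
]

ranking = rank_by_similarity(query_vec, doc_vecs)
for index, score in ranking:
    print(f"doc {index}: {score:.3f}")
```

In a retrieval setting the candidate vectors would usually be computed once and stored, so that only the query needs to be embedded at search time.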
Spec
Canonical ID: google-universal-sentence-encoder-cmlm-1-base
Type: Embedding
Status: Active
Creator: Google
Input Modalities: Text
Output Modalities: Embedding

Capabilities

Input: 1/5
Supported: Text
Not supported: Image, Audio, Video, PDF

Output: 1/5
Supported: Embedding
Not supported: Text, Image, Audio, Video

Capabilities: 0/13
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Universal Sentence Encoder CMLM Base | | | | | Current
Universal Sentence Encoder CMLM Large | | | | | Available
Universal Sentence Encoder CMLM Base | | | | | Available
Universal Sentence Encoder CMLM Large | | | | | Available

Model IDs