Google: Gemini Embedding 2 Preview

google/gemini-embedding-2-preview

Gemini Embedding 2 Preview is Google's first multimodal embedding model. We currently support mapping text and images into a unified vector space for semantic search and retrieval-augmented generation (RAG). It accepts up to 8,192 input tokens and produces output vectors of 128 to 3,072 dimensions (recommended: 768, 1,536, or 3,072). The model is designed for cross-modal similarity: you can embed a text query and retrieve the most relevant images, or vice versa. This makes it well suited to multimodal search, recommendation, and document understanding pipelines.
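As a rough illustration of cross-modal retrieval in a shared vector space, the sketch below ranks image vectors against a text query vector by cosine similarity. It makes no API calls: the toy 4-dimensional vectors, the file names, and the helper names `cosine_similarity` and `rank_by_similarity` are all hypothetical stand-ins for embeddings you would obtain from the model (which would have 128 to 3,072 dimensions).

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, candidates):
    """Sort (name, score) pairs from most to least similar to the query."""
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical data: toy vectors standing in for real embeddings.
text_query = [0.9, 0.1, 0.0, 0.2]
image_vectors = {
    "beach.jpg": [0.8, 0.2, 0.1, 0.1],
    "invoice.png": [0.0, 0.9, 0.4, 0.0],
    "forest.jpg": [0.3, 0.1, 0.9, 0.2],
}

ranking = rank_by_similarity(text_query, image_vectors)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because text and image vectors live in the same space, the same ranking function works in the other direction (an image vector as the query over a set of text vectors). At larger scale, a vector index would typically replace this linear scan.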

Modalities

Price

$0.20

Context

8K

Weekly Rank

#184 on OpenRouter
