Data residency

Data stored at rest in the customer selected location remains at rest in that location, independent of the Generative AI on Vertex AI endpoint called by that customer's request.

ML processing

Machine learning (ML) processing for Generative AI on Vertex AI services occurs within the specific region or multi-region where the request is made.

For any regional endpoint not explicitly listed in the following tables, such as those in the Middle East, there is no guarantee that ML processing occurs at a specific location. These endpoints support older models that don't offer ML processing guarantees.

Google Cloud model support

Multi-region

Model US multi-region EU multi-region
Gemini 2.5 Flash, 128k(gemini-2.5-flash)
Gemini 2.5 Flash, 1M(gemini-2.5-flash)
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.5 Pro(gemini-2.5-pro)
Tuning for Gemini 2.5 Flash(gemini-2.5-flash)
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Tuning for Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.0 Flash(gemini-2.0-flash-001)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001)
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Chirp 2: Transcription(chirp_2)
Chirp 3: Transcription(chirp_3)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice
Imagen 2(imagegeneration@005)
Embeddings for Multimodal
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)

Canada

Model Montréal(northamerica-northeast1)
Gemini 2.5 Flash, 128k(gemini-2.5-flash)
Gemini 2.5 Flash, 1M(gemini-2.5-flash)
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.5 Pro(gemini-2.5-pro)
Tuning for Gemini 2.5 Flash(gemini-2.5-flash)
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Tuning for Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.0 Flash(gemini-2.0-flash-001)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001)
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Chirp 2: Transcription(chirp_2)
Chirp 3: Transcription(chirp_3)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice
Imagen 2(imagegeneration@005)
Embeddings for Multimodal
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)

Europe

Model Paris(europe-west9) London(europe-west2) Frankfurt(europe-west3) Netherlands(europe-west4)
Gemini 2.5 Flash, 128k(gemini-2.5-flash)
Gemini 2.5 Flash, 1M(gemini-2.5-flash)
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.5 Pro(gemini-2.5-pro)
Tuning for Gemini 2.5 Flash(gemini-2.5-flash)
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Tuning for Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.0 Flash(gemini-2.0-flash-001)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001)
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Chirp 2: Transcription(chirp_2)
Chirp 3: Transcription(chirp_3)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice
Imagen 2(imagegeneration@005)
Embeddings for Multimodal
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)

Asia Pacific

Model Tokyo(asia-northeast1) Sydney(australia-southeast1) Mumbai(asia-south1) Singapore(asia-southeast1) Seoul(asia-northeast3)
Gemini 2.5 Flash, 128k(gemini-2.5-flash)
Gemini 2.5 Flash, 1M(gemini-2.5-flash)
Gemini 2.5 Flash Image(gemini-2.5-flash-image)
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Gemini 2.5 Pro(gemini-2.5-pro)
Tuning for Gemini 2.5 Flash(gemini-2.5-flash)
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite)
Tuning for Gemini 2.5 Pro(gemini-2.5-pro)
Gemini 2.0 Flash(gemini-2.0-flash-001)
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001)
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001)
Gemini Embeddings(gemini-embedding-001)
Chirp 2: Transcription(chirp_2)
Chirp 3: Transcription(chirp_3)
Chirp 3: HD Voices
Chirp 3: Instant Custom Voice
Imagen 2(imagegeneration@005)
Embeddings for Multimodal
Embeddings for Text(text-embedding-004)
Embeddings for Text(text-embedding-005)
Embeddings for Text(text-multilingual-embedding-002)

Google Cloud partner model support

Multi-region

Model US multi-region EU multi-region
Anthropic's Claude Haiku 4.5
Anthropic's Claude Opus 4
Anthropic's Claude Opus 4.1
Anthropic's Claude Sonnet 4
Anthropic's Claude Sonnet 4.5
Anthropic's Claude 3.5 Haiku
Anthropic's Claude 3 Haiku
Anthropic's Claude 3.7 Sonnet (deprecated)
Codestral (24.05)
Codestral 2
Mistral Large (24.07)
Mistral Medium 3
Mistral OCR (25.05)
Mistral Small 3.1 (25.03)

Europe

Model Belgium(europe-west1) Netherlands(europe-west4)
Anthropic's Claude Haiku 4.5
Anthropic's Claude Opus 4
Anthropic's Claude Opus 4.1
Anthropic's Claude Sonnet 4
Anthropic's Claude Sonnet 4.5
Anthropic's Claude 3.5 Haiku
Anthropic's Claude 3 Haiku
Anthropic's Claude 3.7 Sonnet (deprecated)
Codestral (24.05)
Codestral 2
Mistral Large (24.07)
Mistral Medium 3
Mistral OCR (25.05)
Mistral Small 3.1 (25.03)

Asia Pacific

Model Singapore(asia-southeast1) Taiwan(asia-east1)
Anthropic's Claude Haiku 4.5
Anthropic's Claude Opus 4
Anthropic's Claude Opus 4.1
Anthropic's Claude Sonnet 4
Anthropic's Claude Sonnet 4.5
Anthropic's Claude 3.5 Haiku
Anthropic's Claude 3 Haiku
Anthropic's Claude 3.7 Sonnet (deprecated)
Codestral (24.05)
Codestral 2
Mistral Large (24.07)
Mistral Medium 3
Mistral OCR (25.05)
Mistral Small 3.1 (25.03)

Google Cloud open model support

Multi-region

Model US multi-region EU multi-region
DeepSeek-OCR
DeepSeek R1 (0528)
DeepSeek-V3.1
gpt-oss 120B
gpt-oss 20B
Llama 3.1 70B (Preview)
Llama 3.1 8B (Preview)
Llama 3.2 90B (Preview)
Llama 3.3 70B (Preview)
Llama 3.1 405B
Llama 4 Maverick 17B-128E (Preview)
Llama 4 Scout 17B-16E (Preview)
MiniMax M2
Multilingual E5 Large
Multilingual E5 Small
Qwen3 235B
Qwen3 Coder
Qwen3-Next-80B Instruct
Qwen3-Next-80B Thinking

Europe

Model Belgium(europe-west1) Netherlands(europe-west4)
DeepSeek-OCR
DeepSeek R1 (0528)
DeepSeek-V3.1
gpt-oss 120B
gpt-oss 20B
Llama 3.1 70B (Preview)
Llama 3.1 8B (Preview)
Llama 3.2 90B (Preview)
Llama 3.3 70B (Preview)
Llama 3.1 405B
Llama 4 Maverick 17B-128E (Preview)
Llama 4 Scout 17B-16E (Preview)
MiniMax M2
Multilingual E5 Large
Multilingual E5 Small
Qwen3 235B
Qwen3 Coder
Qwen3-Next-80B Instruct
Qwen3-Next-80B Thinking

Asia Pacific

Model Singapore(asia-southeast1) Taiwan(asia-east1)
DeepSeek-OCR
DeepSeek R1 (0528)
DeepSeek-V3.1
gpt-oss 120B
gpt-oss 20B
Llama 3.1 70B (Preview)
Llama 3.1 8B (Preview)
Llama 3.2 90B (Preview)
Llama 3.3 70B (Preview)
Llama 3.1 405B
Llama 4 Maverick 17B-128E (Preview)
Llama 4 Scout 17B-16E (Preview)
MiniMax M2
Multilingual E5 Large
Multilingual E5 Small
Qwen3 235B
Qwen3 Coder
Qwen3-Next-80B Instruct
Qwen3-Next-80B Thinking

What's next