Data stored at rest in the customer selected location remains at rest in that location, independent of the Generative AI on Vertex AI endpoint called by that customer's request.
ML processing
Machine learning (ML) processing for Generative AI on Vertex AI services occurs within the specific region or multi-region where the request is made.
For any regional endpoint not explicitly listed in the following tables, such as those in the Middle East, there is no guarantee that ML processing occurs at a specific location. These endpoints support older models that don't offer ML processing guarantees.
Google Cloud model support
Multi-region
| Model | US multi-region | EU multi-region |
|---|---|---|
Gemini 2.5 Flash, 128k(gemini-2.5-flash) | ||
Gemini 2.5 Flash, 1M(gemini-2.5-flash) | ||
Gemini 2.5 Flash Image(gemini-2.5-flash-image) | ||
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | ||
Gemini 2.5 Pro(gemini-2.5-pro) | ||
Tuning for Gemini 2.5 Flash(gemini-2.5-flash) | ||
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | ||
Tuning for Gemini 2.5 Pro(gemini-2.5-pro) | ||
Gemini 2.0 Flash(gemini-2.0-flash-001) | ||
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | ||
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001) | ||
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | ||
Gemini Embeddings(gemini-embedding-001) | ||
Chirp 2: Transcription(chirp_2) | ||
Chirp 3: Transcription(chirp_3) | ||
| Chirp 3: HD Voices | ||
| Chirp 3: Instant Custom Voice | ||
Imagen 2(imagegeneration@005) | ||
| Embeddings for Multimodal | ||
Embeddings for Text(text-embedding-004) | ||
Embeddings for Text(text-embedding-005) | ||
Embeddings for Text(text-multilingual-embedding-002) |
Canada
| Model | Montréal(northamerica-northeast1) |
|---|---|
Gemini 2.5 Flash, 128k(gemini-2.5-flash) | |
Gemini 2.5 Flash, 1M(gemini-2.5-flash) | |
Gemini 2.5 Flash Image(gemini-2.5-flash-image) | |
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | |
Gemini 2.5 Pro(gemini-2.5-pro) | |
Tuning for Gemini 2.5 Flash(gemini-2.5-flash) | |
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | |
Tuning for Gemini 2.5 Pro(gemini-2.5-pro) | |
Gemini 2.0 Flash(gemini-2.0-flash-001) | |
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | |
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001) | |
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | |
Gemini Embeddings(gemini-embedding-001) | |
Chirp 2: Transcription(chirp_2) | |
Chirp 3: Transcription(chirp_3) | |
| Chirp 3: HD Voices | |
| Chirp 3: Instant Custom Voice | |
Imagen 2(imagegeneration@005) | |
| Embeddings for Multimodal | |
Embeddings for Text(text-embedding-004) | |
Embeddings for Text(text-embedding-005) | |
Embeddings for Text(text-multilingual-embedding-002) |
Europe
| Model | Paris(europe-west9) | London(europe-west2) | Frankfurt(europe-west3) | Netherlands(europe-west4) |
|---|---|---|---|---|
Gemini 2.5 Flash, 128k(gemini-2.5-flash) | ||||
Gemini 2.5 Flash, 1M(gemini-2.5-flash) | ||||
Gemini 2.5 Flash Image(gemini-2.5-flash-image) | ||||
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | ||||
Gemini 2.5 Pro(gemini-2.5-pro) | ||||
Tuning for Gemini 2.5 Flash(gemini-2.5-flash) | ||||
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | ||||
Tuning for Gemini 2.5 Pro(gemini-2.5-pro) | ||||
Gemini 2.0 Flash(gemini-2.0-flash-001) | ||||
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | ||||
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001) | ||||
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | ||||
Gemini Embeddings(gemini-embedding-001) | ||||
Chirp 2: Transcription(chirp_2) | ||||
Chirp 3: Transcription(chirp_3) | ||||
| Chirp 3: HD Voices | ||||
| Chirp 3: Instant Custom Voice | ||||
Imagen 2(imagegeneration@005) | ||||
| Embeddings for Multimodal | ||||
Embeddings for Text(text-embedding-004) | ||||
Embeddings for Text(text-embedding-005) | ||||
Embeddings for Text(text-multilingual-embedding-002) |
Asia Pacific
| Model | Tokyo(asia-northeast1) | Sydney(australia-southeast1) | Mumbai(asia-south1) | Singapore(asia-southeast1) | Seoul(asia-northeast3) |
|---|---|---|---|---|---|
Gemini 2.5 Flash, 128k(gemini-2.5-flash) | |||||
Gemini 2.5 Flash, 1M(gemini-2.5-flash) | |||||
Gemini 2.5 Flash Image(gemini-2.5-flash-image) | |||||
Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | |||||
Gemini 2.5 Pro(gemini-2.5-pro) | |||||
Tuning for Gemini 2.5 Flash(gemini-2.5-flash) | |||||
Tuning for Gemini 2.5 Flash-Lite(gemini-2.5-flash-lite) | |||||
Tuning for Gemini 2.5 Pro(gemini-2.5-pro) | |||||
Gemini 2.0 Flash(gemini-2.0-flash-001) | |||||
Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | |||||
Tuning for Gemini 2.0 Flash(gemini-2.0-flash-001) | |||||
Tuning for Gemini 2.0 Flash-Lite(gemini-2.0-flash-lite-001) | |||||
Gemini Embeddings(gemini-embedding-001) | |||||
Chirp 2: Transcription(chirp_2) | |||||
Chirp 3: Transcription(chirp_3) | |||||
| Chirp 3: HD Voices | |||||
| Chirp 3: Instant Custom Voice | |||||
Imagen 2(imagegeneration@005) | |||||
| Embeddings for Multimodal | |||||
Embeddings for Text(text-embedding-004) | |||||
Embeddings for Text(text-embedding-005) | |||||
Embeddings for Text(text-multilingual-embedding-002) |
Google Cloud partner model support
Multi-region
| Model | US multi-region | EU multi-region |
|---|---|---|
| Anthropic's Claude Haiku 4.5 | ||
| Anthropic's Claude Opus 4 | ||
| Anthropic's Claude Opus 4.1 | ||
| Anthropic's Claude Sonnet 4 | ||
| Anthropic's Claude Sonnet 4.5 | ||
| Anthropic's Claude 3.5 Haiku | ||
| Anthropic's Claude 3 Haiku | ||
| Anthropic's Claude 3.7 Sonnet (deprecated) | ||
| Codestral (24.05) | ||
| Codestral 2 | ||
| Mistral Large (24.07) | ||
| Mistral Medium 3 | ||
| Mistral OCR (25.05) | ||
| Mistral Small 3.1 (25.03) |
Europe
| Model | Belgium(europe-west1) | Netherlands(europe-west4) |
|---|---|---|
| Anthropic's Claude Haiku 4.5 | ||
| Anthropic's Claude Opus 4 | ||
| Anthropic's Claude Opus 4.1 | ||
| Anthropic's Claude Sonnet 4 | ||
| Anthropic's Claude Sonnet 4.5 | ||
| Anthropic's Claude 3.5 Haiku | ||
| Anthropic's Claude 3 Haiku | ||
| Anthropic's Claude 3.7 Sonnet (deprecated) | ||
| Codestral (24.05) | ||
| Codestral 2 | ||
| Mistral Large (24.07) | ||
| Mistral Medium 3 | ||
| Mistral OCR (25.05) | ||
| Mistral Small 3.1 (25.03) |
Asia Pacific
| Model | Singapore(asia-southeast1) | Taiwan(asia-east1) |
|---|---|---|
| Anthropic's Claude Haiku 4.5 | ||
| Anthropic's Claude Opus 4 | ||
| Anthropic's Claude Opus 4.1 | ||
| Anthropic's Claude Sonnet 4 | ||
| Anthropic's Claude Sonnet 4.5 | ||
| Anthropic's Claude 3.5 Haiku | ||
| Anthropic's Claude 3 Haiku | ||
| Anthropic's Claude 3.7 Sonnet (deprecated) | ||
| Codestral (24.05) | ||
| Codestral 2 | ||
| Mistral Large (24.07) | ||
| Mistral Medium 3 | ||
| Mistral OCR (25.05) | ||
| Mistral Small 3.1 (25.03) |
Google Cloud open model support
Multi-region
| Model | US multi-region | EU multi-region |
|---|---|---|
| DeepSeek-OCR | ||
| DeepSeek R1 (0528) | ||
| DeepSeek-V3.1 | ||
| gpt-oss 120B | ||
| gpt-oss 20B | ||
| Llama 3.1 70B (Preview) | ||
| Llama 3.1 8B (Preview) | ||
| Llama 3.2 90B (Preview) | ||
| Llama 3.3 70B (Preview) | ||
| Llama 3.1 405B | ||
| Llama 4 Maverick 17B-128E (Preview) | ||
| Llama 4 Scout 17B-16E (Preview) | ||
| MiniMax M2 | ||
| Multilingual E5 Large | ||
| Multilingual E5 Small | ||
| Qwen3 235B | ||
| Qwen3 Coder | ||
| Qwen3-Next-80B Instruct | ||
| Qwen3-Next-80B Thinking |
Europe
| Model | Belgium(europe-west1) | Netherlands(europe-west4) |
|---|---|---|
| DeepSeek-OCR | ||
| DeepSeek R1 (0528) | ||
| DeepSeek-V3.1 | ||
| gpt-oss 120B | ||
| gpt-oss 20B | ||
| Llama 3.1 70B (Preview) | ||
| Llama 3.1 8B (Preview) | ||
| Llama 3.2 90B (Preview) | ||
| Llama 3.3 70B (Preview) | ||
| Llama 3.1 405B | ||
| Llama 4 Maverick 17B-128E (Preview) | ||
| Llama 4 Scout 17B-16E (Preview) | ||
| MiniMax M2 | ||
| Multilingual E5 Large | ||
| Multilingual E5 Small | ||
| Qwen3 235B | ||
| Qwen3 Coder | ||
| Qwen3-Next-80B Instruct | ||
| Qwen3-Next-80B Thinking |
Asia Pacific
| Model | Singapore(asia-southeast1) | Taiwan(asia-east1) |
|---|---|---|
| DeepSeek-OCR | ||
| DeepSeek R1 (0528) | ||
| DeepSeek-V3.1 | ||
| gpt-oss 120B | ||
| gpt-oss 20B | ||
| Llama 3.1 70B (Preview) | ||
| Llama 3.1 8B (Preview) | ||
| Llama 3.2 90B (Preview) | ||
| Llama 3.3 70B (Preview) | ||
| Llama 3.1 405B | ||
| Llama 4 Maverick 17B-128E (Preview) | ||
| Llama 4 Scout 17B-16E (Preview) | ||
| MiniMax M2 | ||
| Multilingual E5 Large | ||
| Multilingual E5 Small | ||
| Qwen3 235B | ||
| Qwen3 Coder | ||
| Qwen3-Next-80B Instruct | ||
| Qwen3-Next-80B Thinking |
What's next
- Learn about Google Cloud regions.
Learn more about security controls by feature.
Learn about the models that provide Generative AI on Vertex AI support. See Generative AI foundational model reference.
Learn about Vertex AI locations.