Vertex AI

Configure Google Cloud Vertex AI as an LLM provider in agentgateway.

Authentication

Before you can use Vertex AI as an LLM provider, you must authenticate by using Google Cloud’s Application Default Credentials. Choose from one of the three methods:

  • GOOGLE_APPLICATION_CREDENTIALS
  • application_default_credentials.json
  • metadata server

Configuration

Review the following example configuration.
# yaml-language-server: $schema=https://agentgateway.dev/schema/config
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          name: vertex
          provider:
            vertex:
              projectId: my-project-id
               # Optional: defaults to 'global'
              region: us-west2
              # Optional; overrides the model in requests
              model: google/gemini-2.5-flash-lite-preview-06-17
Review the following example configuration.
SettingDescription
ai.nameThe name of the LLM provider for this AI backend.
vertex.projectIdThe Google Cloud project ID.
vertex.regionThe Google Cloud region. Defaults to global.
vertex.modelOptionally set the model to use for requests. If set, any models in the request are overwritten. If not set, the request must include the model to use.
Agentgateway assistant

Ask me anything about agentgateway configuration, features, or usage.

Note: AI-generated content might contain errors; please verify and test all returned information.

Tip: one topic per conversation gives the best results. Use the + button in the chat header to start a new conversation.

Switching topics? Starting a new conversation improves accuracy.
↑↓ navigate select esc dismiss

What could be improved?

Your feedback helps us improve assistant answers and identify docs gaps we should fix.

Need more help? Join us on Discord: https://discord.gg/y9efgEmppm

Want to use your own agent? Add the Solo MCP server to query our docs directly. Get started here: https://search.solo.io/.