← Back to tutorials

Google Cloud Functions + Vertex AI: How to Deploy AI with Cloud Functions (2026)

Complete integration guide for Google Cloud Functions and Vertex AI

Google Cloud Functions + Vertex AI Integration Guide 2026

Overview

This guide shows you exactly how to deploy AI with Cloud Functions using Google Cloud Functions and Vertex AI. We cover setup, core integration, and production-ready patterns.

Prerequisites

  • Google Cloud Functions environment set up
  • Vertex AI API key or access credentials
  • Basic understanding of Google Cloud Functions development
  • Installation

    bash
    

    Install required packages

    npm install vertex-ai google-cloud-functions-sdk

    or

    pip install vertex_ai google_cloud_functions

    Quick Setup

    javascript
    // Initialize Vertex AI client
    import { VertexAIClient } from 'vertex-ai';

    const client = new VertexAIClient({ apiKey: process.env.VERTEX_AI_API_KEY, // Additional config based on your Google Cloud Functions setup });

    Core Integration Code

    typescript
    // Complete Google Cloud Functions + Vertex AI integration
    import { OpenAI } from 'openai';
    import express from 'express';

    // AI endpoint app.post('/api/ai', async (req, res) => { const { message, context } = req.body; try { const response = await openai.chat.completions.create({ model: 'gpt-4o-mini', messages: [ { role: 'system', content: You are integrated with Google Cloud Functions. Help with deploy AI with Cloud Functions. }, { role: 'user', content: message } ], stream: false }); res.json({ response: response.choices[0].message.content, usage: response.usage }); } catch (error) { res.status(500).json({ error: error.message }); } });

    app.listen(3000);

    Google Cloud Functions-Specific Integration

    javascript
    // Google Cloud Functions specific patterns for Vertex AI integration

    // Pattern 2: Service layer class AIService { constructor(private readonly client: typeof openai) {} async process(input: string, systemPrompt: string = ''): Promise { const response = await this.client.chat.completions.create({ model: 'gpt-4o-mini', messages: [ ...(systemPrompt ? [{ role: 'system' as const, content: systemPrompt }] : []), { role: 'user' as const, content: input } ] }); return response.choices[0].message.content || ''; } }

    // Pattern 3: React hook (if applicable) function useAI() { const [response, setResponse] = useState(''); const [loading, setLoading] = useState(false); const query = async (message: string) => { setLoading(true); try { const res = await fetch('/api/ai', { method: 'POST', headers: { 'Content-Type': 'application/json' }, body: JSON.stringify({ message }) }); const data = await res.json(); setResponse(data.response); } finally { setLoading(false); } }; return { response, loading, query }; }

    Streaming Support

    typescript
    // Add streaming for better UX
    app.post('/api/ai/stream', async (req, res) => {
      const { message } = req.body;
      
      res.setHeader('Content-Type', 'text/event-stream');
      res.setHeader('Cache-Control', 'no-cache');
      res.setHeader('Connection', 'keep-alive');
      
      const stream = await openai.chat.completions.create({
        model: 'gpt-4o-mini',
        messages: [{ role: 'user', content: message }],
        stream: true
      });
      
      for await (const chunk of stream) {
        const content = chunk.choices[0]?.delta?.content;
        if (content) {
          res.write(data: ${JSON.stringify({ content })}\n\n);
        }
      }
      
      res.write('data: [DONE]\n\n');
      res.end();
    });
    

    Testing the Integration

    bash
    

    Unit test

    curl -X POST http://localhost:3000/api/ai \ -H "Content-Type: application/json" \ -d '{"message": "Test message for deploy AI with Cloud Functions"}'

    Expected:

    {"response": "AI response...", "usage": {...}}

    Load test

    ab -n 100 -c 10 -p test-payload.json -T application/json http://localhost:3000/api/ai

    Production Deployment

    yaml
    

    docker-compose.yml

    services: app: build: . environment: - OPENAI_API_KEY=${OPENAI_API_KEY} - NODE_ENV=production ports: - "3000:3000" healthcheck: test: ["CMD", "curl", "-f", "http://localhost:3000/health"] interval: 30s

    Common Issues

    Issue: Rate limit errors Solution: Implement exponential backoff and request queuing

    Issue: Slow response times Solution: Use streaming and show loading states to users

    Conclusion

    The Google Cloud Functions + Vertex AI integration is powerful and relatively straightforward. This guide gives you the foundation to deploy AI with Cloud Functions in production.


    *Google Cloud Functions + Vertex AI integration guide | May 2026*

    Also available in 中文.