What pricing models does Google offer for Gemini API access on Vertex AI?

Question

Accepted Answer

## Overview

Google offers flexible pricing for Gemini API access on Vertex AI based on token consumption.

## Key Features

Pricing tiers include Gemini 3 Flash ($0.50/1M input tokens), Gemini 3 Pro ($2.00/1M input), and Gemini 1.5 Pro.

## Technical Specifications

Context caching reduces costs at $0.25 per 1M cached tokens per hour. Batch prediction offers reduced pricing.

## How It Works

Provisioned Throughput provides guaranteed capacity for production workloads at committed pricing.

## Use Cases

## Limitations and Requirements

Rich media inputs (video, audio, images) consume significantly more tokens than text.

## Comparison to Alternatives

## Summary

In conclusion, Google provides multiple pricing options to match different enterprise needs and usage patterns.

Google Gemini