arrow_backBack to all models
Googleactive

Gemini 2.0 Flash

Released at: February 5, 2025

API StatusAvailable for integration
Context Window1,048,576 tokens
Input Price / MTok$0.10
Output Price / MTok$0.40

Model Overview

Google · Active Model

  • Long-Context
  • API Available
  • Production Ready
  • Price History Tracked

Gemini 2.0 Flash is Google's previous-generation fast, cost-efficient multimodal model, offering a compelling balance of speed, capability, and price. It supports text, image, and audio inputs with native multimodal understanding, making it well-suited for high-volume classification, real-time content moderation, and data extraction pipelines. Gemini 2.0 Flash introduced Google's context caching feature, significantly reducing costs for repeated document processing. While the 3.x series has since succeeded it, Gemini 2.0 Flash remains a popular cost-optimized choice for teams with established Vertex AI workflows.

Developer
Google
Release Date
February 5, 2025
Context Window
1,048,576 tokens1,398 words
API Access
Publicly AvailableIntegrate via official API
Input Cost
$0.10per million tokens
Output Cost
$0.40per million tokens
SHARE MODEL:

Vetted Benchmarks

No recorded benchmark scores available for this model.

Input Price TrendLast 90 Days
Output Price TrendLast 90 Days

How does this model compare?

Evaluate benchmark standing and performance rankings vs all other tracked models.

Compare with another model →