Model Overview
Google · Active Model
- Long-Context
- API Available
- Production Ready
- Price History Tracked
Gemini 2.0 Flash is Google's previous-generation fast, cost-efficient multimodal model. It accepts text, image, and audio inputs with native multimodal understanding, which makes it well suited to high-volume classification, real-time content moderation, and data extraction pipelines. It supports Google's context caching feature, which can reduce costs when the same documents are processed repeatedly. Although the 3.x series has since succeeded it, Gemini 2.0 Flash remains a popular cost-optimized choice for teams with established Vertex AI workflows.
- Developer: Google
- Release Date: February 5, 2025
- Context Window: 1,048,576 tokens (roughly 786,000 words, assuming about 0.75 words per token)
- API Access: Publicly available (integrate via the official API)
- Input Cost: $0.10 per million tokens
- Output Cost: $0.40 per million tokens
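
To make the listed rates concrete, the following is a minimal Python sketch of a per-request cost estimate. The prices and context window come from the listing above; the function name `estimate_cost` and the input-size check are illustrative assumptions, not part of any official SDK, and actual billing may differ (for example, with cached or non-text inputs).

```python
from decimal import Decimal

# Figures taken from the listing above (USD per one million tokens).
INPUT_PRICE_PER_MTOK = Decimal("0.10")
OUTPUT_PRICE_PER_MTOK = Decimal("0.40")
CONTEXT_WINDOW_TOKENS = 1_048_576
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed rates.

    Hypothetical helper: it ignores caching discounts and any
    modality-specific pricing, which the listing does not describe.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Simplifying assumption: only the input is checked against the window.
    if input_tokens > CONTEXT_WINDOW_TOKENS:
        raise ValueError("input exceeds the listed context window")
    input_cost = INPUT_PRICE_PER_MTOK * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_PRICE_PER_MTOK * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example workload: 10,000 requests of 2,000 input and 200 output tokens.
    per_request = estimate_cost(2_000, 200)
    print("estimated total (USD):", per_request * 10_000)
```

Under these assumptions, a workload of 10,000 requests at 2,000 input tokens and 200 output tokens each comes to about $2.80 in total. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.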