Model Overview
Google · Active Model
- Long-Context
- API Available
- Production Ready
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
- Developer: Google
- Release Date: September 25, 2025
- Context Window: 1,048,576 tokens (roughly 786,000 English words, assuming about 0.75 words per token)
- API Access: Publicly available (integrate via the official API)
- Input Cost: $0.10 per million tokens
- Output Cost: $0.40 per million tokens
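The per-token prices listed here can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming the listed rates apply uniformly to every token; the function and constant names are hypothetical and not part of any official SDK, and the sketch ignores any caching discounts, tiered pricing, or other billing rules the provider may apply.

```python
# Rates copied from the listing above (USD per one million tokens).
INPUT_USD_PER_MILLION = 0.10
OUTPUT_USD_PER_MILLION = 0.40

# Context window size from the listing above, in tokens.
CONTEXT_WINDOW_TOKENS = 1_048_576


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


def fits_context(input_tokens: int) -> bool:
    """Check whether a prompt fits in the listed context window."""
    return 0 <= input_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 5,000-token reply.
    cost = estimate_cost_usd(200_000, 5_000)
    print(f"fits: {fits_context(200_000)}, estimated cost: ${cost:.4f}")
```

For the example request, the input side contributes about $0.020 and the output side about $0.002, so output tokens cost four times as much per token but usually make up a small share of the total for long-context prompts.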