Model Overview
Google · Active Model
- Long-Context
- API Available
- Production Ready
Gemini 3.1 Flash Lite is Google’s generally available (GA), high-efficiency multimodal model, optimized for low-latency, high-volume workloads. It accepts text, image, video, audio, and PDF inputs and is designed for lightweight agentic...
- Developer: Google
- Release Date: May 7, 2026
- Context Window: 1,048,576 tokens (≈ 1,398 pages)
- API Access: Publicly available; integrate via the official API
- Input Cost: $0.25 per million tokens
- Output Cost: $1.50 per million tokens
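
To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a request from the listed rates and checks a prompt against the listed context window. The function names (`estimate_cost_usd`, `fits_context_window`) are hypothetical helpers, not part of any official SDK, and the constants are copied from this listing and should be verified against current official pricing. The sketch also assumes flat per-token rates with no tiering, caching discounts, or modality-specific pricing.

```python
from decimal import Decimal

# Figures copied from the listing above (assumed flat rates; verify before relying on them).
INPUT_USD_PER_MILLION = Decimal("0.25")
OUTPUT_USD_PER_MILLION = Decimal("1.50")
CONTEXT_WINDOW_TOKENS = 1_048_576  # 2**20 tokens

_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / _MILLION


def fits_context_window(prompt_tokens: int) -> bool:
    """Return True if the prompt alone fits within the listed context window."""
    return 0 <= prompt_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    cost = estimate_cost_usd(input_tokens=200_000, output_tokens=10_000)
    print(f"Estimated cost: ${cost}")
    print("Fits in context:", fits_context_window(200_000))
```

Under these assumptions, a request with 200,000 input tokens and 10,000 output tokens costs 200,000 × $0.25 / 1,000,000 + 10,000 × $1.50 / 1,000,000 = $0.05 + $0.015 = $0.065. `Decimal` is used instead of floats so that small per-request amounts sum without rounding drift across high-volume workloads.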