Model Overview
Google · Active Model
- Long-Context
- API Available
- Production Ready
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
- Developer
- Release Date
- April 3, 2026
- Context Window
- 262,144 tokens≈ 350 words
- API Access
- Publicly AvailableIntegrate via official API
- Input Cost
- $0.06per million tokens
- Output Cost
- $0.33per million tokens