Model Overview
Meta · Active Model
- Long-Context
- API Available
- Production Ready
- Price History Tracked
Llama 3.1 8B is Meta's lightweight open-weight model from the Llama 3.1 generation, optimized for efficient deployment on consumer hardware and edge devices. Despite its compact 8-billion-parameter size, it delivers strong performance on instruction following, text summarization, and lightweight coding tasks. Lllama 3.1 8B is the most downloaded model in the Llama family and runs efficiently on laptops, single GPUs, and CPU via quantization — making it the default choice for on-device AI applications and local prototyping.
- Developer
- Meta
- Release Date
- July 23, 2024
- Context Window
- 131,072 tokens≈ 175 words
- API Access
- Publicly AvailableIntegrate via official API
- Input Cost
- $0.04per million tokens
- Output Cost
- $0.04per million tokens