Model Overview
ByteDance · Active Model
- Long-Context
- API Available
- Vetted Benchmarks
- Production Ready
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
- Developer
- ByteDance
- Release Date
- July 22, 2025
- Context Window
- 128,000 tokens≈ 171 words
- API Access
- Publicly AvailableIntegrate via official API
- Input Cost
- $0.10per million tokens
- Output Cost
- $0.20per million tokens