UI-TARS Desktop icon

UI-TARS Desktop

Open-source GUI agent powered by vision-language AI for natural language computer control

Free Cowork Open Source

Overview

UI-TARS Desktop is an open-source desktop application from ByteDance that enables natural language control of your computer through advanced vision-language AI. Using the UI-TARS model, it analyzes your screen in real-time and executes precise mouse and keyboard actions based on your instructions. Automate complex multi-step workflows, navigate applications, fill forms, and complete tasks by simply describing what you want to do. All processing runs locally for complete privacy, with support for multiple operator types including local computer, remote computer, and browser automation.

Pricing: Free (open source)

Architecture: Apple Silicon, Intel

Key Features

  • Vision-language AI that understands screen content and executes GUI actions
  • Natural language control of any application through conversational instructions
  • Real-time screenshot analysis and visual understanding
  • Precise mouse and keyboard automation based on visual context
  • 100% local processing for complete privacy
  • Multiple operator types: local computer, remote computer, browser
  • Cross-platform support for macOS, Windows, and Linux
  • MCP (Model Context Protocol) integration
  • Live status updates and real-time feedback during task execution
  • Open source under Apache 2.0 license

Tags

other