Skip to content
OpenCatalogcurated by FLOSSK
AI & Machine Learning

UI-TARS

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

Why it is included

Agent TARS is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product. It primarily ships with a CLI and Web UI for usage. It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world MCP tools.

Best for

Users exploring vetted FOSS alternatives in this space (agent).

Strengths

  • ~29,227 GitHub stars (per upstream list)
  • Open source

Limitations

  • Verify license, platform support, and security posture for your environment.

Good alternatives

Related tools