OpenSource-Hub

UI-TARS-desktop

프레임워크

bytedance/UI-TARS-desktop

开源多模态AI代理框架,连接模型与代理基础设施。

개요

该项目是一个多模态AI代理栈,包含Agent TARS(命令行/Web界面)和UI-TARS Desktop桌面应用。支持图形界面代理、浏览器和计算机操作,通过MCP工具集成实现类人任务自动化。

README 미리보기

\n  \n\n\n\n\n## Introduction\n\nEnglish | [简体中文](./README.zh-CN.md)\n\n[](https://trendshift.io/repositories/13584)\n\nTARS\* is a Multimodal AI Agent stack, currently shipping two projects: [Agent TARS](#agent-tars) and [UI-TARS-desktop](#ui-tars-desktop):\n\n\n  \n    \n      Agent TARS\n      UI-TARS-desktop\n    \n  \n  \n    \n      \n        \n      \n      \n        \n      \n    \n    \n      \n        Agent TARS is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product.\n        \n        \n        It primarily ships with a CLI and Web UI for usage.\n        It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world MCP tools.\n      \n      \n        UI-TARS Desktop is a desktop application that provides a native GUI Agent based on the UI-TARS model.\n        \n        \n        It primarily ships a\n        local and \n        remote computer as well as browser operators.\n      \n    \n  \n\n\n## Table of Contents\n\n\n\n\n- [News](#news)\n- [Agent TARS](#agent-tars)\n  - [Showcase](#showcase)\n  - [Core Features](#core-features)\n  - [Quick Start](#quick-start)\n  - [Documentation](#documentation)\n- [UI-TARS Desktop](#ui-tars-desktop)\n  - [Showcase](#showcase-1)\n  - [Features](#features)\n  - [Quick Start](#quick-start-1)\n- [Contributing](#contributing)\n- [License](#license)\n- [Citation](#citation)\n\n\n\n## News\n\n- **\[2025-11-05\]** 🎉 We're excited to announce the release of [Agent TARS CLI v0.3.0](https://github.com/bytedance/UI-TARS-desktop/releases/tag/v0.3.0)! This version brings streaming support for multiple tools (shell commands, multi-file structured display), runtime settings with timing statistics for tool calls and deep thinking, Event Stream Viewer for data flow tracking and debugging. Additionally, it features exclusive support for [AIO agent Sandbox](ht