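Overview
Webwright gives an LLM a terminal from which it launches browser sessions and completes web tasks end to end. It follows a code-as-action paradigm: the agent writes and executes Playwright scripts. The framework is deliberately minimal (about 1.5k lines of code) and supports multiple model backends.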
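To make the code-as-action idea concrete, here is a minimal sketch of the loop it implies: a model emits a whole script, a harness runs it in a terminal-like subprocess, and any error output is fed back for the next attempt. This is not Webwright's implementation. The names `run_script`, `stub_model`, and `solve` are hypothetical, and the stub stands in for a real LLM call. In an actual run the generated script would drive a browser through Playwright (for example via `playwright.sync_api.sync_playwright`) instead of printing a constant.

```python
import subprocess
import sys
import tempfile
from pathlib import Path
from typing import Callable, Optional


def run_script(source: str, timeout: float = 60.0) -> dict:
    """Write model-authored code to a file and run it in a fresh interpreter."""
    with tempfile.TemporaryDirectory() as workdir:
        script = Path(workdir) / "task.py"
        script.write_text(source, encoding="utf-8")
        proc = subprocess.run(
            [sys.executable, str(script)],
            capture_output=True,
            text=True,
            timeout=timeout,
            cwd=workdir,
        )
    return {
        "returncode": proc.returncode,
        "stdout": proc.stdout,
        "stderr": proc.stderr,
    }


def stub_model(feedback: Optional[str]) -> str:
    """Hypothetical stand-in for an LLM call: returns a script as text."""
    if feedback is None:
        # First attempt contains a deliberate bug (an undefined name).
        return "print(titel)\n"
    # After seeing the traceback, the "model" emits a corrected script.
    return 'title = "Example Domain"\nprint(title)\n'


def solve(model: Callable[[Optional[str]], str], max_turns: int = 3) -> dict:
    """Ask for a script, run it, and feed stderr back until it exits cleanly."""
    feedback = None
    for turn in range(1, max_turns + 1):
        source = model(feedback)
        result = run_script(source)
        if result["returncode"] == 0:
            # The final script is the whole record of how the task was done.
            return {
                "turns": turn,
                "script": source,
                "output": result["stdout"].strip(),
            }
        feedback = result["stderr"]
    raise RuntimeError("no working script within the turn budget")


if __name__ == "__main__":
    print(solve(stub_model))
```

The design point is that the artifact returned by `solve` is a single script that can be re-run later without the model, which is the property the overview attributes to Webwright's approach.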
README Preview
# Webwright

Turn Your Coding Models to Be State-of-the-art Browser Agents

- 📝 **Blog:** [Webwright: A Terminal Is All You Need For Web Agents](https://www.microsoft.com/en-us/research/articles/webwright-a-terminal-is-all-you-need-for-web-agents/)
- 🌐 **Project Page:** [microsoft.github.io/Webwright](https://microsoft.github.io/Webwright/)

Webwright gives an LLM a terminal where it can launch multiple browser sessions to inspect the page and complete a web task. It captures and inspects page screenshots and states only when needed. It requires each web task to be completed end-to-end within a re-runnable Python script, i.e. your web agent's browsing history is a single code file. No multi-agent system, no graph engine, no plugin layer, no hidden orchestration: just a terminal, a browser, and a model.

Already have a favorite agent and wondering how to make Claude Code, Codex, Hermes, or OpenClaw more capable at browser tasks? Consider adding the [Webwright plugin/skills](#-use-as-a-claude-code-skill)!

---

## 📰 News

- **2026-05-11**: Added Task2UI mode, in which Webwright completes the task and renders the results into an HTML-based web app you can easily view and reuse.
- **2026-05-06**: Codex and Claude Code plugin manifests added; install via `/plugin install webwright@webwright`. OpenClaw and Hermes Agent integrations shipped; the same `skills/webwright/` folder now loads across Claude Code, Codex, OpenClaw, and Hermes.
- **2026-05-04**: Initial public release with ~1.5k LoC, OpenAI / Anthropic / OpenRouter backends, and a Playwright environment.

---

## 💡 Motivation: Beyond Step-by-Step Web Interaction in a Stateful Browser

Most web agents today treat the browser session itself as the workspace: at each step the model receives the current page state and predicts a single next operation, such as a click, a type action, a DOM selector, or a short tool call. Whatever the format, the agent is locked into predicting one web action at a time