Skills2026年4月6日·1 分钟阅读

Browser-Use Web UI — Visual AI Browser Automation

Gradio-based web interface for Browser-Use AI agent. Automate web browsing with visual feedback, persistent sessions, and HD recording. Supports 6+ LLM providers. 15,800+ stars, MIT.

Agent 就绪

先审查再安装

这个资产需要先审查。复制的指令会要求 Agent dry-run、列出写入项,确认后再继续。

Needs Confirmation · 66/100策略:需确认
Agent 入口
任意 MCP/CLI Agent
类型
Skill
安装
Single
信任
信任等级:Community
入口
Browser-Use Web UI — Visual AI Browser Automation
先审查命令
npx -y tokrepo@latest install badeea82-594a-4dbe-8cd0-9c3283235d1d --target codex

先 dry-run,确认写入项后再运行此命令。

TL;DR
Gradio-based web interface for Browser-Use AI agent with visual feedback, persistent sessions, and recording support.
§01

What it is

Browser-Use Web UI is a Gradio-based interface that wraps the Browser-Use AI agent. It lets you automate web browsing tasks through a visual dashboard rather than raw scripts. The agent navigates pages, fills forms, clicks buttons, and extracts data while you watch in real time.

This tool is built for QA engineers, growth hackers, and anyone who needs repeatable browser automation without writing Selenium or Playwright code from scratch. It supports six or more LLM providers for the underlying AI reasoning.

§02

How it saves time or tokens

Traditional browser automation requires writing and maintaining brittle selectors. Browser-Use replaces CSS selectors with natural language instructions interpreted by an LLM. The visual feedback loop catches failures immediately instead of after a full test suite run. Persistent sessions mean you do not re-authenticate on every run.

§03

How to use

  1. Clone the Browser-Use Web UI repository and install dependencies.
  2. Launch the Gradio server locally.
  3. Open the web UI, select your LLM provider, and type a natural language task.
# Clone and install
git clone https://github.com/browser-use/web-ui.git
cd web-ui
pip install -r requirements.txt

# Launch
python app.py
# Open http://localhost:7860 in your browser
§04

Example

Task: 'Go to Hacker News, find the top 3 posts, and save their titles and URLs to a CSV file.'

The agent will:
1. Navigate to news.ycombinator.com
2. Identify the top 3 posts by rank
3. Extract title and URL for each
4. Write results to output.csv
§05

Related on TokRepo

§06

Common pitfalls

  • Running without a visible browser window (headless mode) disables the visual feedback feature that makes debugging possible.
  • Some LLM providers have rate limits that cause the agent to stall mid-task; set appropriate retry and backoff parameters.
  • Persistent sessions store cookies locally; clear them between unrelated tasks to avoid cross-contamination of auth state.

常见问题

Which LLM providers does Browser-Use Web UI support?+

Browser-Use Web UI supports six or more LLM providers including OpenAI, Anthropic Claude, Google Gemini, and local models via Ollama. You select your provider and API key in the web UI settings panel before running a task.

Can I record the browser session as a video?+

Yes. The Web UI supports HD recording of browser sessions. Enable recording in the settings panel before starting a task. The output is saved as a video file you can review or share with your team.

Does it work with websites that require login?+

Yes. Persistent sessions retain cookies and auth tokens across runs. You can log in once during the first session, and subsequent runs will reuse the authenticated state until cookies expire or you clear them manually.

How does it compare to Playwright or Selenium?+

Browser-Use replaces CSS/XPath selectors with natural language instructions. You describe what you want done, and the AI figures out how to interact with the page. This eliminates selector maintenance but trades off precise control for convenience.

Is Browser-Use Web UI free to use?+

The Web UI itself is open source and free under the MIT license. However, you pay for the LLM API calls made during automation. Using a local model via Ollama eliminates API costs entirely.

引用来源 (3)
🙏

来源与感谢

Created by Browser-Use. Licensed under MIT.

web-ui — ⭐ 15,800+

讨论

登录后参与讨论。
还没有评论,来写第一条吧。

相关资产