
Skills · Apr 6, 2026 · 3 min read

Browser-Use Web UI — Visual AI Browser Automation

Gradio-based web interface for the Browser-Use AI agent. Automate web browsing with visual feedback, persistent sessions, and HD recording. Supports 6+ LLM providers. 15,800+ GitHub stars, MIT license.

Browser Use · Community

Agent-ready

Install with prior review

This asset requires review. The copied prompt requests a dry run, shows the planned writes, and continues only after confirmation.

Needs Confirmation · 66/100 · Policy: confirm

Agent surface

Any MCP/CLI agent

Type

Skill

Installation

Single

Trust

Community

Entry point

Browser-Use Web UI — Visual AI Browser Automation

Review-first command

npx -y tokrepo@latest install badeea82-594a-4dbe-8cd0-9c3283235d1d --target codex

Do a dry run first, confirm the writes, then run this command.

TL;DR

Gradio-based web interface for the Browser-Use AI agent, with visual feedback, persistent sessions, and recording support.

§01

What it is

Browser-Use Web UI is a Gradio-based interface that wraps the Browser-Use AI agent. It lets you automate web browsing tasks through a visual dashboard rather than raw scripts. The agent navigates pages, fills forms, clicks buttons, and extracts data while you watch in real time.

This tool is built for QA engineers, growth hackers, and anyone who needs repeatable browser automation without writing Selenium or Playwright code from scratch. It supports six or more LLM providers for the underlying AI reasoning.

§02

How it saves time or tokens

Traditional browser automation requires writing and maintaining brittle selectors. Browser-Use replaces CSS selectors with natural language instructions interpreted by an LLM. The visual feedback loop catches failures immediately instead of after a full test suite run. Persistent sessions mean you do not re-authenticate on every run.

§03

How to use

1. Clone the Browser-Use Web UI repository and install dependencies.
2. Launch the Gradio server locally.
3. Open the web UI, select your LLM provider, and type a natural language task.

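# Clone and install
git clone https://github.com/browser-use/web-ui.git
cd web-ui
pip install -r requirements.txt
# Install the browser binaries that Playwright drives
playwright install

# Launch (the entry-point script in the repository is webui.py)
python webui.py --ip 127.0.0.1 --port 7788
# Open http://127.0.0.1:7788 in your browser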
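
Before opening the browser, it can help to confirm that the Gradio server is actually accepting connections. The following is a minimal sketch using only the Python standard library; `wait_for_server` is a hypothetical helper name, not part of Browser-Use Web UI, and the timeout values are arbitrary defaults.

```python
import time
import urllib.error
import urllib.request


def wait_for_server(url: str, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Poll `url` until it returns any HTTP response or `timeout` seconds pass."""
    # The target is a local server, so bypass any proxy configured in the environment.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with opener.open(url, timeout=2.0):
                return True  # successful response: the server is up
        except urllib.error.HTTPError:
            return True  # an error status still proves something is listening
        except (urllib.error.URLError, OSError):
            time.sleep(interval)  # not accepting connections yet; try again
    return False
```

Pass the address from the launch step as `url`; a `False` result after the timeout usually means the server failed to start or is bound to a different port.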

§04

Example

Task: 'Go to Hacker News, find the top 3 posts, and save their titles and URLs to a CSV file.'

The agent will:
1. Navigate to news.ycombinator.com
2. Identify the top 3 posts by rank
3. Extract title and URL for each
4. Write results to output.csv
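
To make step 4 concrete, here is a rough sketch of what writing the extracted titles and URLs to a CSV file could look like in Python. The function name `write_posts_csv`, the column names, and the sample rows are illustrative assumptions; the agent's actual output layout is not specified in this page.

```python
import csv
from pathlib import Path


def write_posts_csv(posts: list[dict[str, str]], path: str) -> None:
    """Write a header row followed by one row per post to `path`."""
    with Path(path).open("w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=["title", "url"])
        writer.writeheader()
        writer.writerows(posts)


if __name__ == "__main__":
    # Placeholder rows for illustration only, not real Hacker News data.
    sample_posts = [
        {"title": "Example post 1", "url": "https://example.com/1"},
        {"title": "Example post 2", "url": "https://example.com/2"},
        {"title": "Example post 3", "url": "https://example.com/3"},
    ]
    write_posts_csv(sample_posts, "output.csv")
```

Using `csv.DictWriter` rather than manual string joins means titles containing commas or quotes are escaped correctly.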

§05

Related on TokRepo

Browser automation tools — Compare other AI-powered browser automation options
Web scraping tools — Related data extraction workflows

§06

Common pitfalls

Running without a visible browser window (headless mode) disables the visual feedback feature that makes debugging possible.
Some LLM providers have rate limits that cause the agent to stall mid-task; set appropriate retry and backoff parameters.
Persistent sessions store cookies locally; clear them between unrelated tasks to avoid cross-contamination of auth state.
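
The retry-and-backoff advice above can be sketched generically as follows. This is not the Web UI's own retry mechanism, whose settings are not described here; `RateLimitError` is a hypothetical stand-in for whatever exception your provider's client raises on a rate limit, and the delay values are arbitrary.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical stand-in for a provider's rate-limit error."""


def call_with_backoff(
    call: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Retry `call` on RateLimitError, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 10% jitter so parallel runs do not retry in lockstep.
            sleep(delay + random.uniform(0, delay / 10))
    raise ValueError("max_attempts must be at least 1")
```

Capping the delay with `max_delay` keeps a long outage from turning into multi-minute waits between attempts.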

Frequently asked questions

Which LLM providers does Browser-Use Web UI support?

Browser-Use Web UI supports six or more LLM providers including OpenAI, Anthropic Claude, Google Gemini, and local models via Ollama. You select your provider and API key in the web UI settings panel before running a task.

Can I record the browser session as a video?

Yes. The Web UI supports HD recording of browser sessions. Enable recording in the settings panel before starting a task. The output is saved as a video file you can review or share with your team.

Does it work with websites that require login?

Yes. Persistent sessions retain cookies and auth tokens across runs. You can log in once during the first session, and subsequent runs will reuse the authenticated state until cookies expire or you clear them manually.
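
If a previously working login stops working, expired cookies are a likely cause. The sketch below assumes the session has been exported in Playwright's storage-state JSON layout (a top-level `cookies` list whose entries carry `name` and an `expires` Unix timestamp, with `-1` for session cookies). Whether the Web UI stores its session in that form is an assumption, and `expired_cookie_names` is a hypothetical helper.

```python
import time
from typing import Any, Optional


def expired_cookie_names(state: dict[str, Any], now: Optional[float] = None) -> list[str]:
    """Return the names of cookies whose expiry time is at or before `now`."""
    now = time.time() if now is None else now
    expired = []
    for cookie in state.get("cookies", []):
        expires = cookie.get("expires", -1)
        # -1 marks a session cookie with no fixed expiry time, so skip it.
        if expires != -1 and expires <= now:
            expired.append(cookie["name"])
    return expired
```

A non-empty result suggests logging in again rather than debugging the task prompt.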

How does it compare to Playwright or Selenium?

Browser-Use replaces CSS/XPath selectors with natural language instructions. You describe what you want done, and the AI works out how to interact with the page. This eliminates selector maintenance, but you trade precise control for convenience.

Is Browser-Use Web UI free to use?

The Web UI itself is open source and free under the MIT license. However, you pay for the LLM API calls made during automation. Using a local model via Ollama eliminates API costs entirely.
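
For hosted providers, a rough per-task cost can be estimated from the number of agent steps and the tokens used per step. This is back-of-the-envelope arithmetic only: `estimate_task_cost` is a hypothetical helper, and every figure you pass in (steps, token counts, prices per million tokens) has to come from your own provider's pricing and your own runs.

```python
def estimate_task_cost(
    steps: int,
    input_tokens_per_step: int,
    output_tokens_per_step: int,
    input_price_per_mtok: float,
    output_price_per_mtok: float,
) -> float:
    """Estimate the API cost of one task; prices are per million tokens."""
    input_cost = steps * input_tokens_per_step * input_price_per_mtok / 1_000_000
    output_cost = steps * output_tokens_per_step * output_price_per_mtok / 1_000_000
    return input_cost + output_cost
```

With a local model, both prices are zero and the estimate is zero, which matches the point above about Ollama removing API costs.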



Source and acknowledgments

Created by Browser-Use. Licensed under MIT.

web-ui — ⭐ 15,800+

