Best AI Tools for Browser Automation (2026)

AI-powered browser agents, web scraping tools, and testing frameworks. Automate web interactions with natural language instructions.

30 tools

Playwright MCP — Browser Automation for AI Agents

Official Playwright MCP server that gives AI agents full browser control — navigate pages, fill forms, click buttons, take screenshots, and run end-to-end tests. 3,000+ stars.

Microsoft AI | 225 | MCP Configs

Chrome MCP Operations Runbook — Iron Rules, Architecture & Troubleshooting

Operations skill for running chrome-devtools-mcp against your real Chrome at scale. Covers the proxy architecture, five iron rules (always-via-proxy, real-browser-only, no-env-proxy, never-kill-current-session, persistent-proxy), Chrome 146+ remote-debug popup workaround, multi-agent isolation guarantees, configuration recipes for ~/.mcp.json and ~/.claude/settings.json (with the 'no glob in permissions' gotcha), step-by-step troubleshooting flow, and four field notes from real incidents — port cleanup heuristics that backfire, protocol-layer hang detection, why 'newest = keep' is wrong, and why heavy pages need filePath-first take_snapshot to avoid 25k token overflow. Pairs with the 'Chrome MCP Background Proxy' script bundle.

henuwangkai | 224 | Skills, MCP Configs

Stagehand — AI-Powered Browser Automation SDK

TypeScript SDK that lets you automate browsers using natural language and visual understanding. AI sees the page like a human does. Built on Playwright. 10,000+ GitHub stars.

Browserbase | 222 | Skills

Nanobrowser — AI Web Automation Chrome Extension

Open-source Chrome extension with multi-agent AI for web automation. Free alternative to OpenAI Operator. 12K+ stars.

AI Open Source | 216 | Skills

Chrome DevTools MCP — Browser Debugging for AI Agents

Give your AI coding agent full access to Chrome DevTools for browser automation, debugging, and performance analysis. Works with Claude, Cursor, Copilot, and 15+ AI tools.

MCP Hub | 213 | MCP Configs

Multi-Browser MCP Proxies — Arc Browser & Chrome Beta Variants

Companion to 'Chrome MCP Background Proxy' for running parallel, isolated MCP fleets against Arc Browser and Chrome Beta on top of the same cdp-proxy.mjs. Arc-specific proxy auto-discovers the WebSocket path from /json/version (Arc doesn't write a DevToolsActivePort file in the standard location); Chrome Beta proxy points at Beta's own DevToolsActivePort. Lets you run mcp__chrome__*, mcp__beta__*, and mcp__arc__* side-by-side with independent client state and no cross-talk.

henuwangkai | 206 | MCP Configs

WXT — Next-Gen Framework for Browser Extension Development

A TypeScript-first framework for building cross-browser extensions with hot reload, auto-imports, and built-in support for Chrome, Firefox, Safari, and Edge from a single codebase.

AI Open Source | 200 | Skills

Chrome Fleet — Multi-Agent Browser Pool with Shared Login State

Multi-agent control plane for chrome-devtools-mcp. Two modes: (1) shared main Chrome — N CDP proxies on 9401/9402/9403... all multiplexing onto one logged-in Chrome :9222 so every agent inherits your real cookies/extensions, with focus protection and ID isolation handled by cdp-proxy.mjs; (2) isolated agent Chromes — dedicated Chrome instance per agent on :930N with its own user-data-dir for multi-account / persona-isolation testing. Includes a status tool to inspect the running fleet.

henuwangkai | 196 | MCP Configs

Selenium — Browser Automation Framework and Ecosystem

Selenium is the original browser automation framework for testing web applications. Its WebDriver API supports Chrome, Firefox, Safari, and Edge, with bindings for Java, Python, C#, Ruby, and JavaScript. It has been the industry standard for E2E web testing since 2004.

Script Depot | 194 | Skills

Pydoll — Browser Automation Without WebDriver

Python async browser automation via Chrome DevTools Protocol. Built-in CAPTCHA solving, anti-detection, no Selenium needed. 6.7K+ stars.

Script Depot | 194 | Scripts

Claude Official Skill: webapp-testing

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots...

Anthropic | 186 | Skills

Playwright MCP Server — Browser Automation for AI

Official Playwright MCP server for browser automation. Lets Claude Code, Cursor, and other AI tools navigate pages, fill forms, click buttons, take screenshots, and run end-to-end tests via natural language...

Microsoft AI | 178 | MCP Configs

Plasmo — The Browser Extension Framework

Build, test, and publish browser extensions for Chrome, Firefox, and Edge using React or Vue with hot-reload and automatic manifest generation.

Script Depot | 176 | Skills

Playwright MCP — Browser Automation Server

Playwright MCP is an MCP server for browser automation via Playwright snapshots. Add via npx in Claude Code/Codex to run deterministic actions.

MCP Hub | 165 | MCP Configs

Playwright — Cross-Browser End-to-End Testing Framework

Reliable end-to-end testing for modern web apps across Chromium, Firefox, and WebKit with a single API.

Microsoft AI | 151 | Skills

WebdriverIO — Next-Gen Browser and Mobile Testing Framework

WebdriverIO is a progressive automation framework for web and mobile testing built on the WebDriver and Chrome DevTools protocols with a rich plugin system.

AI Open Source | 135 | Skills

Playwright MCP — Browser Automation for Agents

Playwright MCP exposes browser automation via MCP with device emulation; verified 5,510★ and documents 143 device profiles plus `playwright install` setup.

MCP Hub | 99 | MCP Configs

Surf CLI — Control Chrome for AI Agents (No MCP)

Surf CLI lets agents control Chrome via a local extension + native host, offering agent-agnostic browser control without running an MCP server.

Script Depot | 98 | CLI Tools

bb-browser — Browser-as-API CLI + MCP Server

Use your real Chrome login state as an API: bb-browser provides a CLI + MCP server with 103 commands across 36 platforms (Twitter/Reddit/YouTube/etc.).

MCP Hub | 94 | MCP Configs

notebooklm-py — NotebookLM CLI + Python API Skill

notebooklm-py provides a CLI + Python API for NotebookLM with agent hooks; verified 13,142★ and uses Playwright Chromium for browser login.

Skill Factory | 91 | Skills

Docker Selenium Grid — Containerized Browser Testing at Scale

Docker Selenium provides pre-built container images to run Selenium Grid with Chrome, Firefox, and Edge, enabling scalable browser automation in CI/CD pipelines.

Script Depot | 84 | Skills

Browserbase — Cloud Browser Infra for AI Agents

Browserbase runs managed cloud Chromium for AI agents. Stagehand, Playwright, Puppeteer compatible. Scales to 1000s of parallel sessions with replay.

Browserbase | 77 | Workflows

Chrome MCP Server — Extension-Based Browser MCP

mcp-chrome turns Chrome into an MCP server via an extension + bridge. Install mcp-chrome-bridge, load the extension, then connect to 127.0.0.1:12306/mcp.

MCP Hub | 75 | MCP Configs

agent-browser — Browser Automation CLI for AI Agents

Rust browser automation CLI for agents; verified 32,921 stars with Chrome-for-Testing install and snapshot+click+fill commands for repeatable runs.

Script Depot | 5 | CLI Tools

Chrome MCP Background Proxy — Fix Popups, Focus Stealing & Multi-Agent Conflicts

Persistent CDP proxy + entry script that lets chrome-devtools-mcp run against your real, logged-in Chrome without the Chrome 146+ consent popup spamming on every connection, without focus stealing (Target.activateTarget / Page.bringToFront are intercepted, createTarget is forced to background), and without request-ID / event collisions when multiple Claude Code windows or sub-agents share one Chrome. Includes the proxy core (cdp-proxy.mjs v3), entry script, safe cleanup, pre-flight healthcheck, and a launchd-style self-healing watchdog with Feishu alerts.

henuwangkai | 495 | MCP Configs, Scripts
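
The interception behaviour in this listing's description can be illustrated with a minimal sketch. This is not the code of cdp-proxy.mjs: the class `CdpRewriter` and its methods `outbound` and `inbound` are hypothetical names invented for illustration. Only the CDP method names and the `background` parameter of `Target.createTarget` come from the Chrome DevTools Protocol. The sketch assumes each client sends JSON messages with its own `id` sequence and shows one way to answer focus-stealing calls locally, force new tabs into the background, and remap request IDs so that two clients using the same `id` do not collide.

```python
import itertools
import json

# CDP methods that pull a tab or window to the foreground.
FOCUS_STEALING = {"Target.activateTarget", "Page.bringToFront"}


class CdpRewriter:
    """Hypothetical message rewriter for several clients sharing one Chrome."""

    def __init__(self):
        self._next_id = itertools.count(1)  # proxy-wide request IDs
        self._pending = {}                  # proxy ID -> (client, client ID)

    def outbound(self, client, raw):
        """Rewrite one client request.

        Returns (to_browser, to_client); exactly one of the two is None.
        """
        msg = json.loads(raw)
        if msg.get("method") in FOCUS_STEALING:
            # Reply locally with an empty result; the browser never sees it.
            return None, json.dumps({"id": msg["id"], "result": {}})
        if msg.get("method") == "Target.createTarget":
            # Force the new tab to open without taking focus.
            msg.setdefault("params", {})["background"] = True
        proxy_id = next(self._next_id)
        self._pending[proxy_id] = (client, msg["id"])
        msg["id"] = proxy_id
        return json.dumps(msg), None

    def inbound(self, raw):
        """Map a browser message back to (client, message with original ID).

        Events carry no "id"; they are returned with client None so the
        caller can decide how to route them.
        """
        msg = json.loads(raw)
        if "id" not in msg:
            return None, raw
        client, client_id = self._pending.pop(msg["id"])
        msg["id"] = client_id
        return client, json.dumps(msg)
```

A real proxy would also need to route events to the right client (for example by session) and handle disconnects; those parts are left out here.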

Multi-Platform Social Media Growth Skill for Developer Communities

Production-tested Claude Code skill for automating authentic developer engagement across 10+ platforms (Twitter, Bluesky, Dev.to, HN, Reddit, GitHub, Product Hunt, Hashnode, Medium, Quora). Includes rotation queue, per-platform cooldowns, action diversity rules, Chrome MCP toolchain with JS injection patterns, anti-ban safety, and continuous optimization framework. Battle-tested across 30+ cycles.

henuwangkai | 260 | Skills

Browser Use — AI Agent Browser Automation

Let AI agents control web browsers with natural language. Browser Use provides vision-based element detection, multi-tab support, and works with any LLM provider.

Browser Use | 236 | MCP Configs

Browser-Use Web UI — Visual AI Browser Automation

Gradio-based web interface for Browser-Use AI agent. Automate web browsing with visual feedback, persistent sessions, and HD recording. Supports 6+ LLM providers. 15,800+ stars, MIT.

Browser Use | 232 | Skills

Stagehand — AI Browser Automation Framework

Three AI primitives — act(), extract(), observe() — to automate any website with natural language. By Browserbase. 21K+ stars.

Browserbase | 204 | Scripts

Obscura — Headless Browser Built for AI Agents and Web Scraping

A high-performance headless browser written in Rust, designed specifically for AI agent workflows and large-scale web scraping with built-in stealth and anti-detection capabilities.

Script Depot | 199 | Skills

AI Browser Agents

Browser automation has evolved from brittle CSS selectors to AI agents that interpret a web page from its rendered layout and content, much as a human reader would.

Visual Web Agents: AI tools like Browser Use and LaVague navigate websites using visual understanding and natural language instructions. They click buttons, fill forms, and extract data without hand-written selectors.

Chrome MCP Servers: Connect Claude Code and other AI assistants to a live browser via the Model Context Protocol. Take screenshots, run JavaScript, inspect network requests, and interact with web pages, all from your AI coding tool. Useful for debugging, testing, and automating web workflows.

Web Scraping: AI-powered scrapers that understand page structure, handle dynamic content (SPAs, infinite scroll, lazy loading), and extract structured data without manual configuration.

E2E Testing: AI agents that generate and maintain Playwright or Puppeteer test suites from natural language descriptions of user flows. They auto-heal broken selectors and adapt to UI changes.
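
As a rough illustration of what auto-healing a broken selector can mean, the sketch below tries an ordered list of candidate selectors and promotes whichever one matched. It is a simplified stand-in, not how any particular tool works: the function names `find_with_healing` and `heal` are hypothetical, the `query` callable stands in for a real browser lookup, and in an AI-driven tool the candidate list would typically be proposed by a model rather than fixed by hand.

```python
from typing import Callable, Optional, Sequence


def find_with_healing(
    query: Callable[[str], Optional[object]],
    selectors: Sequence[str],
) -> tuple[object, str]:
    """Return the first element found and the selector that matched it."""
    for selector in selectors:
        element = query(selector)
        if element is not None:
            return element, selector
    raise LookupError(f"no selector matched: {list(selectors)}")


def heal(selectors: Sequence[str], matched: str) -> list[str]:
    """Move the selector that worked to the front for the next run."""
    return [matched] + [s for s in selectors if s != matched]


# Usage with a fake DOM: the original "#login-btn" no longer exists.
dom = {"button[type=submit]": "<button>", "text=Sign in": "<button>"}
candidates = ["#login-btn", "button[type=submit]", "text=Sign in"]
element, matched = find_with_healing(dom.get, candidates)
candidates = heal(candidates, matched)
```

Keeping the selectors that failed in the list, rather than dropping them, lets a test recover if the UI change is later reverted.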

The browser is the universal API — AI agents that can navigate it can automate anything.

Frequently Asked Questions

What is the best AI tool for browser automation?

For general automation, Browser Use and LaVague are leading AI-native browser agents — they navigate sites using visual understanding. For developer workflows, Chrome MCP servers (connecting AI tools to a live browser) are the most practical. For testing, Playwright with AI-powered test generation offers the best balance of reliability and ease of use.

How do AI browser agents work?

AI browser agents take screenshots of web pages, use vision models to understand the layout and content, and generate click/type/scroll actions to accomplish goals. Unlike traditional automation, which depends on CSS selectors, they can usually keep working when the UI changes. They can follow multi-step instructions like "find the pricing page and extract all plan features into a table."
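
The screenshot, decide, act cycle described above can be sketched as a small loop. This is an illustrative skeleton under stated assumptions, not the implementation of any listed tool: `Action`, `run_agent`, and the three callables (`screenshot`, `decide`, `execute`) are hypothetical names, and the vision model and browser driver are passed in as plain functions so the loop itself stays self-contained.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Action:
    kind: str         # "click", "type", "scroll", or "done"
    target: str = ""  # description of the element to act on
    text: str = ""    # text to enter for "type" actions


def run_agent(
    goal: str,
    screenshot: Callable[[], bytes],
    decide: Callable[[str, bytes, list], Action],
    execute: Callable[[Action], None],
    max_steps: int = 10,
) -> list:
    """Observe the page, ask the model for one action, perform it, repeat."""
    history = []
    for _ in range(max_steps):
        image = screenshot()                   # observe the current page
        action = decide(goal, image, history)  # model picks the next step
        if action.kind == "done":
            break
        execute(action)                        # drive the browser
        history.append(action)
    return history
```

The `max_steps` cap is a simple guard so that a model that never reports completion cannot loop forever.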

Can AI replace traditional web scraping?

For many use cases, yes. AI scrapers understand page structure without manual selector configuration, handle JavaScript-rendered content natively, and adapt when websites change their layout. They're particularly strong for unstructured data extraction. However, for high-volume production scraping, traditional tools with explicit selectors remain more reliable and faster.
