Browser Automation

Best AI Tools for Browser Automation (2026)

AI browser agents, web scraping tools, and AI-powered testing frameworks. Automate web interactions with natural-language instructions.

30 tools

WXT — Next-Gen Framework for Browser Extension Development

A TypeScript-first framework for building cross-browser extensions with hot reload, auto-imports, and built-in support for Chrome, Firefox, Safari, and Edge from a single codebase.

AI Open Source · 495 · Skills

Stagehand — AI-Powered Browser Automation SDK

TypeScript SDK that lets you automate browsers using natural language and visual understanding. AI sees the page like a human does. Built on Playwright. 10,000+ GitHub stars.

Browserbase · 458 · Skills

Nanobrowser — AI Web Automation Chrome Extension

Open-source Chrome extension with multi-agent AI for web automation. Free alternative to OpenAI Operator. 12K+ stars.

AI Open Source · 456 · Skills

Multi-Browser MCP Proxies — Arc Browser & Chrome Beta Variants

Companion to 'CDP WebSocket 代理' for running parallel, isolated MCP fleets against Arc Browser and Chrome Beta on top of the same cdp-proxy.mjs. Portable $HOME paths + install steps included. Arc proxy auto-discovers the WebSocket path from /json/version; Chrome Beta proxy points at Beta's own DevToolsActivePort. Lets you run mcp__chrome__*, mcp__beta__*, and mcp__arc__* side-by-side with independent client state and no cross-talk.

henuwangkai · 446 · MCP Configs
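The two discovery routes mentioned in this entry (reading `/json/version`, and reading a `DevToolsActivePort` file) can be sketched in a few lines of Python. This is only an illustration of the general Chrome DevTools Protocol conventions, not the contents of cdp-proxy.mjs, which are not shown here. The function names and the sample IDs are assumptions made up for the example.

```python
import json


def ws_url_from_version_json(body: str) -> str:
    """Extract the browser-level WebSocket URL from a /json/version response body."""
    info = json.loads(body)
    return info["webSocketDebuggerUrl"]


def ws_url_from_active_port_file(text: str, host: str = "127.0.0.1") -> str:
    """Build the WebSocket URL from the contents of a DevToolsActivePort file.

    Chrome writes the debugging port on the first line and the browser
    WebSocket path on the second line.
    """
    lines = text.strip().splitlines()
    port, path = int(lines[0]), lines[1].strip()
    return f"ws://{host}:{port}{path}"


# Sample inputs with made-up IDs, standing in for a live browser.
sample_version = json.dumps({
    "Browser": "Chrome/0.0.0.0",
    "webSocketDebuggerUrl": "ws://127.0.0.1:9222/devtools/browser/abc-123",
})
sample_port_file = "9222\n/devtools/browser/def-456\n"

print(ws_url_from_version_json(sample_version))
print(ws_url_from_active_port_file(sample_port_file))
```

In a real setup the first input would come from an HTTP GET against the browser's debugging port and the second from the file inside that browser's profile directory.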

Playwright MCP — Browser Automation for AI Agents

Official Playwright MCP server that gives AI agents full browser control — navigate pages, fill forms, click buttons, take screenshots, and run end-to-end tests. 3,000+ stars.

Microsoft AI · 445 · MCP Configs

Chrome DevTools MCP — Browser Debugging for AI Agents

Give your AI coding agent full access to Chrome DevTools for browser automation, debugging, and performance analysis. Works with Claude, Cursor, Copilot, and 15+ AI tools.

MCP Hub · 422 · MCP Configs

Pydoll — Browser Automation Without WebDriver

Python async browser automation via Chrome DevTools Protocol. Built-in CAPTCHA solving, anti-detection, no Selenium needed. 6.7K+ stars.

Script Depot · 411 · Scripts

Claude Official Skill: webapp-testing

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots...

Anthropic · 392 · Skills

Chrome Fleet — Multi-Agent Browser Pool with Shared Login State

Multi-agent control plane for chrome-devtools-mcp. Portable $HOME paths + install steps included. Two modes: (1) shared main Chrome — N CDP proxies on 9401/9402/9403... all multiplexing onto one logged-in Chrome :9222 with focus protection and ID isolation handled by cdp-proxy.mjs; (2) isolated agent Chromes — dedicated Chrome instance per agent on :930N with its own user-data-dir. Includes a status tool to inspect the running fleet.

henuwangkai · 373 · MCP Configs
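The port layout of the two modes described in this entry can be made concrete with a small planning function. This is a sketch under assumptions: the port numbers follow the listing (proxies on 9401, 9402, 9403 in front of Chrome on 9222; isolated Chromes on 930N), while `plan_fleet`, `AgentEndpoint`, and the `.chrome-fleet/agent-N` profile path are hypothetical names invented for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional

MAIN_CHROME_PORT = 9222    # shared, logged-in Chrome (from the listing)
PROXY_BASE_PORT = 9400     # proxies listen on 9401, 9402, 9403, ...
ISOLATED_BASE_PORT = 9300  # isolated Chromes listen on :930N


@dataclass(frozen=True)
class AgentEndpoint:
    agent: int                    # 1-based agent index
    listen_port: int              # port the agent's MCP server connects to
    chrome_port: int              # Chrome debugging port behind it
    user_data_dir: Optional[str]  # None when the main Chrome profile is shared


def plan_fleet(n_agents: int, mode: str, home: str = "/home/example") -> List[AgentEndpoint]:
    """Return one endpoint per agent for 'shared' or 'isolated' mode."""
    if mode not in ("shared", "isolated"):
        raise ValueError(f"unknown mode: {mode!r}")
    plan = []
    for n in range(1, n_agents + 1):
        if mode == "shared":
            # Every proxy multiplexes onto the same logged-in Chrome.
            plan.append(AgentEndpoint(n, PROXY_BASE_PORT + n, MAIN_CHROME_PORT, None))
        else:
            # Each agent gets its own Chrome and its own profile directory
            # (the directory name below is a hypothetical placeholder).
            port = ISOLATED_BASE_PORT + n
            plan.append(AgentEndpoint(n, port, port, f"{home}/.chrome-fleet/agent-{n}"))
    return plan


for endpoint in plan_fleet(3, "shared") + plan_fleet(2, "isolated"):
    print(endpoint)
```

The trade-off the plan makes visible: shared mode reuses one login state but needs the proxy to keep agents from interfering with each other, while isolated mode avoids interference at the cost of a separate profile per agent.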

Playwright MCP Server — Browser Automation for AI

Official Playwright MCP server for browser automation. Lets Claude Code, Cursor, and other AI tools navigate pages, fill forms, click buttons, take screenshots, and run end-to-end tests via natural la

Microsoft AI · 371 · MCP Configs

Playwright — Cross-Browser End-to-End Testing Framework

Reliable end-to-end testing for modern web apps across Chromium, Firefox, and WebKit with a single API.

Microsoft AI · 371 · Skills

Playwright MCP — Browser Automation Server

Playwright MCP is an MCP server for browser automation via Playwright snapshots. Add via npx in Claude Code/Codex to run deterministic actions.

MCP Hub · 365 · MCP Configs
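As a rough illustration of "add via npx", many MCP clients read a JSON file containing an `mcpServers` map whose entries name a command and its arguments. The Python sketch below only builds and prints such a fragment. The `@playwright/mcp` package name is the one published by the Playwright project, but the server name `playwright` is an arbitrary label, and the file location and exact schema depend on the client, so treat the shape as an assumption to check against your client's documentation.

```python
import json


def playwright_mcp_config(server_name: str = "playwright") -> dict:
    """Build an MCP client config fragment that launches Playwright MCP via npx."""
    return {
        "mcpServers": {
            server_name: {
                "command": "npx",
                "args": ["@playwright/mcp@latest"],
            }
        }
    }


# Print the fragment so it can be pasted into a client's MCP config file.
print(json.dumps(playwright_mcp_config(), indent=2))
```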

Selenium — Browser Automation Framework and Ecosystem

Selenium is the original browser automation framework for testing web applications. Its WebDriver API supports Chrome, Firefox, Safari, and Edge, with bindings for Java, Python, C#, Ruby, and JavaScript. The industry standard for E2E web testing since 2004.

Script Depot · 363 · Skills

Plasmo — The Browser Extension Framework

Build, test, and publish browser extensions for Chrome, Firefox, and Edge using React or Vue with hot-reload and automatic manifest generation.

Script Depot · 361 · Skills

Playwright MCP — Browser Automation for Agents

Playwright MCP exposes browser automation via MCP with device emulation; verified 5,510★ and documents 143 device profiles plus `playwright install` setup.

MCP Hub · 332 · MCP Configs

Docker Selenium Grid — Containerized Browser Testing at Scale

Docker Selenium provides pre-built container images to run Selenium Grid with Chrome, Firefox, and Edge, enabling scalable browser automation in CI/CD pipelines.

Script Depot · 331 · Skills

WebdriverIO — Next-Gen Browser and Mobile Testing Framework

WebdriverIO is a progressive automation framework for web and mobile testing built on the WebDriver and Chrome DevTools protocols with a rich plugin system.

AI Open Source · 318 · Skills

Surf CLI — Control Chrome for AI Agents (No MCP)

Surf CLI lets agents control Chrome via a local extension + native host, offering agent-agnostic browser control without running an MCP server.

Script Depot · 314 · CLI Tools

notebooklm-py — NotebookLM CLI + Python API Skill

notebooklm-py provides a CLI + Python API for NotebookLM with agent hooks; verified 13,142★ and uses Playwright Chromium for browser login.

Skill Factory · 299 · Skills

Chrome MCP Server — Extension-Based Browser MCP

mcp-chrome turns Chrome into an MCP server via an extension + bridge. Install mcp-chrome-bridge, load the extension, then connect to 127.0.0.1:12306/mcp.

MCP Hub · 274 · MCP Configs

bb-browser — Browser-as-API CLI + MCP Server

Use your real Chrome login state as an API: bb-browser provides a CLI + MCP server with 103 commands across 36 platforms (Twitter/Reddit/YouTube/etc.).

MCP Hub · 273 · MCP Configs

Browserbase — Cloud Browser Infra for AI Agents

Browserbase runs managed cloud Chromium for AI agents. Stagehand, Playwright, Puppeteer compatible. Scales to 1000s of parallel sessions with replay.

Browserbase · 251 · Workflows

CloakBrowser — Stealth Chromium for AI Browser Automation

CloakBrowser is a stealth Chromium fork that passes every major bot detection test, offering a drop-in Playwright replacement with source-level fingerprint patches for reliable AI-driven browser automation.

AI Open Source · 167 · Configs

Chromedp — Drive Browsers with the Chrome DevTools Protocol in Go

Pure Go library for controlling browsers via the Chrome DevTools Protocol, without external dependencies like Selenium or PhantomJS.

AI Open Source · 91 · Configs

MCP Config: Chrome Beta (Proxy Mode)

Chrome Beta browser MCP proxy configuration (portable, $HOME-based via bash -c). Bridges the CDP WebSocket on port 9222 to MCP on port 9402. Requires chrome-beta-mcp-proxy.sh and cdp-proxy.mjs in ~/scripts (see the Multi-Browser MCP Proxies asset).

henuwangkai · 76 · MCP Configs

Rod — Chrome DevTools Protocol Driver for Go

A high-level Go library for controlling browsers via the Chrome DevTools Protocol, designed for web automation, scraping, and testing.

Script Depot · 67 · Scripts

Squoosh — Browser-Based Image Compression by Google

Squoosh is an open-source web app from Google Chrome Labs that compresses images using codecs like MozJPEG, AVIF, WebP, and OxiPNG directly in the browser with no server uploads required.

Script Depot · 38 · Scripts

Chrome DevTools MCP — AI Browser Debugging via Model Context Protocol

An MCP server that lets AI coding agents control and inspect live Chrome browsers for automated debugging, performance analysis, and testing.

Script Depot · 36 · Scripts

Camofox — Stealth Headless Browser for AI Agent Automation

A drop-in Puppeteer and Playwright replacement that bypasses Cloudflare, bot detection, and anti-scraping measures, enabling reliable browser automation for AI agents.

Script Depot · 36 · Scripts

agent-browser — Browser Automation CLI for AI Agents

Rust browser automation CLI for agents; verified 32,921 stars with Chrome-for-Testing install and snapshot+click+fill commands for repeatable runs.

Script Depot · 7 · CLI Tools

AI Browser Agents

Browser automation has evolved from brittle CSS selectors to AI agents that interpret web pages visually, much as a person would.

Visual Web Agents: AI tools like Browser Use and LaVague navigate websites using visual understanding and natural language instructions. They click buttons, fill forms, and extract data without hand-written selectors.

Chrome MCP Servers: Connect Claude Code and other AI assistants to a live browser via the Model Context Protocol. Take screenshots, run JavaScript, inspect network requests, and interact with web pages, all from your AI coding tool. Useful for debugging, testing, and automating web workflows.
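Tools in this category ultimately drive the browser through the Chrome DevTools Protocol (CDP), where each request is a JSON message with an `id`, a `method`, and `params`. The sketch below builds the messages for the three capabilities named above. The CDP method names are real, but `cdp_command` is a hypothetical helper, and whether a given MCP server sends CDP directly or through a library such as Puppeteer or Playwright is an implementation detail not covered here.

```python
import itertools
import json

# Each CDP request carries a unique id so responses can be matched to requests.
_ids = itertools.count(1)


def cdp_command(method: str, **params) -> str:
    """Serialize one CDP request as it would be sent over the WebSocket."""
    return json.dumps({"id": next(_ids), "method": method, "params": params})


# Take a screenshot of the current page.
screenshot = cdp_command("Page.captureScreenshot", format="png")
# Run JavaScript in the page and return the result by value.
evaluate = cdp_command("Runtime.evaluate", expression="document.title", returnByValue=True)
# Start receiving network request and response events.
network = cdp_command("Network.enable")

for message in (screenshot, evaluate, network):
    print(message)
```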

Web Scraping: AI-powered scrapers that understand page structure, handle dynamic content (SPAs, infinite scroll, lazy loading), and extract structured data without manual configuration.

E2E Testing: AI agents that generate and maintain Playwright or Puppeteer test suites from natural language descriptions of user flows. They auto-heal broken selectors and adapt to UI changes.
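One simple way to picture selector auto-healing is a fallback chain: when the preferred selector no longer matches, try alternatives until one does. The sketch below shows only that fallback idea against a toy page represented as a dictionary. It is not how any particular tool implements healing. In an AI-driven tool the candidate list would presumably be proposed by a model, whereas here it is hard-coded, and `find_with_fallback` is a hypothetical name.

```python
from typing import Mapping, Optional, Sequence, Tuple


def find_with_fallback(
    dom: Mapping[str, str], candidates: Sequence[str]
) -> Optional[Tuple[str, str]]:
    """Return (selector, element_text) for the first candidate present in the toy DOM."""
    for selector in candidates:
        if selector in dom:
            return selector, dom[selector]
    return None  # nothing matched: the test needs human or model attention


# Toy pages: the button's id was removed in a redesign.
old_dom = {"#buy-btn": "Buy now"}
new_dom = {"button[data-test=checkout]": "Buy now"}

# Preferred selector first, then progressively looser alternatives.
candidates = ["#buy-btn", "button[data-test=checkout]", "text=Buy now"]

print(find_with_fallback(old_dom, candidates))
print(find_with_fallback(new_dom, candidates))
```

Recording which fallback matched lets a test suite update its preferred selector later instead of silently relying on the looser one.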

The browser is the universal API — AI agents that can navigate it can automate anything.

Frequently Asked Questions

What is the best AI tool for browser automation?

For general automation, Browser Use and LaVague are the leading AI browser agents; they navigate sites using visual understanding. For developer workflows, Chrome MCP servers (which connect AI tools to a live browser) are the most practical. For testing, Playwright with AI-powered test generation offers the best balance of reliability and ease of use.

How do AI browser agents work?

AI browser agents take screenshots of web pages, use vision models to understand the layout and content, and generate click, type, and scroll actions to accomplish a goal. Unlike traditional automation, which relies on CSS selectors, AI agents adapt automatically to UI changes. They can follow multi-step instructions such as "find the pricing page and extract every plan's features into a table".
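The screenshot, decide, act cycle described above can be sketched as a short loop. Everything here is a stand-in: `FakeBrowser` records actions instead of driving a real browser, and `scripted_model` is a hypothetical fixed policy in place of a vision model. The sketch shows the control flow only, not any specific agent's implementation.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Action:
    kind: str         # "click", "type", "scroll", or "done"
    target: str = ""  # what to act on, described in words
    text: str = ""    # text to type, if any


@dataclass
class FakeBrowser:
    """Stand-in for a real browser: records actions instead of performing them."""
    log: List[Action] = field(default_factory=list)

    def screenshot(self) -> bytes:
        # A real agent would capture pixels; this encodes how many actions have run.
        return f"screen-after-{len(self.log)}-actions".encode()

    def perform(self, action: Action) -> None:
        self.log.append(action)


def run_agent(
    goal: str,
    browser: FakeBrowser,
    decide: Callable[[str, bytes], Action],
    max_steps: int = 10,
) -> bool:
    """Observe, decide, act until the model says 'done' or the step budget runs out."""
    for _ in range(max_steps):
        action = decide(goal, browser.screenshot())
        if action.kind == "done":
            return True
        browser.perform(action)
    return False


def scripted_model(goal: str, screenshot: bytes) -> Action:
    """Hypothetical stub in place of a vision model: a fixed three-step plan."""
    step = int(screenshot.decode().split("-")[2])
    plan = [Action("click", "Pricing link"), Action("scroll", "plans table"), Action("done")]
    return plan[step]


browser = FakeBrowser()
finished = run_agent("find the pricing page", browser, scripted_model)
print(finished, [a.kind for a in browser.log])
```

The `max_steps` budget matters in practice: a model that never emits `done` would otherwise loop indefinitely.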

Can AI replace traditional web scraping?

In many use cases, yes. AI scrapers understand page structure without manual selector configuration, handle JavaScript-rendered content natively, and adapt when sites change their layout. They are especially strong at extracting unstructured data. For high-volume production scraping, however, traditional tools with explicit selectors remain faster and more reliable.

Explore related categories

AI Tools for Web Scraping · AI Tools for Automation · AI Tools for Testing · AI Tools for Coding