Browser Automation

Best AI Tools for Browser Automation (2026)

AI-powered browser agents, web scraping tools, and testing frameworks. Automate web interactions with natural language instructions.

30 tools

Chrome DevTools MCP — Browser Debugging for AI Agents

Give your AI coding agent full access to Chrome DevTools for browser automation, debugging, and performance analysis. Works with Claude, Cursor, Copilot, and 15+ AI tools.

MCP Hub 21MCP Configs
🧩

Stagehand — AI-Powered Browser Automation SDK

TypeScript SDK that lets you automate browsers using natural language and visual understanding. AI sees the page like a human does. Built on Playwright. 10,000+ GitHub stars.

Skill Factory 19Skills

Claude Official Skill: webapp-testing

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots...

Skill Factory 17Skills

Nanobrowser — AI Web Automation Chrome Extension

Open-source Chrome extension with multi-agent AI for web automation. Free alternative to OpenAI Operator. 12K+ stars.

AI Open Source 16Configs
📜

Pydoll — Browser Automation Without WebDriver

Python async browser automation via Chrome DevTools Protocol. Built-in CAPTCHA solving, anti-detection, no Selenium needed. 6.7K+ stars.

Script Depot 11Scripts

Playwright MCP — Browser Automation for AI Agents

Official Playwright MCP server that gives AI agents full browser control — navigate pages, fill forms, click buttons, take screenshots, and run end-to-end tests. 3,000+ stars.

MCP Hub 8MCP Configs

Playwright MCP Server — Browser Automation for AI

Official Playwright MCP server for browser automation. Lets Claude Code, Cursor, and other AI tools navigate pages, fill forms, click buttons, take screenshots, and run end-to-end tests via natural la

MCP Hub 6MCP Configs

Chrome MCP Background Proxy — Fix Popups, Focus Stealing & Multi-Agent Conflicts

Drop-in proxy for Chrome MCP. Solves Chrome 146+ debugging popups, focus stealing, and multi-agent conflicts with persistent CDP connection, request ID remapping, and sessionId-based event routing.

henuwangkai 83MCP ConfigsScripts

Puter — Open-Source Cloud Desktop in Your Browser

Self-hostable internet OS with file system, app ecosystem, and developer APIs. Privacy-first cloud desktop. 40K+ GitHub stars.

AI Open Source 26Configs

Remotion Rule: Can Decode

Remotion skill rule: Check if a video can be decoded by the browser using Mediabunny. Part of the official Remotion Agent Skill for programmatic video in React.

Skill Factory 24Skills
🔌

Browser Use — AI Agent Browser Automation

Let AI agents control web browsers with natural language. Browser Use provides vision-based element detection, multi-tab support, and works with any LLM provider.

MCP Hub 23MCP Configs

Browser-Use Web UI — Visual AI Browser Automation

Gradio-based web interface for Browser-Use AI agent. Automate web browsing with visual feedback, persistent sessions, and HD recording. Supports 6+ LLM providers. 15,800+ stars, MIT.

Script Depot 22Scripts

LaVague — Natural Language Web Automation

Give a text objective, LaVague drives the browser to accomplish it. Large Action Model framework for web agents. 6.3K+ stars.

AI Open Source 22Knowledge

Devika — Open Source AI Software Engineer

Open-source AI software engineer that plans, researches, and writes code autonomously. Supports Claude, GPT, and local models with browser and terminal access.

Agent Toolkit 20Scripts

Claude Quickstarts — 5 Official Deployable AI App Templates

Anthropic's official quickstart collection with 5 deployable AI apps: customer support agent, financial analyst, computer use, browser tools, and autonomous coder.

TokRepo精选 20Skills

Chrome MCP Proxy

CDP proxy for Chrome DevTools MCP — persistent connection, multi-agent isolation, anti-focus-stealing. Built for Claude Code.

henuwangkai 20MCP ConfigsScripts

Bolt.new — AI Full-Stack Web App Generator

Prompt, run, edit, and deploy full-stack web apps in the browser. AI generates code, installs packages, runs dev server, and deploys — all from a chat interface. 16K+ stars.

AI Open Source 17Configs

Crawlee — Production Web Scraping for Node.js

Build reliable crawlers with automatic proxy rotation, request queuing, and browser automation. By Apify. 22K+ stars.

Script Depot 15Scripts
⚙️

Karakeep — AI-Powered Bookmark Manager

Karakeep (formerly Hoarder) is a self-hosted bookmark manager with AI tagging and full-text search. 24.4K+ stars. Browser extensions, mobile apps. AGPL-3.0.

AI Open Source 15Configs
📜

Browser Use — AI Agent Browser Automation

Make any website accessible to AI agents. Automate browser tasks with LLMs — click, type, navigate, extract data. 70K+ stars, MIT licensed.

Script Depot 14Scripts
📜

MLC-LLM — Universal LLM Deployment Engine

Deploy any LLM on any hardware — phones, browsers, GPUs, CPUs. Compiles models for native performance on iOS, Android, WebGPU, CUDA, Metal, and Vulkan. 22K+ stars.

Script Depot 14Scripts

Crawlee — Web Scraping and Browser Automation Library

Build reliable web scrapers in Node.js or Python. Crawlee handles proxy rotation, browser fingerprints, auto-scaling, and anti-bot bypassing out of the box.

MCP Hub 13Scripts

Skyvern — AI Visual Browser Automation Agent

Automate any website using LLMs and computer vision. No selectors needed — works on sites never seen before. 21K+ stars.

Agent Toolkit 12Workflows
📜

Stagehand — AI Browser Automation Framework

Three AI primitives — act(), extract(), observe() — to automate any website with natural language. By Browserbase. 21K+ stars.

Script Depot 11Scripts

Crawl4AI — LLM-Friendly Web Crawler

Open-source web crawler that outputs clean Markdown for AI. Structured extraction, browser automation, anti-bot handling. 63K+ stars.

Script Depot 11Scripts

React Email — Build and Send Emails Using React

React Email lets developers create beautiful HTML emails with React components, preview in browser, and send via Resend, Nodemailer, or any ESP.

Script Depot 9Scripts
📜

Qwen-Agent — Multi-Tool AI Agent Framework

Build AI agents with Alibaba Qwen models featuring tool calling, code execution, RAG, and browser control. Supports MCP protocol and custom tool development.

Prompt Lab 9Scripts

Bolt.new — Full-Stack App Generator in the Browser

AI-powered development environment that generates, runs, and deploys full-stack web applications entirely in the browser. Describe your app, watch it build in real-time. By StackBlitz. 15,000+ stars.

Skill Factory 9Skills
📜

Browser Use — AI Browser Automation

Open-source Python library for AI-driven browser automation. Works with Claude, GPT, and Gemini to fill forms, scrape data, and navigate websites.

Script Depot 7Scripts
🔌

Notte — Browser Automation MCP for AI Agents

MCP server that turns web browsers into AI agent tools. Notte provides structured browser actions like click, type, navigate, and extract for LLM-driven automation.

MCP Hub 6MCP Configs

AI Browser Agents

AI Browser Agents

Browser automation has evolved from brittle CSS selectors to intelligent AI agents that understand web pages like humans do. Visual Web Agents — AI tools like Browser Use and LaVague navigate websites using visual understanding and natural language instructions. They click buttons, fill forms, and extract data without writing a single selector.

Chrome MCP Servers — Connect Claude Code and other AI assistants to a live browser via Model Context Protocol. Take screenshots, run JavaScript, inspect network requests, and interact with web pages — all from your AI coding tool. Essential for debugging, testing, and automating web workflows.

Web Scraping — AI-powered scrapers that understand page structure, handle dynamic content (SPAs, infinite scroll, lazy loading), and extract structured data without manual configuration. E2E Testing — AI agents that generate and maintain Playwright or Puppeteer test suites from natural language descriptions of user flows. They auto-heal broken selectors and adapt to UI changes.

The browser is the universal API — AI agents that can navigate it can automate anything.

Frequently Asked Questions

What is the best AI tool for browser automation?+

For general automation, Browser Use and LaVague are leading AI-native browser agents — they navigate sites using visual understanding. For developer workflows, Chrome MCP servers (connecting AI tools to a live browser) are the most practical. For testing, Playwright with AI-powered test generation offers the best balance of reliability and ease of use.

How do AI browser agents work?+

AI browser agents take screenshots of web pages, use vision models to understand the layout and content, and generate click/type/scroll actions to accomplish goals. Unlike traditional automation (which uses CSS selectors), AI agents adapt to UI changes automatically. They can follow multi-step instructions like "find the pricing page and extract all plan features into a table."

Can AI replace traditional web scraping?+

For many use cases, yes. AI scrapers understand page structure without manual selector configuration, handle JavaScript-rendered content natively, and adapt when websites change their layout. They're particularly strong for unstructured data extraction. However, for high-volume production scraping, traditional tools with explicit selectors remain more reliable and faster.

Explore Related Categories