# Nanobrowser — AI Web Automation Chrome Extension > Open-source Chrome extension with multi-agent AI for web automation. Free alternative to OpenAI Operator. 12K+ stars. ## Install Save in your project root: # Nanobrowser — AI Web Automation Chrome Extension ## Quick Use 1. Install from [Chrome Web Store](https://chromewebstore.google.com/detail/nanobrowser/imbddededgmcgfhfpcjmijokokekbkal) or download from GitHub Releases 2. Click the Nanobrowser icon in Chrome toolbar 3. Enter your LLM API key (OpenAI, Anthropic, or Gemini) 4. Type a task in natural language: ``` Find the cheapest flight from NYC to London next Friday on Google Flights ``` 5. Watch the AI agents navigate, click, and extract data in real-time For local/custom models, configure an OpenAI-compatible endpoint (Ollama, vLLM, etc.) in Settings. --- ## Intro Nanobrowser is an open-source Chrome extension with 12,600+ GitHub stars that brings AI-powered web automation directly into your browser. Positioned as a free alternative to OpenAI Operator, it uses a multi-agent architecture where a Planner agent breaks down tasks and a Navigator agent executes browser actions with intelligent self-correction. Everything runs locally in your browser using your own API keys — no subscription fees, no data sent to third-party servers. Supports OpenAI, Anthropic, Gemini, Ollama, Groq, and any OpenAI-compatible endpoint. Works with: Chrome, Edge, OpenAI GPT-4o, Anthropic Claude, Google Gemini, Ollama, Groq, Cerebras. Best for users who want AI browser automation without coding or subscriptions. Setup time: under 2 minutes. --- ## Nanobrowser Multi-Agent Architecture ### Agent Roles ``` ┌──────────────────────────────┐ │ User Request │ │ "Book a table for 2 at..." │ └──────────┬───────────────────┘ │ ┌──────▼──────┐ │ Planner │ Breaks task into steps │ Agent │ Monitors progress └──────┬──────┘ Self-corrects on errors │ ┌──────▼──────┐ │ Navigator │ Executes browser actions │ Agent │ Clicks, types, scrolls └──────┬──────┘ Reads page content │ ┌──────▼──────┐ │ Validator │ Checks task completion │ Agent │ Reports results └─────────────┘ ``` ### How It Works 1. **Task Decomposition** — The Planner agent analyzes your request and creates a step-by-step plan 2. **Web Navigation** — The Navigator agent executes actions: clicking buttons, filling forms, scrolling pages 3. **Self-Correction** — When obstacles arise (popups, CAPTCHAs, changed layouts), agents adapt their strategy 4. **Result Delivery** — The Validator confirms task completion and returns results in the side panel ### Flexible Model Assignment Use different models for different agents to optimize cost and performance: | Agent | Recommended Model | Why | |-------|------------------|-----| | **Planner** | GPT-4o / Claude Sonnet | Needs strong reasoning for task decomposition | | **Navigator** | GPT-4o-mini / Gemini Flash | Needs fast responses for real-time navigation | | **Validator** | Any model | Simple verification tasks | ### Supported Providers | Provider | Models | Setup | |----------|--------|-------| | **OpenAI** | GPT-4o, GPT-4o-mini, o1 | API key | | **Anthropic** | Claude Sonnet, Claude Haiku | API key | | **Google** | Gemini Pro, Gemini Flash | API key | | **Ollama** | Llama, Mistral, Qwen (local) | Local endpoint | | **Groq** | Llama 3, Mixtral (fast inference) | API key | | **Cerebras** | Llama 3 (ultra-fast) | API key | | **Custom** | Any OpenAI-compatible API | Base URL + key | ### Privacy & Security - **100% local execution** — All processing happens in your browser - **Your keys, your data** — API keys stored locally, never sent to Nanobrowser servers - **No subscription** — Completely free, pay only for your LLM API usage - **Open source** — Full code audit available on GitHub ### Example Tasks ``` 📋 "Go to Amazon, search for wireless earbuds under $50, and list the top 5 rated ones" 📋 "Open Google Scholar, search for 'transformer architecture', and extract the top 10 paper titles with citation counts" 📋 "Navigate to GitHub trending, filter by Python, and list today's top repositories" 📋 "Go to my Gmail, find the latest email from John, and summarize it" 📋 "Search for 3-bedroom apartments in Austin TX on Zillow under $2000/month" ``` --- ## FAQ **Q: What is Nanobrowser?** A: Nanobrowser is a free, open-source Chrome extension with 12,600+ GitHub stars that uses AI multi-agent collaboration to automate web tasks directly in your browser. A free alternative to OpenAI Operator. **Q: Is Nanobrowser safe? Does it send my data anywhere?** A: Yes, it's safe. Everything runs locally in your browser. Your API keys and browsing data are stored locally and never sent to Nanobrowser's servers. The only external calls are to your configured LLM provider. **Q: Is Nanobrowser free?** A: Yes, the extension is 100% free and open-source under Apache-2.0. You only pay for LLM API usage from your chosen provider (OpenAI, Anthropic, etc.). --- ## Source & Thanks > Created by [Alex Chen](https://github.com/alexchenzl). Licensed under Apache-2.0. > > [nanobrowser](https://github.com/nanobrowser/nanobrowser) — ⭐ 12,600+ Thanks to Alex Chen for building a truly free and private AI browser automation tool. --- ## 快速使用 1. 从 Chrome 应用商店安装 Nanobrowser,或从 GitHub Releases 下载 2. 点击 Chrome 工具栏中的 Nanobrowser 图标 3. 输入你的 LLM API 密钥(OpenAI、Anthropic 或 Gemini) 4. 用自然语言描述任务,例如:"在 Google Flights 上查找下周五从纽约到伦敦最便宜的机票" 5. 实时观看 AI 代理自动导航、点击和提取数据 --- ## 简介 Nanobrowser 是一个拥有 12,600+ GitHub stars 的开源 Chrome 扩展,将 AI 驱动的网页自动化直接嵌入浏览器。被称为 OpenAI Operator 的免费替代品,使用多 Agent 架构:Planner 分解任务,Navigator 执行浏览器操作,遇到障碍时智能自纠正。一切在本地浏览器中运行,使用你自己的 API 密钥,无需订阅费。 适用于:Chrome、Edge、OpenAI、Anthropic Claude、Gemini、Ollama。适合希望不写代码就能实现 AI 浏览器自动化的用户。 --- ## 核心特性 ### 多 Agent 协作 Planner 负责任务分解和监控,Navigator 负责执行浏览器操作,遇到问题自动修正策略。 ### 灵活的模型分配 为不同 Agent 分配不同模型,优化成本:Planner 用强模型推理,Navigator 用快速模型执行。 ### 隐私优先 100% 本地运行,API 密钥本地存储,无数据上传到第三方。 ### 完全免费 开源免费,只需支付你选择的 LLM 提供商的 API 费用。 --- ## 来源与感谢 > Created by [Alex Chen](https://github.com/alexchenzl). Licensed under Apache-2.0. > > [nanobrowser](https://github.com/nanobrowser/nanobrowser) — ⭐ 12,600+ --- Source: https://tokrepo.com/en/workflows/96b4872e-a870-40ad-b725-c35f20a99ca7 Author: TokRepo精选