ConfigsApr 2, 2026·4 min read

Nanobrowser — AI Web Automation Chrome Extension

Open-source Chrome extension with multi-agent AI for web automation. Free alternative to OpenAI Operator. 12K+ stars.

TL;DR
Nanobrowser is an open-source Chrome extension that automates web tasks using multi-agent AI with natural language instructions.
§01

What it is

Nanobrowser is an open-source Chrome extension that brings multi-agent AI to web automation. Instead of writing scripts or recording macros, you describe what you want done in natural language and the AI agents navigate, click, fill forms, and extract data from web pages autonomously. It serves as a free alternative to commercial browser automation tools.

Users who need to automate repetitive web tasks -- data extraction, form filling, testing, or multi-step workflows -- without writing code benefit most. The multi-agent architecture means different agents handle navigation, extraction, and validation in parallel.

§02

How it saves time or tokens

Nanobrowser eliminates the need to write and maintain browser automation scripts. Traditional tools like Selenium or Playwright require code for every interaction. Nanobrowser translates a single natural language instruction into a sequence of browser actions. The multi-agent design handles dynamic pages, pop-ups, and authentication flows that break simple scripting approaches. Updates to website layouts do not require script maintenance because the AI adapts to visual changes.

§03

How to use

  1. Install Nanobrowser from the Chrome Web Store or build from source:
git clone https://github.com/nicepkg/nanobrowser.git
cd nanobrowser
npm install && npm run build
  1. Load the extension in Chrome via chrome://extensions with Developer Mode enabled.
  1. Open the Nanobrowser sidebar and describe your task:
Go to amazon.com, search for 'mechanical keyboard', and extract the top 5 product names and prices

The agents navigate the page, perform the search, and return structured results.

§04

Example

# Task: Fill out a multi-step form
Nanobrowser prompt:
'Go to forms.example.com/application,
 fill in name as John Smith,
 email as john@example.com,
 click Next,
 select Experience Level: Senior,
 click Submit'

# Task: Monitor a page for changes
'Check news.ycombinator.com every 5 minutes,
 alert me if a post about AI regulation reaches the front page'

# Task: Extract data into a table
'Go to github.com/trending, extract repo name, stars, and language for all repos on the page'
§05

Related on TokRepo

§06

Common pitfalls

  • Nanobrowser requires an LLM API key (OpenAI, Anthropic, etc.) for the AI agents. Each automation task consumes tokens proportional to the complexity of the page and task.
  • CAPTCHAs and anti-bot measures can block automated interactions. Nanobrowser works best on internal tools or sites without aggressive bot detection.
  • Complex multi-step workflows may require breaking the task into smaller instructions. A single prompt that tries to do too much can confuse the agent planning.

Frequently Asked Questions

Is Nanobrowser free?+

Yes. Nanobrowser is open source and free to use. You need to provide your own LLM API key for the AI agents, which incurs token costs from the model provider. The extension itself has no licensing fees.

Which LLM providers does Nanobrowser support?+

Nanobrowser supports OpenAI, Anthropic, and other providers compatible with the OpenAI chat completion format. You configure the API key and model in the extension settings.

Can Nanobrowser handle dynamic pages with JavaScript?+

Yes. Since Nanobrowser runs as a Chrome extension, it interacts with fully rendered pages including JavaScript-heavy SPAs. The agents see the same DOM that a human user sees.

How does multi-agent architecture help?+

Different agents specialize in different tasks: one handles navigation, another handles data extraction, another validates results. This specialization improves reliability compared to a single agent trying to do everything sequentially.

Can I save and replay automation workflows?+

Nanobrowser supports saving task descriptions that you can rerun later. Since the agents interpret natural language each time, they adapt to minor page layout changes between runs.

Citations (3)
🙏

Source & Thanks

Created by Alex Chen. Licensed under Apache-2.0.

nanobrowser — ⭐ 12,600+

Thanks to Alex Chen for building a truly free and private AI browser automation tool.

Discussion

Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.

Related Assets