# LaVague — Natural Language Web Automation

> Give a text objective, LaVague drives the browser to accomplish it. Large Action Model framework for web agents. 6.3K+ stars.

## Install

Copy the content below into your project:

# LaVague — Natural Language Web Automation

## Quick Use

```bash
pip install lavague
```

```python
from lavague.core import WorldModel, ActionEngine
from lavague.core.agents import WebAgent
from lavague.drivers.selenium import SeleniumDriver

driver = SeleniumDriver(headless=False)
agent = WebAgent(
    WorldModel(),
    ActionEngine(driver),
)

agent.get("https://huggingface.co/docs")
agent.run("Find the installation instructions for the Diffusers library")

# The agent navigates, clicks, and scrolls to find the answer
print(agent.result)
```

---

## Intro

LaVague is a Large Action Model (LAM) framework with 6,300+ GitHub stars for building AI web agents that automate browser interactions using natural language objectives. Instead of writing CSS selectors or XPath queries, you describe what you want to achieve and LaVague's agent navigates websites, clicks buttons, fills forms, and extracts information autonomously. It provides full observability into each step — generated actions, page observations, and decision reasoning — making it ideal for building reliable, debuggable web automation pipelines.

Works with: Selenium, OpenAI GPT-4, Anthropic Claude, any OpenAI-compatible API. Best for teams building web scraping agents, QA automation, or AI-powered web workflows. Setup time: under 3 minutes.

---

## LaVague Architecture

### Core Components

```
User Objective: "Book the cheapest flight from NYC to London"
    |
    +-- World Model (LLM)
    |     Understands the page and plans next action
    |
    +-- Action Engine
    |     Generates Selenium code to execute the action
    |
    +-- Selenium Driver
          Executes actions in the browser
```

### How It Works

1. **Observe** — The agent takes a screenshot and reads the page DOM
2. **Think** — The World Model (LLM) decides the next action based on the objective
3. **Act** — The Action Engine generates and executes Selenium code
4. **Repeat** — Until the objective is achieved or max steps reached

### Step-by-Step Observability

```python
agent = WebAgent(WorldModel(), ActionEngine(driver))
agent.get("https://example.com")

# Enable detailed logging
for step in agent.run_step_by_step("Find pricing information"):
    print(f"Step {step.number}:")
    print(f"  Observation: {step.observation}")
    print(f"  Thought: {step.thought}")
    print(f"  Action: {step.action_code}")
    print(f"  Result: {step.result}")
```

### Use Cases

| Use Case | Example |
|----------|--------|
| **Web scraping** | "Extract all product prices from this catalog" |
| **Form filling** | "Fill out this job application with my resume data" |
| **QA testing** | "Test the checkout flow and verify the order total" |
| **Research** | "Find the latest papers on RAG from arXiv" |
| **Monitoring** | "Check if the deployment status page shows all green" |

### Multi-Step Workflows

```python
# Chain multiple objectives
agent.get("https://shopping-site.com")
agent.run("Search for wireless headphones under $50")
agent.run("Sort by customer rating")
agent.run("Extract the top 5 results with names and prices")
results = agent.result
```

### Configuration

```python
from lavague.core import WorldModel

# Use Claude instead of GPT
world_model = WorldModel(
    model_name="anthropic/claude-sonnet-4-6",
    api_key="sk-ant-..."
)

# Headless mode for CI/CD
driver = SeleniumDriver(headless=True)
```

---

## FAQ

**Q: What is LaVague?**
A: LaVague is a Large Action Model framework with 6,300+ GitHub stars for building AI web agents that automate browser tasks using natural language objectives, with full step-by-step observability.

**Q: How is LaVague different from Browser Use or Stagehand?**
A: LaVague focuses on objective-driven automation — you state what you want to achieve, not the individual steps. Browser Use is a Python agent framework. Stagehand provides three TypeScript primitives. LaVague emphasizes observability and debugging for production web automation.

**Q: Is LaVague free?**
A: Yes, open-source under Apache-2.0. You bring your own LLM API keys.

---

## Source & Thanks

> Created by [LaVague AI](https://github.com/lavague-ai). Licensed under Apache-2.0.
>
> [LaVague](https://github.com/lavague-ai/LaVague) — ⭐ 6,300+

---

<!-- ZH -->

## 快速使用

```bash
pip install lavague
```

```python
from lavague.core import WorldModel, ActionEngine
from lavague.core.agents import WebAgent
from lavague.drivers.selenium import SeleniumDriver

driver = SeleniumDriver(headless=False)
agent = WebAgent(WorldModel(), ActionEngine(driver))
agent.get("https://huggingface.co/docs")
agent.run("找到 Diffusers 库的安装步骤")
```

---

## 简介

LaVague 是一个拥有 6,300+ GitHub stars 的大语言动作模型 (LAM) 框架，用于构建基于自然语言目标的 AI 浏览器代理。描述你想完成什么，代理自动导航、点击、填写表单并提取信息。每一步都有完整的可观测性。

---

## 来源与感谢

> Created by [LaVague AI](https://github.com/lavague-ai). Licensed under Apache-2.0.
>
> [LaVague](https://github.com/lavague-ai/LaVague) — ⭐ 6,300+


---
Source: https://tokrepo.com/en/workflows/9f75bd64-ffee-46f1-b6db-b6afc9825a62
Author: TokRepo精选