为什么选它
MetaGPT’s thesis: multi-agent systems improve when you give them a Standard Operating Procedure. Instead of letting agents improvise coordination, MetaGPT hard-codes the waterfall of a software team — PM writes PRD, architect writes design doc, engineer writes code, QA writes tests — with structured artifacts flowing between stages. The framework reaches state-of-the-art on code generation benchmarks largely because of that discipline.
It’s opinionated in a way CrewAI isn’t. You don’t define arbitrary roles; you pick from a set of battle-tested ones and customize. That’s a feature for software projects and a limit for other domains. For a "write me an app" demo, MetaGPT is still the benchmark reference in 2026.
The newer Data Interpreter and MGX (multi-agent X) extensions broaden the scope — data science, document generation, and general task execution with SOP scaffolding. Worth tracking if your workflow is "structured but not software".
Quick Start — Software Company from One Prompt
generate_repo is the shortcut. Under the hood MetaGPT instantiates ProductManager → Architect → ProjectManager → Engineer → QaEngineer with message flow scripted by the SOP. For custom SOPs, subclass Role and register new Actions; see the MetaGPT docs on Roles and Actions.
# pip install metagpt
# metagpt --init-config # creates ~/.metagpt/config2.yaml — set OPENAI_API_KEY
import asyncio
from metagpt.software_company import generate_repo
async def main():
repo = generate_repo(
idea="Create a simple command-line pomodoro timer with start/pause/reset.",
investment=3.0, # USD cap on LLM spend
n_round=5, # max rounds of agent collaboration
)
print(repo) # path to the generated project folder
asyncio.run(main())
# Inside the generated folder you get:
# prd.md (product manager output)
# design.md (architect output)
# task.md (project manager output)
# src/pomodoro.py (engineer output)
# tests/test_pom.py (QA output)
# Each artifact is a structured contract for the next agent.核心能力
Predefined software roles
ProductManager, Architect, ProjectManager, Engineer, QaEngineer — each with specialized system prompts, tools, and output schemas. Ready-made for "build this app" workflows.
SOP-driven message passing
Not free-form conversation. Each role produces a structured artifact (PRD, system design, task list, code) that becomes the input contract for the next role. Prevents hallucination drift.
Data Interpreter
A generalist data-analysis agent mode. Input a dataset + question; get code + results + explanation. Competitive with ChatGPT Advanced Data Analysis on many benchmarks.
Multi-agent environments
Roles live in a shared Environment that routes messages and simulates turns. Extend by adding new Roles or customizing the SOP graph.
Tool use + Code execution
Engineers can run generated code; QA agents execute tests. Sandboxed via configurable executors (local, Docker).
Multilingual prompts
Strong Chinese + English support — the framework is Chinese-led (DeepWisdom) and prompts are written to work well across both.
对比
| Specialization | Opinionation | Best Domain | Flexibility | |
|---|---|---|---|---|
| MetaGPTthis | Software dev SOPs | Very high | Code / structured tasks | Medium (custom SOPs possible) |
| CrewAI | General | Medium | Any role-based pipeline | High |
| AutoGen | General | Low | Research, open-ended | Very high |
| LangGraph | General | Low | Complex control flow | Very high |
实际用例
01. Prototype app generation
Go from "I want a tool that does X" to a running repo in minutes. Great for internal tools, hackathon starters, and exploring ideas before committing to a real implementation.
02. Specification-first pipelines
Any workflow where "write a plan first, then execute" works better than freestyle. MetaGPT’s SOP enforces that discipline — adapt it to non-software domains via custom Roles.
03. Data analysis automation
Data Interpreter mode handles "here’s a CSV, tell me X and show me Y" in one call. Useful embedded in BI tools or analysis assistants.
价格与许可
MetaGPT: MIT open source. Free to self-host. Config via ~/.metagpt/config2.yaml for LLM keys and model selection.
Model cost: the SOP produces a lot of structured text. A single generate_repo run typically costs $0.50-$5 on gpt-4o-class models. Cap via the investment argument to avoid runaway spend.
Commercial offering: MetaGPT’s parent DeepWisdom offers MGX and enterprise consulting. Not required for the OSS framework.
常见问题
Is MetaGPT only for code generation?+
It originated there and remains strongest there. The Data Interpreter mode extends to analytical tasks. General-purpose multi-agent work is possible via custom Roles, but in most cases CrewAI is simpler for non-software domains.
Does MetaGPT work with non-OpenAI models?+
Yes. config2.yaml supports OpenAI, Anthropic Claude, Gemini, Zhipu GLM, Ollama, and other OpenAI-compatible endpoints. Claude 3.5 and GPT-4-class models give the best results; smaller models produce unreliable structured artifacts.
How does MetaGPT compare to Devin / autonomous coding agents?+
MetaGPT generates a project from a spec; Devin-style tools iterate on existing codebases with human feedback. Complementary — use MetaGPT to bootstrap, hand off to a coding agent or real developer for evolution.
Can I customize the software-company SOP?+
Yes. Subclass Role and override Actions. You can prune roles (skip QA for a prototype), add new ones (Designer, Researcher), or replace the entire SOP graph. Documented in the "Customize roles" guide.
Is MetaGPT production-ready?+
For generating prototypes and scaffolds, yes. For production code you run unattended, no — like all agentic code generation, treat output as a starting point that needs human review.