TOKREPO · 主题包

本周新建

AI 测试生成 + E2E 全家桶

想让 AI 把缺失的单测补齐、从规格生成 property 测试、用真浏览器跑 E2E、CI 里自动 triage 快照的开发者，这十个就是顺序：Test Engineer agent + Vitest + Jest + pytest + Hypothesis + MSW + Playwright + Playwright MCP + Playwright Tester agent + verify-app — 按推荐安装顺序排列。

10 个资产

关于这个主题包

这个 pack 包含什么

这是给那个终于承认「AI 写无聊测试比我快」的工程师准备的 pack — 那些 6 种字符串排列的 validator 测试、14 个字段的接口测试、点穿整个 onboarding 的 E2E。你不想再手写了。你要的是一整套系统：一个 agent 读文件出测试计划，一个不让你等的 runner，一个补齐你脑子里漏掉的 case 的 property 生成器，一个不让测试打到生产的 mock，一个 agent 能开的真浏览器，一个 CI 里 triage 红灯的 subagent。

下面这十个就是那套系统的安装顺序。JavaScript/TypeScript 是脊柱（现代 web 大部分），Python 是第二条线（AI / 数据那一侧），中间靠 Test Engineer agent 按语言挑 runner 衔接。每一个都开源、都在 TokRepo 上，AI 编码 agent 在 session 里直接装得了。

目标读者：手里有个真实代码仓库，覆盖率 <40%，sprint 时段尝试过写测试然后放弃，现在想让 Claude / Codex / Cursor 在你监督下把这事儿干了。装完第 10 个，你会有 unit + integration + E2E + snapshot 测试在每次 commit 上跑，外加一个会读失败日志、用大白话告诉你哪儿坏了的 subagent。

它们怎么协同

Test Engineer (#1)
   │
   └─ 读代码库、挑 runner、起草计划
         │
单测层:
   Vitest (#2)  ←  主力（Vite 系）
   Jest (#3)    ←  兜底（legacy / 非 Vite）
   pytest (#4)  ←  Python 侧
         │
Property 层:
   Hypothesis (#5) 从规格生成输入
         │
集成边界:
   MSW (#6) mock 网络，测试保持 hermetic
         │
E2E 层:
   Playwright (#7) — 框架
   Playwright MCP (#8) — agent 开真浏览器
   Playwright Tester agent (#9) — 写 spec
         │
CI 层:
   verify-app subagent (#10) — 在 diff 上跑 E2E，triage 失败

Test Engineer + Hypothesis + verify-app 这三件套是 agent 化的脊柱。拿掉这三个，剩下的就是普通测试栈。留着这三个，循环就闭合：agent 规划、生成器抓边界、runner 报告、subagent triage，你做决策不是敲断言。

你会遇到的取舍

Vitest vs Jest — Vitest 更快、原生 ESM、不用 transform 配置 — 但假设了 Vite。Jest 更老更慢，但在每个出过的 JS 环境里都能跑。经验法则：新项目 = Vitest；继承的代码库 = 跟现有那个。
Hypothesis vs 示例测试 — Property 测试能抓你示例永远想不到的 bug，但更难写难读。Hypothesis 用在纯函数（parser、validator、数学），I/O 重的代码还是用示例（性质只能是「不崩」，没啥用）。
MSW vs 真测试服务器 — MSW 更快更确定性，但它会骗你 — 代码通过的是 mock，可能不符合生产 schema。对策：从 OpenAPI 规格生成 MSW handler，mock 和生产就不会悄悄漂移。
Playwright vs Playwright MCP — 框架跑你写的脚本测试。MCP 让 agent 即兴发挥。两个都要：scripted Playwright 在 CI 里跑回归；Playwright MCP 用来「agent 你点一下新 onboarding 告诉我哪儿坏了」这种临时活。
Test Engineer agent vs 自己写计划 — agent 在策略上 ~80% 正确，对你具体业务不变量经常错。把它的计划当草稿，执行前改一遍。

常见踩坑

一次装 10 个 — 别。挑你的栈（JS 或 Python），先装 1-2-5-6-7-10 这六个，跑出绿色 pipeline，再加剩下的。pack 是菜单不是任务清单。
第一天就让 agent 写 200 个测试 — 质量比数量重要。让 Test Engineer agent 起草 10 个核心路径测试，每个 code review，再扩。200 个平庸测试就是包装精美的技术债。
MSW handler 永远不更新 — 把 MSW handler 当类型定义对待：API 变了就重新生成。stale mock 是「测试全绿」依然 ship 坏代码的元凶。
CI 里跑 Playwright MCP — 别。MCP 是给交互式 session 让 agent 探索用的。CI 应该跑 scripted Playwright spec（更快、可复现、不烧每次 LLM 费用）。用 Playwright Tester (#9) 写 spec，让 CI 确定性地跑。
觉得 verify-app 多余因为「我 CI 已经跑测试了」 — 你的 CI 跑测试。verify-app 解释失败。第一次 Claude Code session 把测试搞挂、verify-app 告诉你哪个 selector 改了，你就懂区别了。
Hypothesis 用在不纯的代码上 — 给一个碰数据库的函数做 property 测试，就是 flake 制造机。先把纯逻辑重构出来，对纯逻辑做 property 测试，I/O 留给示例 pytest。

安装 · 一行命令

$ tokrepo install pack/ai-test-generation-e2e

丢给 agent，或粘到终端

包内含什么

10 个资产打包就绪

Skill#01

Claude Code Agent: Test Engineer

Test automation and quality assurance specialist. Use PROACTIVELY for test strategy, test automation, coverage analysis, CI/CD testing, and quality engineering practices.

by TokRepo精选·15 views

$ tokrepo install claude-code-agent-test-engineer-f3c765fa

Skill#02

Vitest — Next Generation Testing Framework Powered by Vite

Vitest is a blazing-fast unit testing framework powered by Vite, with native ESM, TypeScript, and JSX support. Jest-compatible API, instant HMR for tests, and in-source testing make it the go-to test runner for Vite projects.

by AI Open Source·189 views

$ tokrepo install vitest-next-generation-testing-framework-powered-vite-267275ed

Skill#03

Jest — Delightful JavaScript Testing Framework

Jest is a delightful JavaScript testing framework with a focus on simplicity. Zero-config for most JS/TS projects, snapshot testing, mocking, code coverage, and parallel test execution. Created by Facebook and used to test React, Instagram, and many large codebases.

by AI Open Source·174 views

$ tokrepo install jest-delightful-javascript-testing-framework-4240433c

Skill#04

pytest — The Python Testing Framework That Scales

pytest makes it easy to write small tests, yet scales to support complex functional testing. Fixtures, parameterization, plugins, markers, and a rich assertion introspection system. The de facto testing standard for the Python ecosystem.

by AI Open Source·162 views

$ tokrepo install pytest-python-testing-framework-scales-42405aa1

Skill#05

Hypothesis — Property-Based Testing for Python

Hypothesis is a Python testing library that generates test cases automatically based on specifications you define, finding edge cases that hand-written tests miss.

by AI Open Source·144 views

$ tokrepo install hypothesis-property-based-testing-python-a0e81556

Skill#06

MSW — API Mocking of the Next Generation

Mock Service Worker intercepts network requests at the service worker layer, letting you mock REST and GraphQL APIs for tests and development without stubbing fetch. The same mocks work in Node, jsdom, browsers, and React Native.

by Script Depot·166 views

$ tokrepo install msw-api-mocking-next-generation-dc65506c

Skill#07

Playwright — Cross-Browser End-to-End Testing Framework

Reliable end-to-end testing for modern web apps across Chromium, Firefox, and WebKit with a single API.

by Microsoft AI·145 views

$ tokrepo install playwright-cross-browser-end-end-testing-framework-af656d8e

MCP#08

Playwright MCP — Browser Automation for Agents

Playwright MCP exposes browser automation via MCP with device emulation; verified 5,510★ and documents 143 device profiles plus `playwright install` setup.

by MCP Hub·85 views

$ tokrepo install playwright-mcp-browser-automation-for-agents

Skill#09

Claude Code Agent: Playwright Tester

Testing mode for Playwright tests

by TokRepo精选·27 views

$ tokrepo install claude-code-agent-playwright-tester-cf9884c5

Skill#10

verify-app — E2E Test Subagent for Claude Code

Open-source Claude Code subagent that runs end-to-end tests on recent changes and triages failures. Inspired by Boris Cherny's verify-app setup.

by Skill Factory·210 views

$ tokrepo install verify-app-e2e-test-subagent-for-claude-code-203ea157

常见问题

同一个项目真的要同时装 Vitest 和 Jest 吗？

不用。挑一个。pack 列了两个是因为不同项目栈不一样 — Vite 系装 Vitest，老 CRA / Node CommonJS 留 Jest。新项目直接选 Vitest 跳过第 3 步。Test Engineer agent (#1) 扫一下 repo 30 秒告诉你装哪个。

Playwright (#7) / Playwright MCP (#8) / Playwright Tester agent (#9) 区别？

Playwright 是框架 — CI 里跑 .spec.ts 文件。Playwright MCP 是 server，让编码 agent 交互式开真浏览器（探索好用，CI 跑费用爆炸）。Playwright Tester agent 是专门给你写 .spec.ts 的。流水线：Tester agent 写 spec → CI 用 Playwright 跑 → MCP 留给「现在就要真浏览器看看」的临时活。

AI 能直接生成示例的话，为啥还要 Hypothesis？

AI 生成它想得到的示例。Hypothesis 从性质推导：带 unicode 边界的随机字符串、边界附近的整数、共享引用的列表 — 人（或 LLM）不会想到去试的 case。配合很好：让 Claude 提出性质陈述，Hypothesis 找反例，把测试 ship 上去。

只有一下午的话，先装哪三个？

Test Engineer agent (#1)、Playwright (#7)、verify-app (#10)。Agent 规划，Playwright 跑杠杆最高的测试（E2E 能抓单测永远抓不到的 bug），verify-app 解释失败。第二天按语言加 Vitest (#2) 或 pytest (#4)。中间几个（MSW、Hypothesis、Playwright MCP）是第二周回报的升级。

这个 pack 是不是只针对 Claude Code？

两个 subagent（#1 Test Engineer、#10 verify-app）和 #9 Playwright Tester 是 Claude Code 原生。其它（Vitest / Jest / pytest / Hypothesis / MSW / Playwright / Playwright MCP）跟语言和工具无关 — Cursor / Codex CLI / Cline / Roo / 纯 CLI 都能跑。不在 Claude Code 上的话，把三个 agent 换成你工具链里的等价物，剩下的安装顺序照搬。

更多主题包

12 个主题包 · 80+ 精选资产

回首页浏览全部精选合集

返回主题包总览