Agent 漏斗

实时 Agent 正在用 TokRepo

最近 14 天 agent 漏斗匿名聚合计数。所有事件由 agent funnel API 记录;不记任务文本、文件内容、tokens 或私有资产数据。

设置 TOKREPO_TELEMETRY=0 可关闭埋点。数据源:/api/v1/tokenboard/agent/funnel。

43.4%
中位数 token 节省
1,096
Agent 事件总数
19
Discover 调用
198
Install Apply
39
Handoff
任务经济

高价值 Agent 任务的运行模型

这里是机器契约背后的产品面:TokRepo 借鉴了哪些任务经济机制、哪些生命周期 gate 已经接入、哪些 KPI 已经有公开 eval 证据。

100%
已量化
high_value_task_completion_rate

Percentage of realistic task prompts that complete with a verified user-outcome oracle.

/evals/agent-task-receipts.json
20
已量化
duplicate_rebuilds_avoided

Count of reference tasks where TokRepo discovery avoids rebuilding an equivalent local asset from scratch.

/evals/agent-baseline.json
43.4%
已量化
median_tokens_saved_pct

Median token reduction from using discovered reusable assets on reference tasks.

/evals/agent-baseline.json
追踪中
已签约
reuse_to_creation_ratio

Ratio of resolved/reused capabilities to newly harvested private drafts over a release window.

/agent-task-economy.json
100%
已量化
safe_install_gate_coverage_pct

Percentage of installable assets covered by verify, install-plan, policy, evidence bundle, and rollback checks.

/evals/agent-baseline.json
100%
已量化
handoff_quality_pass_rate

Percentage of harvested or handoff candidates whose quality_gate passes without unsafe files or unresolved sensitive findings.

/evals/agent-task-receipts.json
100%
已量化
long_running_task_state_coverage

Percentage of recurring or delayed agent tasks with owner, schedule, latest evidence, next action, and rollback or handoff state.

/evals/agent-task-ledger.json

Mini harness 证据

每条 case 都记录 Task、Environment、Tools、Trace 和 Grader 输出,TokRepo 不只根据最终回答判断任务经济闭环是否通过。

3/3
通过 case
100%
通过率
100%
模块覆盖
5
harness 模块
capability_gap_plugin_support

Decide whether the repository README proves plugin-system support.

TaskEnvironmentToolsTraceGrader
2 trace 步grader 通过
reuse_before_rebuild_browser_harness

Before rebuilding a browser automation harness, resolve reusable TokRepo capability and inspect a safe install plan.

TaskEnvironmentToolsTraceGrader
3 trace 步grader 通过
post_task_private_harvest

After improving a reusable task harness script, package the change as a private draft and produce a handoff plan without publishing.

TaskEnvironmentToolsTraceGrader
3 trace 步grader 通过

生命周期 gate

  1. 01
    detect_high_value_task

    Classify whether the user task is a repeatable high-value task, a one-off local fix, or a post-task reusable artifact opportunity.

    task_value_hypothesisreuse_potentialrisk_profilesuccess_oracle
  2. 02
    resolve_capability

    Call tokrepo_resolve_capability or tokrepo agent-check before rebuilding local tools.

    selected_candidate_or_controlled_empty_statenext_mcp_callsfallback_cli_commandslifecycle_contract
  3. 03
    plan_safe_run

    Verify trust and request an install plan before writes, execution, credentials, global config, or recurring work.

    policy_decisionpermission_envelopeevidence_bundlerollback_plan
  4. 04
    execute_or_stage

    Use dry-run, stage-only, confirmation-required, or direct execution according to policy and user intent.

    changed_files_or_no_writestate_deltapost_verify_commands
  5. 05
    evaluate_task_outcome

    Evaluate the user's real task outcome with a task-specific oracle instead of claiming success from a local command only.

    task_outcome_verdictevidenceopen_risks
  6. 06
    record_memory_and_state

    Record install state, task evidence, audit snapshots, and project memory so future agents inherit the context.

    TokRepo.lock_or_state_referenceaudit_referenceproject_memory_reference
  7. 07
    harvest_reusable_work

    Run tokrepo_harvest or tokrepo harvest --changed --json when the task creates reusable instructions, scripts, configs, or evals.

    private_package_draftsquality_gatesensitive_findingsdedupe_discovery
  8. 08
    measure_task_economics

    Update aggregate evidence for task completion, duplicated rebuilds avoided, token/time saved, safe install coverage, and handoff quality.

    task_completion_rateduplicate_rebuilds_avoidedmedian_tokens_saved_pctsafe_install_gate_coverage_pcthandoff_quality_pass_rate

可借鉴机制

  • mini_agent_harness

    TokRepo now publishes a deterministic mini task harness eval where each case records the task, fixture environment, tool calls with arguments/results, answer, and grader checks.

    https://tokrepo.com/evals/agent-task-harness.json
  • high_value_task_economics

    Every agent-facing surface should answer which high-value task it helps complete, what duplicate rebuild it avoids, and how the task result is verified.

    https://tokrepo.com/evals/agent-baseline.json
  • production_harness

    TokRepo's harness is the MCP/CLI/install-plan/evidence bundle lifecycle: resolve, discover, verify, plan, dry-run, install or stage, audit, rollback, and handoff.

    https://tokrepo.com/.well-known/tokrepo.json
  • skill_supply_chain

    Treat skills, prompts, scripts, MCP configs, workflows, project rules, and evals as typed supply with private-by-default harvest and human-confirmed publishing.

    https://tokrepo.com/agent-ecosystem.json#supply_handoff
  • long_running_clock

    Represent recurring and delayed work as task runs with schedule, status, state, evidence, next action, and rollback or handoff plan rather than as disconnected one-shot commands.

    https://tokrepo.com/evals/agent-task-ledger.json
  • outcome_oracle_receipts

    TokRepo now publishes deterministic task receipts so long-running work can be audited for freshness and outcome, not only remembered as a scheduled state row.

    https://tokrepo.com/evals/agent-task-receipts.json
  • memory_and_persona_context

    Project memory from tokrepo init-agent, .tokrepo/agent.json, installed-state files, and audit records must carry the user's operating rules into future agent sessions.

    https://tokrepo.com/schemas/agent-memory.schema.json
  • multi_agent_reliability

    Acceptance gates must include cross-agent project memory, local and hosted MCP parity, npx latest packages, install safety, handoff quality, and production live checks.

    https://tokrepo.com/evals/multi-agent-compatibility.json
  • task_to_supply_flywheel

    After a task is verified, agents should run tokrepo_harvest or tokrepo harvest --changed --json, inspect quality_gate, keep drafts private, and only push explicit reviewed files after human confirmation.

    https://tokrepo.com/evals/handoff-quality.json

长任务状态

Treat recurring jobs, delayed follow-ups, and automations as first-class high-value task runs with state and evidence.

当前接入面
TokRepo.lock.tokrepo/state.json.tokrepo/agent.jsontokrepo installed --project --jsonhttps://tokrepo.com/evals/agent-task-ledger.jsonhttps://tokrepo.com/evals/agent-task-receipts.json
计划接入面
schedule/heartbeat connector metadataIM or notification connector metadata
任务运行 ledger

长任务、周期任务和延迟任务必须跨会话保留 owner、schedule、evidence、next action 以及 rollback 或 handoff 状态。

agent-task-ledger.json
3/3
有状态运行
100%
状态覆盖
100%
证据覆盖
0
过期或阻塞
  1. taskrun_agent_discovery_smoke_daily
    healthyowner: tokrepo-release-agentdailypass_with_warning
    schedule
    2026-05-28T00:20:00Z
    latest_evidence
    https://tokrepo.com/.well-known/tokrepo.json
    next_action
    Rerun after the external MCP Registry catches up to live manifest 2.16.1.
    rollback_or_handoff
    handoff · machine-readable endpoint regression or external registry lag persists past the next release window
  2. taskrun_private_harvest_review_weekly
    readyowner: tokrepo-harvest-agentweeklypass
    schedule
    2026-06-01T03:00:00Z
    latest_evidence
    https://tokrepo.com/evals/agent-task-harness.json
    next_action
    Run tokrepo_harvest on changed reusable scripts, keep drafts private, and require quality_gate.status=pass before handoff.
    rollback_or_handoff
    rollback · quality gate fails or sensitive_findings is nonzero
  3. taskrun_delayed_chrome_mcp_recheck
    needs_recheckowner: tokrepo-browser-ops-agentdelayedpass_proxy_connected
    schedule
    2026-05-27T03:30:00Z
    latest_evidence
    local:chrome-mcp-proxy-status
    next_action
    If MCP stdio transport is closed in the current agent session, reconnect the MCP server or start a fresh Codex session before browser-dependent validation.
    rollback_or_handoff
    handoff · mcp__chrome__ list_pages reports Transport closed after proxy health is green
任务运行收据

每个长期任务运行都需要最近一次收据,包含新鲜度、验证命令、证据摘要、结果 oracle、判定和用户效果。

agent-task-receipts.json
3/3
新鲜收据
100%
新鲜度
100%
oracle 覆盖率
100%
通过结果
  1. receipt_20260527_agent_discovery_smoke
    freshness_status: freshpass_with_warning
    last_run_at
    2026-05-27T04:36:00Z
    next_due_at
    2026-05-28T04:36:00Z
    outcome_oracle
    machine_readable_surface_contract · All TokRepo agent discovery surfaces return HTTP 200, include the task-economy eval set, and hosted MCP JSON-RPC tools answer without schema regressions.
    evidence_digest
    sha256:443da96fc8c6287ea881348554b3211c8b44fa36dce2cb3dee08d789815a84ef
    user_effect
    Planning agents can discover the live TokRepo MCP, CLI, manifest, harness, ledger, and task-economy surfaces without GUI browsing; the only warning is external registry version lag outside the production site.
  2. receipt_20260527_private_harvest_review
    freshness_status: freshpass
    last_run_at
    2026-05-27T04:38:00Z
    next_due_at
    2026-06-03T04:38:00Z
    outcome_oracle
    quality_gate_receipt · The private harvest case includes tokrepo_harvest and tokrepo_handoff_plan traces, keeps draft supply private by default, and passes the Grader checks.
    evidence_digest
    sha256:d27a4f80a242278fdd1115bc985fe762895fd742500d6b8b9ebbb92daeb1f389
    user_effect
    Reusable work created during agent sessions can be reviewed as private package drafts with quality gates before any public push.
  3. receipt_20260527_chrome_mcp_recheck
    freshness_status: freshpass_with_handoff
    last_run_at
    2026-05-27T04:44:00Z
    next_due_at
    2026-05-27T05:14:00Z
    outcome_oracle
    browser_validation_fallback · Proxy status remains chromeConnected=true and the production browser validation matrix passes with System Chrome when the current MCP stdio transport is closed.
    evidence_digest
    sha256:e6d852212930cd1bf3d70c6dcc35400c70c50bba8513e14d71607510081f0cb1
    user_effect
    Browser-dependent TokRepo production validation remains executable even when the current Codex Chrome MCP stdio transport needs a session-level reconnect.

标记为已量化的数字来自公开 eval 证据。已签约 KPI 会作为 live 义务持续展示,直到运行时 ledger 开始发布这些测量值。

Discover → Verify → Install → Handoff → Harvest → Publish

Agent 漏斗

Plan→implementation 边界是 TokRepo 的发力点。每一步把 CLI 与 MCP 两条路合并成一个可读信号,不论用户跑哪个 agent 都能看见。

  • 01初始化项目 agent memory(init-agent)init_agent
    31
  • 02Plan 时调 discover / agent-checkmcp_discover
    78
    252% 相对上一步
  • 03信任、权限、policy 验证verify_asset
    107
    137% 相对上一步
  • 04生成 install planinstall_plan
    38
    36% 相对上一步
  • 05执行 install(dry-run + 确认后)install_apply
    198
    521% 相对上一步
  • 06任务后 supply handoffhandoff_plan
    39
    20% 相对上一步
  • 07Harvest — 打包私有草稿harvest_plan
    52
    133% 相对上一步
  • 08Harvest 草稿发布入库harvest_publish
    0
    0% 相对上一步
  • 09发布可复用资产push
    9

计数不是 per-agent 唯一事件,一次 agent 任务通常触发多个事件。这是 population 视图,不是 session 视图。

每日量

每日事件量

全部漏斗阶段的事件按 UTC 日聚合。新增 agent 接入面通常会拉一波。

2026-05-25 — 1012026-05-26 — 1592026-05-27 — 552026-05-28 — 842026-05-29 — 6082026-05-30 — 22026-05-31 — 92026-06-01 — 32026-06-02 — 62026-06-03 — 92026-06-04 — 242026-06-05 — 62026-06-06 — 142026-06-07 — 16

2026-05-25 → 2026-06-07

事件分布

Agent 漏斗记录的全部事件

每行是一个 funnel event 类型。Agent 漏斗刻意保持窄口径——只记 gate 决策或为决策提供 evidence 的事件。

事件次数
install_apply198
install_dry_run179
verify_asset107
rollback_plan105
capability_resolve101
mcp_search90
agent_check78
harvest_plan52
install_plan38
agent_handoff35
audit_asset34
init_agent31
mcp_discover19
find_for_task13
push9
handoff_plan4
mcp_detail3
给你的 agent 接上

想让你的 agent 事件出现在这页?

放进任何使用 Claude Code、Codex、Cursor、Gemini CLI、Copilot、Cline、Windsurf、Roo、OpenHands 或 Aider 的项目即可。init-agent 命令会写入机器可读的 .tokrepo/agent.json 以及 11 个指令接入面。

npx tokrepo init-agent --target all