Cette page est affichée en anglais. Une traduction française est en cours.
KnowledgeMay 19, 2026·2 min de lecture

MCP Latency Probe — tools/list p95 Runbook

MCP tool calling latency runbook for agents. Measures tools/list p95, separates server latency from network delay, and defines pause rules.

Prêt pour agents

Cet actif peut être lu et installé directement par les agents

TokRepo expose une commande CLI universelle, un contrat d'installation, le metadata JSON, un plan selon l'adaptateur et le contenu raw pour aider les agents à juger l'adaptation, le risque et les prochaines actions.

Needs Confirmation · 64/100Policy : confirmer
Surface agent
Tout agent MCP/CLI
Type
Knowledge
Installation
Single
Confiance
Éditeur vérifié
Point d'entrée
README.md
Commande CLI universelle
npx tokrepo install 6bda6a2c-4eff-47f7-ae1c-fbd382e11100

What To Measure

Track four numbers every time:

Signal Why it matters
tools/list p95 from production Proves the hosted MCP route and local reverse proxy are healthy.
tools/list p95 from the automation host Shows what external agents may experience from that network.
HTTP status mix Separates latency from auth, routing, or deploy failures.
Tool count Detects a bad deploy that returns 200 with a truncated catalog.

A practical threshold is p95 < 800ms for the production-hosted probe. If the production probe is 20ms but an overseas laptop sees 2200ms, do not call the MCP server broken. Report the product as healthy and log the client-network penalty separately.

For search and triage, label the incident explicitly as MCP tool calling latency when the slow path affects tools/list, tools/call, or another JSON-RPC tool method. That wording keeps the runbook discoverable when an agent searches for "mcp tool calling latency" instead of "MCP endpoint p95".

Common Failure Modes

  • Cold Nuxt process: first hit after restart is slow, later hits are fast. Use at least 10 samples.
  • Reverse proxy buffering or TLS path: localhost is fast, public domain is slow from the same server.
  • Caller geography: server-side probe is fast, laptop probe is slow. This is a distribution-network issue, not route logic.
  • JSON-RPC body mismatch: a GET or malformed POST may exercise a different handler.
  • Tool catalog bloat: large descriptions can make tools/list slow even when routing is fine.

Decision Rule

  1. If production-side p95 is under the target and status is 200, keep the MCP service marked healthy.
  2. If production-side p95 is over target twice in a row, pause growth actions and inspect server logs, PM2 uptime, and route payload size.
  3. If only external p95 is high, report the geography/network caveat and continue product-quality work.
  4. If tool count changes unexpectedly, verify the deployed manifest before doing any promotion.
🙏

Source et remerciements

This is an original TokRepo runbook by William Wang. It uses standard JSON-RPC MCP semantics from the Model Context Protocol documentation and ordinary curl timing fields from the curl write-out documentation.

Fil de discussion

Connectez-vous pour rejoindre la discussion.
Aucun commentaire pour l'instant. Soyez le premier à partager votre avis.

Actifs similaires