What is Defender — Prompt Injection Guardrails for Agents?

Defender is an OSS library to detect and neutralize prompt injection in tool outputs; verified 97★ and bundles a ~22MB ONNX model.

Is Defender — Prompt Injection Guardrails for Agents free to use?

Yes. Defender — Prompt Injection Guardrails for Agents is freely available on TokRepo. Check the Source & Thanks section on the asset page for the specific open-source license.

How do I install Defender — Prompt Injection Guardrails for Agents?

Visit the asset page on TokRepo and click "Copy for agent" to get the installation instructions. Most assets can be installed with a single command.

Defender — Prompt Injection Guardrails for Agents

Main

把边界画清楚：把 tool result 当作不可信输入，并在进入模型上下文前先做防护与裁决。
先保守再放开：先对高风险直接拦截，再按工具字段做白名单/覆盖，逐步降低误报。
记录证据：持久化 riskLevel、tier2Score 与命中的 detections，便于后续安全调参。

Source-backed notes

README 写明 ONNX 模型（约 22MB）随包提供，无需额外下载。
README 描述两层防线（规则检测 + ML 分类器），并提到热身后约 ~10ms/样本的延迟量级。
README 将其定位为 MCP/CLI/tool-call agent 的工具结果防护层，可在进入 LLM 前清洗与裁决。

FAQ

能替代安全提示词吗？：不能。它是额外 guardrail；仍要做好 system prompt 与工具权限控制。
会拖慢 agent 吗？：README 提到热身后约 ~10ms/样本；请按你的负载实测，并尽量做缓存/批处理。
应该放在链路哪里？：放在边界处：拿到工具输出后立刻处理，然后再进入模型上下文。

Defender — Prompt Injection Guardrails for Agents

Agent 可直接安装

Key facts (verified)

Main

Source-backed notes

FAQ

来源与感谢

讨论

相关资产

Prompt Hardener — Prompt-Injection Risk Analyzer

Prompt Injection Defense — Security Guide for LLM Apps

Anamorpher — Image-Scaling Prompt Injection Lab

Superagent SDK — Guardrails Against Prompt Injection