What is Superagent SDK — Guardrails Against Prompt Injection?

Superagent SDK helps protect AI apps: detect prompt injections, redact PII/secrets, scan repos for poisoning, and integrate via TS/Python, CLI, or MCP.

Is Superagent SDK — Guardrails Against Prompt Injection free to use?

Yes. Superagent SDK — Guardrails Against Prompt Injection is freely available on TokRepo. Check the Source & Thanks section on the asset page for the specific open-source license.

How do I install Superagent SDK — Guardrails Against Prompt Injection?

Visit the asset page on TokRepo and click "Copy for agent" to get the installation instructions. Most assets can be installed with a single command.

Superagent SDK is an open-source safety toolkit for agent apps: guard against prompt injections, redact PII/secrets, and scan repos for threats. It offers TS/Python SDKs and a CLI, and is GitHub-verified at 6,594★.

Best for: shipping agent apps that need runtime prompt-injection detection and secret/PII redaction

Works with: TypeScript/Python SDKs; optional CLI; integrates with Claude Code/Claude Desktop via MCP server (per repo docs)

Setup time: 10–25 minutes

Key facts (verified)

README lists features: Guard (prompt injection), Redact (PII/secrets), Scan (repo threats).
README shows install npm install safety-agent and environment var SUPERAGENT_API_KEY.
README mentions open-weight guard models with 50–100ms latency (as stated in README).
GitHub: 6,594 stars · 962 forks; pushed 2026-04-11 (GitHub API verified).

Main

Add safety guardrails like you add observability:

Run Guard on every user message and on every tool-call instruction.
Redact before logging or storing transcripts.
Use Scan as part of CI for repos that agents can read, to catch repo poisoning patterns.

Start with "report-only" mode (log findings), then switch to blocking once you see false-positive rates in your domain.

README excerpt (verbatim)

Superagent SDK

Make your AI apps safe.

Website · Docs · Discord · HuggingFace

An open-source SDK for AI agent safety. Block prompt injections, redact PII and secrets, scan repositories for threats, and run red team scenarios against your agent.

Features

Guard

Detect and block prompt injections, malicious instructions, and unsafe tool calls at runtime.

TypeScript:

import { createClient } from "safety-agent";

const client = createClient();

const result = await client.guard({
  input: userMessage
});

if (result.classification === "block") {
  console.log("Blocked:", result.violation_types);
}

Python:

from safety_agent import create_client

client = create_client()

result = await client.guard(input=user_message)

if result.classification == "block":
    print("Blocked:", result.violation_types)

Redact

Remove PII, PHI, and secrets from text automatically.

TypeScript:

const result = await client.redact({
  input: "My email is john@example.com and SSN is 123-45-6789",
  model: "openai/gpt-4o-mini"
});

console.log(result.redacted);
// "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"

Python:

result = await client.redact(
    input="My email is john@example.com and SSN is 123-45-6789",
    model="openai/gpt-4o-mini"
)

print(result.redacted)
# "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"

Scan

Analyze repositories for AI agent-targeted attacks such as repo poisoning and malicious instructions.

TypeScript:

const result = await client.scan({
  repo: "https://github.com/user/repo"
});

console.log(result.result);  // Security report
console.log(`Cost: $${result.usage.cost.toFixed(4)}`);

Python:

result = await client.scan(repo="https://github.com/user/repo")

print(result.result)  # Security report
print(f"Cost: ${result.usage.cost:.4f}")

FAQ

Q: Is Superagent open source? A: Yes—repo is MIT-licensed per GitHub license metadata.

Q: How do I install it? A: README shows npm install safety-agent (and uv add safety-agent for Python).

Q: What does it do? A: Guard against prompt injection, redact secrets/PII, and scan repos for threats (per README).

Superagent SDK — Guardrails Against Prompt Injection

Cet actif peut être lu et installé directement par les agents

Key facts (verified)

Main

README excerpt (verbatim)

Superagent SDK

Features

Guard

Redact

Scan

FAQ

Source et remerciements

Fil de discussion

Actifs similaires

Defender — Prompt Injection Guardrails for Agents

Spikee — Prompt Injection Eval Kit (CLI)

SafeLLMPlayground — Prompt Injection Text-Adventure

Anamorpher — Image-Scaling Prompt Injection Lab