Is Presidio — Detect and Anonymize PII free to use?

Yes. Presidio — Detect and Anonymize PII is freely available on TokRepo. Check the Source & Thanks section on the asset page for the specific open-source license.

How do I install Presidio — Detect and Anonymize PII?

Visit the asset page on TokRepo and click "Copy for agent" to get the installation instructions. Most assets can be installed with a single command.

Skills · May 11, 2026 · 2 min read

Presidio — Detect and Anonymize PII

Detect and anonymize PII in text with Microsoft Presidio, then feed sanitized inputs to LLMs to reduce leakage risk. Works via pip or Docker deployments.

Script Depot · Community

Agent ready

Ready-to-run agent install

This asset can be installed after the agent chooses its runtime, checks the plan, and runs the matching command.

Native · 98/100 · Policy: allow

Agent surface: Any MCP/CLI agent
Kind: Skill
Install: Single
Trust: Established
Entrypoint: Asset

Direct install command

npx -y tokrepo@latest install d4d3e9a3-9494-4b05-bf05-74368b2ff338 --target codex

Run after dry-run confirms the install plan.

Intro

Best for: LLM apps handling customer data that need PII de-identification before prompts, logs, or embeddings
Works with: Python, text pipelines, pre-processing for prompts/logging/indexing; optional Docker services
Setup time: 18 minutes
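
As a rough illustration of the detect-then-anonymize flow, here is a minimal sketch using Presidio's `AnalyzerEngine` and `AnonymizerEngine`. It assumes the `presidio-analyzer` and `presidio-anonymizer` packages and a spaCy English model are installed; the helper names `_engines` and `sanitize` are hypothetical and not part of Presidio.

```python
# Assumed setup (run once in your environment):
#   pip install presidio-analyzer presidio-anonymizer
#   python -m spacy download en_core_web_lg
from functools import lru_cache


@lru_cache(maxsize=1)
def _engines():
    """Build the Presidio engines once; loading the NLP model is slow."""
    # Imported lazily so this module still loads where Presidio is absent.
    from presidio_analyzer import AnalyzerEngine
    from presidio_anonymizer import AnonymizerEngine

    return AnalyzerEngine(), AnonymizerEngine()


def sanitize(text: str, language: str = "en") -> str:
    """Return text with detected PII replaced by Presidio's default placeholders."""
    analyzer, anonymizer = _engines()
    findings = analyzer.analyze(text=text, language=language)
    result = anonymizer.anonymize(text=text, analyzer_results=findings)
    return result.text


# Example call (requires Presidio to be installed):
#   sanitize("My name is Jane Doe and my phone number is 212-555-5555.")
```

With the default settings, detected spans are typically replaced by entity-type placeholders such as `<PERSON>`. The exact entities found depend on the model and recognizers in use, so check the output against your own data before relying on it.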

Quantitative Notes

Setup time ~18 minutes (pip install + download one NLP model if needed)
GitHub stars + forks (verified): see Source & Thanks
Common pattern: sanitize inputs + sanitize outputs + sanitize logs (3 enforcement points)
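
The three enforcement points above can be sketched roughly as follows. This is an illustration under stated assumptions, not Presidio code: `redact_emails` is a hypothetical regex stand-in for a real sanitizer (in practice you would pass a Presidio-backed function), and `guarded_call` and `SanitizingFilter` are names invented for this example.

```python
import logging
import re
from typing import Callable

Sanitizer = Callable[[str], str]

# Hypothetical stand-in detector: only redacts email addresses.
_EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def redact_emails(text: str) -> str:
    return _EMAIL.sub("<EMAIL_ADDRESS>", text)


class SanitizingFilter(logging.Filter):
    """Enforcement point 3: scrub every log message before it is emitted."""

    def __init__(self, sanitize: Sanitizer) -> None:
        super().__init__()
        self._sanitize = sanitize

    def filter(self, record: logging.LogRecord) -> bool:
        # Format the message first, then sanitize the final string.
        record.msg = self._sanitize(record.getMessage())
        record.args = ()
        return True


def guarded_call(prompt: str, llm: Callable[[str], str],
                 sanitize: Sanitizer, log: logging.Logger) -> str:
    safe_prompt = sanitize(prompt)      # 1. sanitize the input
    reply = sanitize(llm(safe_prompt))  # 2. sanitize the output
    log.info("prompt=%s reply=%s", safe_prompt, reply)
    return reply
```

Attaching the filter with `log.addFilter(SanitizingFilter(redact_emails))` acts as a backstop: even if some code path logs raw text directly on that logger, the message is scrubbed before any handler sees it.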

Practical Notes

For production, treat PII sanitization as a policy: define what counts as PII for your domain, add allowlists for non-sensitive identifiers, and write regression tests with realistic examples. Use Presidio as a pre-processor before prompts and embeddings, and consider sanitizing outputs as well, since a model can echo back secrets that users paste in.
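
One way to make the allowlist and regression-test advice concrete is sketched below. It uses a hypothetical regex-based email detector as a stand-in so the example stays self-contained; `make_sanitizer`, `run_regression`, and the sample cases are all invented for illustration. With Presidio you would swap in your own sanitizer and consult its docs for allow-list support.

```python
import re
from typing import Callable, Iterable, List, Tuple

# Hypothetical stand-in detector: only matches email addresses.
_EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def make_sanitizer(allowlist: Iterable[str] = ()) -> Callable[[str], str]:
    """Build a sanitizer that leaves allowlisted values untouched."""
    allowed = {item.lower() for item in allowlist}

    def sanitize(text: str) -> str:
        def replace(match: "re.Match[str]") -> str:
            value = match.group(0)
            return value if value.lower() in allowed else "<EMAIL_ADDRESS>"

        return _EMAIL.sub(replace, text)

    return sanitize


# Each case pairs an input with the exact output the policy requires.
REGRESSION_CASES: List[Tuple[str, str]] = [
    ("Contact jane.doe@example.com today", "Contact <EMAIL_ADDRESS> today"),
    ("Write to support@example.com", "Write to support@example.com"),
    ("No PII here", "No PII here"),
]


def run_regression(sanitize: Callable[[str], str],
                   cases: List[Tuple[str, str]]) -> List[Tuple[str, str, str]]:
    """Return (input, expected, actual) for every case that fails."""
    failures = []
    for text, expected in cases:
        actual = sanitize(text)
        if actual != expected:
            failures.append((text, expected, actual))
    return failures
```

Running `run_regression(make_sanitizer(["support@example.com"]), REGRESSION_CASES)` in CI gives an empty list while the policy holds, and a list of concrete failures when a detector or allowlist change breaks it.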

Safety note: PII detection is probabilistic—combine rules, tests, and human review for high-stakes data flows.

FAQ

Q: Why use it with LLMs? A: It reduces the chance of leaking personal data to model providers, logs, or downstream tools.

Q: Is it only for text? A: Text is the main focus, but Presidio also includes modules for images and structured data; follow the docs for supported modalities and deployments.

Q: Where should I integrate it? A: Integrate in your request middleware and also sanitize transcripts before storage or embeddings.

Source & Thanks

GitHub: https://github.com/microsoft/presidio
License (SPDX): MIT
GitHub stars (verified via api.github.com/repos/microsoft/presidio): 8,019
GitHub forks (verified via api.github.com/repos/microsoft/presidio): 1,041
