TOKREPO · ARSENAL
New · this week

Lawyer's AI Contract Review Kit

Ten privacy-aware picks for in-house counsel, firm associates, and solo lawyers using AI to redline contracts, build a clause library, and triage NDAs/MSAs. Intake & archive → OCR → local LLM (so confidential drafts never leave your laptop) → clause RAG → privacy proxy for cloud calls → e-sign. Install in this order.

10 assets

What's in this pack

This is a stack for the lawyer who has reluctantly admitted that AI can shave hours off a contract review — and is then horrified by the privacy implications of pasting a draft MSA into a public chatbot. The whole pack is built around one principle: the more sensitive the document, the more local the processing. Cloud LLMs are still here, but only behind a privacy proxy and only for tasks where the content is non-confidential or already redacted.

Nothing in this kit is legal advice and nothing replaces a qualified attorney's judgment. These are infrastructure picks — the plumbing that lets you, the lawyer, work faster on the parts that are mechanical (extracting parties, finding the indemnity clause, flagging missing notice provisions) so you can spend your billable hours on the parts that are not (jurisdiction nuance, business risk, client strategy).

Install in this order (intake → review → clause library → redline → output)

  1. Claude Code Agent: Legal Advisor — a focused subagent profile for legal documentation and compliance tasks. Use it as your orchestrator: it knows when to ask for the document type, what fields to extract, and how to structure the review. It is not a substitute for an attorney; treat it as a junior paralegal that drafts the first pass.
  2. Papra — minimalist self-hosted document archive. Drop incoming contracts here first. Self-hosting means client documents live on infrastructure you control instead of a third-party SaaS. Tag by counterparty, matter, and status.
  3. Paperless-ngx — full document-management system with built-in OCR and automatic tagging. Use it when Papra is too thin — when you have hundreds of executed contracts and need search across the body text, not just the filename.
  4. Claude Official Skill: PDF — read, create, and edit PDFs from inside Claude Code. This is your everyday tool for opening a counterparty's draft, extracting the signature block, and writing comments back to the same file.
  5. Zerox — zero-shot PDF OCR designed for AI pipelines. Use it for the contracts that arrive as a fax-of-a-photocopy-of-a-fax. Output is structured text suitable for downstream LLM redlining without manual cleanup.
  6. Jan — offline AI desktop app with full local privacy. This is where confidential drafts actually live during review. No network call. No cloud log. The model runs on your machine; the document never leaves it.
  7. Ollama — runs the open-weight models that power Jan (and anything else local). Llama 3.x or Qwen at a sensible quantisation handles routine clause comparison and extraction well; you reserve frontier cloud models for the genuinely hard reasoning, behind the proxy below.
  8. Cherry Studio Knowledge Base — local RAG over 50+ file formats. This is your clause library: feed it your firm's preferred indemnity language, every MSA you've signed, and your house-style NDA. The agent retrieves precedent clauses by similarity instead of guessing.
  9. pasteguard — local privacy proxy for AI tools. When a task genuinely needs a frontier cloud model, pasteguard sits between you and the API and strips names, addresses, monetary amounts, and other configurable patterns before the request leaves the laptop. Re-inflates them in the response.
  10. Documenso — open-source DocuSign alternative. Once the redline is agreed, route the final document through an e-sign platform you self-host. Audit trail stays on your infrastructure; nothing about the matter ends up in a SaaS log you don't control.

How they fit together

  Counterparty draft ─► Papra / Paperless-ngx (intake + OCR + archive)
                          │
                          ▼
              Claude PDF Skill / Zerox
             (extract text + structure)
                          │
                          ▼
        ┌────── Legal Advisor agent ──────┐
        │   (orchestrates the review)      │
        │     ▲                            │
        │     │ retrieve precedent         │
        │     └─ Cherry Studio (clause RAG)│
        └────────────────┬─────────────────┘
                         ▼
            ┌─ local redline path ─┐    ┌─ cloud path (rare) ─┐
            │   Jan + Ollama       │    │   pasteguard ──► API │
            │ (confidential docs)  │    │ (redact → call → re- │
            │                      │    │   inflate response)  │
            └──────────┬───────────┘    └──────────┬──────────┘
                       └──────────┬─────────────────┘
                                  ▼
                            Documenso
                       (self-hosted e-sign)

Tradeoffs you'll hit

  • Cloud vs local for confidential docs. Cloud frontier models are better at long-context legal reasoning today; local models are dramatically better at not leaking your client's M&A draft. The pack is biased toward local because the consequence of a leak (privilege waiver, malpractice exposure) is asymmetric. Default local; reach for cloud only behind pasteguard and only for non-sensitive content.
  • AI redline vs manual. AI is reliably good at finding what should be there and isn't (missing notice provision, no jurisdiction clause), at extracting structured data, and at comparing two drafts. It is unreliably good at judging whether a deviation matters — that is still the lawyer's job. Use the agent for the first pass; never accept its output without reading it.
  • Self-hosted DMS vs Clio/iManage. Papra and Paperless-ngx win on data sovereignty and cost; they lose on integrations with the rest of the legal-tech ecosystem (time tracking, billing, conflicts). If you already run a managed DMS, keep it and bolt this stack on for the AI layer only.
  • One local model vs the frontier. A 7B–14B local model is competent on most clause-level tasks. It will miss subtleties a 200B+ frontier model catches. That is acceptable for triage; it is not acceptable as a final review. The pack assumes you, the human, are still the last step.

Common pitfalls

  • Privilege waiver risk from cloud chats. Pasting privileged material into a consumer chatbot can — depending on jurisdiction and terms of service — count as disclosure to a third party. pasteguard plus local-first models is how you de-risk this; do not skip those layers because they feel like extra friction.
  • Trusting the AI on jurisdiction-specific nuance. A frontier model will confidently apply New York reasoning to a Delaware question. Always check the governing-law clause first and verify any cited rule against an actual primary source.
  • Treating extraction as judgment. Just because the agent surfaced an unusual indemnity cap does not mean the cap is unreasonable for the deal. Extraction is mechanical; characterising risk is professional judgment. Separate those steps in your workflow.
  • No version control on the clause library. Cherry Studio holds your precedent corpus. Back it up. Version it. If a junior accidentally feeds a counterparty's confidential clause into your library and it surfaces in the next deal, you have a problem.
  • Skipping the OCR step. Running an LLM directly on a scanned PDF wastes tokens and produces garbage extractions. Zerox or Paperless-ngx OCR first; LLM second.
INSTALL · ONE COMMAND
$ tokrepo install pack/lawyer-ai-contract-kit
hand it to your agent — or paste it in your terminal
What's inside

10 assets in this pack

Skill#01
Claude Code Agent: Legal Advisor

Legal documentation and compliance specialist. Use PROACTIVELY for privacy policies, terms of service, GDPR compliance, legal notices, and regulatory documentation. Expert in...

by TokRepo精选·35 views
$ tokrepo install claude-code-agent-legal-advisor-8fd81473
Skill#02
Papra — Minimalistic Self-Hosted Document Archiving

A lightweight, self-hosted document archiving platform for organizing and preserving receipts, invoices, contracts, and other important files with tagging, full-text search, and a clean web interface.

by Script Depot·83 views
$ tokrepo install papra-minimalistic-self-hosted-document-archiving-3784121e
Skill#03
Paperless-ngx — Self-Hosted Document Management with OCR

Paperless-ngx is an open-source document management system that scans, OCRs, indexes, and archives all your physical and digital documents for full-text search.

by Script Depot·249 views
$ tokrepo install paperless-ngx-self-hosted-document-management-ocr-de0041a5
Skill#04
Claude Official Skill: PDF — Read, Create & Edit PDFs

Claude Code skill for PDF files. Read content, extract data, create new PDFs, merge documents, and convert formats. Activates automatically.

by Anthropic·192 views
$ tokrepo install claude-official-skill-pdf-read-create-edit-pdfs-8341096b
Skill#05
Zerox — Zero-Shot PDF OCR for AI Pipelines

Extract text from any PDF using vision models as OCR. Zerox converts PDF pages to images then uses GPT-4o or Claude to extract clean markdown without training.

by Script Depot·205 views
$ tokrepo install zerox-zero-shot-pdf-ocr-ai-pipelines-3ac555d9
Skill#06
Jan — Offline AI Desktop App with Full Privacy

Jan is an open-source ChatGPT alternative that runs LLMs locally with full privacy. 41.4K+ GitHub stars. Desktop app for Windows/macOS/Linux, OpenAI-compatible API, MCP support. Apache 2.0.

by AI Open Source·215 views
$ tokrepo install jan-offline-ai-desktop-app-full-privacy-7b703194
Skill#07
Ollama — Run LLMs Locally

Run large language models locally on your machine. Supports Llama 3, Mistral, Gemma, Phi, and dozens more. One-command install, OpenAI-compatible API.

by Script Depot·197 views
$ tokrepo install ollama-run-llms-locally-0eefb7ad
Skill#08
Cherry Studio Knowledge Base — Local RAG with 50+ Formats

Cherry Studio Knowledge Base ingests PDFs, Office docs, Markdown into a local vector index. Query offline, BYOK any LLM. Data stays on your machine.

by Cherry Studio·129 views
$ tokrepo install cherry-studio-knowledge-base-local-rag-with-50-formats
Skill#09
pasteguard — Local Privacy Proxy for AI Tools

PasteGuard is an open-source local proxy that masks API keys, emails, and PII before requests reach OpenAI/Anthropic. Start via Docker.

by Script Depot·65 views
$ tokrepo install pasteguard-local-privacy-proxy-for-ai-tools
Skill#10
Documenso — Open Source Document Signing Platform

Documenso is an open-source DocuSign alternative for self-hosted document signing with PDF e-signatures, audit trails, and Next.js stack.

by AI Open Source·221 views
$ tokrepo install documenso-open-source-document-signing-platform-f8e25455
FAQ

Frequently asked questions

Is it ethical (or safe) to use a cloud LLM on a confidential contract at all?

It depends on your jurisdiction, the provider's terms (zero-retention enterprise plans are very different from consumer free tiers), and the type of confidentiality obligation. The defensible default is: assume any consumer chatbot may train on or retain your input, treat that as a third-party disclosure, and only use cloud models on documents you've redacted via something like pasteguard, or on enterprise plans with contractual zero-retention. This pack biases local-first precisely because that posture removes the question.

Can I rely on the agent's redline as a final review?

No. None of these tools — not Claude, not Llama, not any current frontier model — should be the last set of eyes on a contract you're advising on. They are excellent at extraction, comparison, and finding the obvious gaps; they are unreliable at judging whether a specific deviation creates business or legal risk. Treat the AI output as a junior associate's first pass: useful, often correct, never trusted without your read.

What's the smallest version of this kit I can run on my laptop tonight?

Three picks: Ollama (one install command, pull a Llama 3 or Qwen variant), Jan (desktop UI that talks to Ollama), and the Claude Official Skill: PDF if you already use Claude Code. That gets you fully-local clause comparison and redline on documents you drag into the desktop app — no archive, no clause RAG, no e-sign yet, but enough to feel whether local LLM review is fit for your practice before you build out the rest.

Why two document tools (Papra and Paperless-ngx)?

Different scales. Papra is a minimalist archive — fast to stand up, friendly UI, good for a solo or a small in-house team that just needs a tagged folder of incoming contracts. Paperless-ngx is a full DMS with OCR, auto-tagging, and search across body text — overkill for ten contracts a month, indispensable when you have thousands. Most lawyers start with Papra and graduate to Paperless-ngx; some run both (Papra for active matters, Paperless-ngx as the long-term archive).

How does the clause library in Cherry Studio compare to a precedent management product like Brightflag or Heretto?

Cherry Studio is a local RAG tool, not a legal-tech product. It can ingest your firm's preferred-clause Word docs, prior executed agreements, and house-style playbooks, then retrieve relevant precedent during a review. It will not handle conflicts checking, billing rules, or the workflow features that purpose-built precedent products offer. The fit is: solo practitioners and small in-house teams who want a private, local precedent search without the enterprise legal-tech contract.

MORE FROM THE ARSENAL

12 packs · 80+ hand-picked assets

Browse every curated bundle on the home page

Back to all packs