MCP ConfigsMay 13, 2026·2 min read

haiku.rag — Agentic RAG CLI + MCP Server

haiku.rag is an agentic RAG toolkit with CLI, Python API, and MCP server; verified 524★ and supports `add-src`, `ask --cite`, and `serve --mcp`.

Agent ready

This asset can be read and installed directly by agents

TokRepo exposes a universal CLI command, install contract, metadata JSON, adapter-aware plan, and raw content links so agents can judge fit, risk, and next actions.

Needs Confirmation · 62/100Policy: confirm
Agent surface
Any MCP/CLI agent
Kind
Mcp
Install
Pip
Trust
Trust: Established
Entrypoint
haiku-rag serve --mcp --stdio
Universal CLI install command
npx tokrepo install bf886e93-454a-5713-8b61-1456eb2fefee
Intro

haiku.rag is an agentic RAG toolkit with CLI, Python API, and MCP server; verified 524★ and supports add-src, ask --cite, and serve --mcp.

Best for: Teams building citation-heavy RAG with local-first LanceDB storage and agent workflows

Works with: Python 3.12+ plus an embedding provider (Ollama/OpenAI/etc.) as required by README

Setup time: 6-15 minutes

Key facts (verified)

  • GitHub: 524 stars · 35 forks · pushed 2026-05-13.
  • License: MIT · owner avatar + repo URL verified via GitHub API.
  • README-backed entrypoint: haiku-rag serve --mcp --stdio.

Main

  • Start with one PDF and verify citations (--cite) before scaling to directory monitoring or research agents.

  • Use the MCP server mode when you want assistants like Claude Desktop to manage documents/search/QA as tools rather than pasted context.

  • Keep provider swaps explicit: embeddings and QA models are pluggable; document which provider you used for each dataset to make runs reproducible.

Source-backed notes

  • README states it is built on LanceDB, Pydantic AI, and Docling, and includes both CLI and Python API entrypoints.
  • README documents MCP server usage: haiku-rag serve --mcp --stdio and a sample mcpServers JSON config.
  • README lists multiple features including hybrid search, citations with page numbers/section headings, and local-first embedded LanceDB storage.

FAQ

  • Do I need an embedding provider?: Yes — README says you must configure one (Ollama/OpenAI/etc.) before indexing/searching.
  • Can I use it from an MCP client?: Yes — run serve --mcp --stdio and add it to your client config.
  • Is there a slim install?: Yes — README mentions haiku.rag-slim plus extras; use it when you want fewer deps.
🙏

Source & Thanks

Source: https://github.com/ggozad/haiku.rag > License: MIT > GitHub stars: 524 · forks: 35

Discussion

Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.

Related Assets