Scripts2026年3月31日·1 分钟阅读

RAGFlow — Deep Document Understanding RAG Engine

Open-source RAG engine with deep document understanding. Parses complex PDFs, tables, images. Agent-powered Q&A with citations. Multi-model. 77K+ stars.

TO
TokRepo精选 · Community
快速使用

先拿来用,再决定要不要深挖

这里应该同时让用户和 Agent 知道第一步该复制什么、安装什么、落到哪里。

git clone https://github.com/infiniflow/ragflow.git
cd ragflow
docker compose -f docker/docker-compose.yml up -d

Open http://localhost and create an account. Upload documents and start asking questions.


介绍

RAGFlow is a leading open-source RAG engine that combines deep document understanding with agent capabilities. Unlike basic RAG that chunks text blindly, RAGFlow uses vision models to understand document layouts — tables, images, charts, multi-column text, and complex formatting. Provides accurate, cited answers with full source traceability. 77,000+ GitHub stars, Apache 2.0.

Best for: Enterprise document Q&A, knowledge bases with complex PDFs, legal/financial document analysis Works with: OpenAI, Anthropic, Ollama, Azure, any OpenAI-compatible API


Key Features

Deep Document Parsing

Vision-based layout analysis that understands:

  • Complex tables (merged cells, nested headers)
  • Embedded images and charts
  • Multi-column layouts
  • Headers, footers, page numbers
  • Mathematical formulas

Accurate Citations

Every answer includes exact source references — page number, paragraph, and highlighted text.

Agent Workflows

Built-in agent for multi-step reasoning, web search augmentation, and tool use.

Template-Based Chunking

Document-type-aware chunking strategies — different templates for papers, resumes, contracts, manuals.

Multi-Model Support

OpenAI, Anthropic, Google, Ollama, Azure, DeepSeek, Zhipu, and custom endpoints.

Knowledge Base Management

Web UI for uploading, organizing, and managing document collections across teams.


FAQ

Q: What is RAGFlow? A: An open-source RAG engine with deep document understanding. Parses complex PDFs with tables, images, and charts for accurate Q&A with citations. 77K+ GitHub stars.

Q: How is RAGFlow different from basic RAG? A: Basic RAG splits text into chunks by character count. RAGFlow uses vision models to understand document structure — tables remain as tables, images are captioned, and layouts are preserved.


🙏

来源与感谢

Created by InfiniFlow. Licensed under Apache 2.0. infiniflow/ragflow — 77,000+ GitHub stars

相关资产