Question 1

What is MinerU — Extract LLM-Ready Data from Any Document?

Accepted Answer

Convert PDFs, scans, and complex documents into clean Markdown or JSON for RAG and LLM pipelines. 57K+ GitHub stars.

Question 2

Is MinerU — Extract LLM-Ready Data from Any Document free to use?

Accepted Answer

Yes. MinerU — Extract LLM-Ready Data from Any Document is freely available on TokRepo. Check the Source & Thanks section on the asset page for the specific open-source license.

Question 3

How do I install MinerU — Extract LLM-Ready Data from Any Document?

Accepted Answer

Visit the asset page on TokRepo and click "Copy for agent" to get the installation instructions. Most assets can be installed with a single command.

MinerU — Extract LLM-Ready Data from Any Document

先拿来用，再决定要不要深挖

简介

核心功能

布局感知解析

表格提取

公式识别

OCR 集成

批量处理

来源与感谢

讨论

相关资产

Pydantic — Data Validation for AI Agent Pipelines

Open WebUI — Self-Hosted ChatGPT Alternative

Docusaurus — Build AI Tool Documentation Sites