Knowledge2026年4月2日·1 分钟阅读
DocETL — LLM-Powered Document Processing Pipelines
Declarative YAML pipelines for LLM document analysis with map, reduce, and resolve operators. By UC Berkeley. 3.7K+ stars.
TO
TokRepo精选 · Community
快速使用
先拿来用,再决定要不要深挖
这里应该同时让用户和 Agent 知道第一步该复制什么、安装什么、落到哪里。
```bash
pip install docetl
```
创建管线 YAML 文件,定义 map(逐文档处理)、reduce(聚合分组)等操作:
```yaml
operations:
- name: summarize
type: map
prompt: "用3句话总结这篇论文:{{ input.content }}"
output:
schema:
summary: string
pipeline:
steps:
- name: summarize_papers
input: papers
operations: [summarize]
```
运行:
```bash
docetl run pipeline.yaml
```
---
🙏
来源与感谢
> Created by [UC Berkeley EPIC Lab](https://github.com/ucbepic). Licensed under MIT.
>
> [docetl](https://github.com/ucbepic/docetl) — ⭐ 3,700+
讨论
登录后参与讨论。
还没有评论,来写第一条吧。
相关资产
LaVague — Natural Language Web Automation
Give a text objective, LaVague drives the browser to accomplish it. Large Action Model framework for web agents. 6.3K+ stars.
TokRepo精选
Trae Agent — AI Coding Agent by ByteDance
Open-source autonomous coding agent for software engineering tasks. Multi-provider LLM support. By ByteDance. 11K+ stars.
TokRepo精选
bolt.diy — AI Full-Stack App Builder, Any LLM
Community fork of Bolt.new. Prompt, edit, and deploy full-stack web apps with any LLM provider. 19K+ GitHub stars.
TokRepo精选