Workflows · April 1, 2026 · 1 min read

Dagster — Cloud-Native Data Pipeline Orchestrator

Dagster orchestrates data pipelines with declarative assets, lineage tracking, and observability. 15.2K+ stars. Python, asset-based, testable. Apache 2.0.

TokRepo Picks · Community
Quick Start

Try it first, then decide whether to dig deeper

This section tells both users and agents what to copy first, what to install, and where it goes.

uv add dagster dagster-webserver dagster-dg-cli
dagster dev
# Open http://localhost:3000

Introduction

Dagster is a cloud-native data pipeline orchestrator for the whole development lifecycle. It has 15,200+ GitHub stars, is licensed under Apache 2.0, and uses a declarative programming model in which you define data assets as Python functions. It provides built-in lineage tracking and observability, multi-tenant infrastructure, CI/CD best practices, and integrations with the modern data stack. It supports Python 3.9-3.14.

Best for: Data engineers building production data pipelines with lineage and observability

Works with: Claude Code, OpenAI Codex, Cursor, Gemini CLI, Windsurf


Key Features

  • Declarative asset definitions in Python
  • Built-in data lineage and observability
  • Multi-tenant production infrastructure
  • CI/CD and testing best practices
  • Modern data stack integrations
  • Web UI dashboard

FAQ

Q: What is Dagster? A: Cloud-native data pipeline orchestrator with 15.2K+ stars. Declarative assets, lineage, observability. Apache 2.0.

Q: How do I install Dagster? A: Run uv add dagster dagster-webserver, then start the local UI with dagster dev.



Source and Thanks

dagster-io/dagster — 15,200+ GitHub stars

Related Assets