Workflows · Apr 1, 2026 · 1 min read

Dagster — Cloud-Native Data Pipeline Orchestrator

Dagster orchestrates data pipelines with declarative assets, lineage tracking, and observability. It is Python-based, asset-oriented, and testable, with 15.2K+ GitHub stars and an Apache 2.0 license.

TokRepo Picks · Community
Quick Use

Use it first, then decide how deep to go

uv add dagster dagster-webserver dagster-dg-cli
dagster dev
# Open http://localhost:3000

Intro

Dagster is a cloud-native data pipeline orchestrator that covers the whole development lifecycle. It uses a declarative programming model in which you define data assets as Python functions. It provides built-in lineage tracking and observability, multi-tenant infrastructure, CI/CD best practices, and integrations with the modern data stack. The project has 15,200+ GitHub stars, is licensed under Apache 2.0, and supports Python 3.9-3.14.

Best for: Data engineers building production data pipelines with lineage and observability

Works with: Claude Code, OpenAI Codex, Cursor, Gemini CLI, Windsurf


Key Features

  • Declarative asset definitions in Python
  • Built-in data lineage and observability
  • Multi-tenant production infrastructure
  • CI/CD and testing best practices
  • Modern data stack integrations
  • Web UI dashboard

FAQ

Q: What is Dagster?
A: A cloud-native data pipeline orchestrator with 15.2K+ GitHub stars, built around declarative assets, lineage, and observability. It is licensed under Apache 2.0.

Q: How do I install Dagster?
A: Run uv add dagster dagster-webserver, then start it with dagster dev.


Source & Thanks

dagster-io/dagster — 15,200+ GitHub stars
