Dagster

What is Dagster

Dagster is an orchestrator that’s designed for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports. You declare functions that you want to run and data assets that those functions produce or update. Dagster then helps you run your functions at the right time and keep your assets up-to-date.

Dagster

Advantages

Quick navigation

A structured event log allows you to quickly navigate to important entries, such as error messages.

Advanced metadata

Users can embed arbitrary structured metadata in the event log to track properties over time, including using advanced display options.

Lightweight Python execution APIs

Dagster pipelines can execute completely in-memory, with no required database or scheduler process.

Reproducible history

Any historical run or DAG structure can be replayed regardless of system changes.

Multiple schedules

Users can run the same pipeline or pipeline subsets on multiple schedules.

Dagster Alternatives

Airflow

Airflow

What is Airflow?

Airflow is an open-source tool for data pipeline management. It helps you to organize your pieces of code into DAGs, run them, and test and monitor the execution.

Learn more
Luigi

Luigi

What is Luigi?

Luigi is a Python package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more.

Learn more

How do you rate the tool?