Dagster is an orchestrator that’s designed for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports. You declare functions that you want to run and data assets that those functions produce or update. Dagster then helps you run your functions at the right time and keep your assets up-to-date.
Quick navigation
A structured event log allows you to quickly navigate to important entries, such as error messages.
Advanced metadata
Users can embed arbitrary structured metadata in the event log to track properties over time, including using advanced display options.
Lightweight Python execution APIs
Dagster pipelines can execute completely in-memory, with no required database or scheduler process.
Reproducible history
Any historical run or DAG structure can be replayed regardless of system changes.
Multiple schedules
Users can run the same pipeline or pipeline subsets on multiple schedules.
Airflow
What is Airflow?
Airflow is an open-source tool for data pipeline management. It helps you to organize your pieces of code into DAGs, run them, and test and monitor the execution.
Learn moreLuigi
What is Luigi?
Luigi is a Python package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more.
Learn more