Data orchestrators

Data orchestration is the process of moving siloed data from multiple storage locations into a centralized repository where it can then be combined, cleaned, and enriched for activation.
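As a rough illustration of the combine, clean, and enrich steps described above, here is a minimal Python sketch. The two in-memory lists stand in for siloed sources, and every name in it (`crm_rows`, `billing_rows`, `clean_email`, `orchestrate`, the field names) is a hypothetical chosen for this example, not part of any orchestrator's API.

```python
# Hypothetical in-memory "silos"; in practice these would be databases, APIs, or files.
crm_rows = [
    {"email": "Ana@Example.com ", "name": "Ana"},
    {"email": "bob@example.com", "name": "Bob"},
]
billing_rows = [
    {"email": "ana@example.com", "total_spent": 120.0},
    {"email": "bob@example.com", "total_spent": None},
]


def clean_email(email: str) -> str:
    """Normalize an email so records from different silos can be matched."""
    return email.strip().lower()


def orchestrate(crm, billing):
    """Combine, clean, and enrich records from two sources into one list."""
    # Combine: index billing rows by cleaned email, then join CRM rows to them.
    billing_by_email = {clean_email(r["email"]): r for r in billing}
    combined = []
    for row in crm:
        email = clean_email(row["email"])
        spent = billing_by_email.get(email, {}).get("total_spent")
        # Clean: treat a missing amount as zero.
        spent = spent if spent is not None else 0.0
        # Enrich: derive a new field from the combined data.
        combined.append({
            "email": email,
            "name": row["name"],
            "total_spent": spent,
            "is_customer": spent > 0,
        })
    return combined


# The resulting list plays the role of the centralized repository.
central_repository = orchestrate(crm_rows, billing_rows)
```

An orchestrator's job is to schedule, run, and monitor steps like these across real systems; the sketch only shows the shape of the data flow.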

Data orchestrators are web applications that make this process easy. The most popular right now are:

  • Apache Airflow
  • Kestra
  • Prefect

There are several comparison pages:

When evaluating the return on investment (ROI) of an orchestration tool, there are several points to consider:

  • Time to install and maintain
  • Time to write a pipeline
  • Time to execute (performance)
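One way to weigh these three points together is to add them up as a time cost. The sketch below is a simplified model, assuming all three can be expressed in hours over the same period. The `ToolCost` class, the `total_hours` function, and all figures are placeholders invented for illustration, not measurements of any real tool.

```python
from dataclasses import dataclass


@dataclass
class ToolCost:
    """Hours spent on one orchestration tool (all values are placeholders)."""
    install_maintain_hours: float   # setup plus upkeep over the period
    hours_per_pipeline: float       # time to write one pipeline
    run_hours_per_pipeline: float   # execution time per pipeline over the period


def total_hours(cost: ToolCost, pipelines: int) -> float:
    """Total time cost for a given number of pipelines."""
    per_pipeline = cost.hours_per_pipeline + cost.run_hours_per_pipeline
    return cost.install_maintain_hours + pipelines * per_pipeline


# Placeholder figures for two unnamed tools, not measurements.
tool_a = ToolCost(install_maintain_hours=40, hours_per_pipeline=6, run_hours_per_pipeline=2)
tool_b = ToolCost(install_maintain_hours=10, hours_per_pipeline=4, run_hours_per_pipeline=1)

for name, cost in (("tool_a", tool_a), ("tool_b", tool_b)):
    print(name, total_hours(cost, pipelines=20))
```

The fixed installation cost matters most when there are few pipelines. As the number of pipelines grows, writing and execution time dominate the total.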

Kestra

Pros:

Cons:

Overall, Kestra offers a higher ROI than Airflow:

  • Installing Kestra is easier than installing Airflow: it doesn’t require Python dependencies, and it comes with a ready-to-use docker-compose file that uses few services and doesn’t require you to understand what an executor is in order to run tasks in parallel.
  • Creating pipelines with Kestra is simple, thanks to its syntax. You don’t need to know a specific programming language because Kestra is designed to be language-agnostic. The declarative YAML design makes Kestra flows more readable than the equivalent Airflow DAGs, which can significantly reduce development time.
  • In this benchmark, Kestra shows shorter execution times than Airflow in every configuration tested.
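To make the idea of a declarative design concrete, here is a small Python sketch in which a flow is plain data and a generic runner decides how to execute it. This is only an analogy: the `flow` structure, the `log` and `sum` task types, `HANDLERS`, and `run_flow` are hypothetical and are not Kestra's schema or engine.

```python
# A flow described as plain data: what to run, not how to run it.
# The keys and task types here are hypothetical, not Kestra's schema.
flow = {
    "id": "hello_flow",
    "tasks": [
        {"id": "greet", "type": "log", "message": "Hello"},
        {"id": "add", "type": "sum", "values": [1, 2, 3]},
    ],
}

# The runner owns the "how": one handler per task type.
HANDLERS = {
    "log": lambda task: task["message"],
    "sum": lambda task: sum(task["values"]),
}


def run_flow(flow_def: dict) -> dict:
    """Run each task in order and collect outputs keyed by task id."""
    outputs = {}
    for task in flow_def["tasks"]:
        handler = HANDLERS[task["type"]]
        outputs[task["id"]] = handler(task)
    return outputs


results = run_flow(flow)
```

Because the flow is only data, it could just as well be written in YAML and read by someone who does not know the runner's language, which is the readability argument made above.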