Data Orchestration
Compare 27 data orchestration tools to find the right one for your needs
🔧 Tools
Compare and find the best data orchestration for your needs
Shipyard
A cloud-based data orchestration platform that helps data teams launch, monitor, and share their data workflows.
Mage
An open-source data pipeline tool that combines the interactivity of notebooks with the reliability of modular code.
Astronomer
A managed service for Apache Airflow that simplifies the deployment, management, and scaling of data pipelines.
dbt (data build tool)
A transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices.
Rivery
A SaaS ELT platform that provides a single solution for data ingestion, transformation, and orchestration.
Kestra
An open-source, event-driven orchestrator that simplifies data pipelines with a declarative YAML interface.
Flyte
An open-source, Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes at scale.
Prefect
A modern data orchestration platform that allows you to build, run, and monitor data pipelines with Python.
Azure Data Factory
A cloud-based ETL and data integration service that allows you to create data-driven workflows for orchestrating data movement and transforming data at scale.
Argo Workflows
An open-source, container-native workflow engine for orchestrating parallel jobs on Kubernetes.
Dagster
An open-source data orchestrator for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports.
AWS Step Functions
A serverless function orchestrator that makes it easy to sequence AWS Lambda functions and multiple AWS services into business-critical applications.
Control-M
An enterprise-grade application and data workflow orchestration platform that simplifies the management of complex business processes.
Databricks Workflows
A fully managed orchestration service for the Databricks Lakehouse Platform that allows you to build, run, and monitor data and AI workflows.
Apache Airflow
Open-source platform to create, schedule, and monitor workflows as Directed Acyclic Graphs (DAGs).
Google Cloud Composer
A managed Apache Airflow service that helps you create, schedule, monitor, and manage workflows.
Airbyte
An open-source data integration engine that helps you consolidate your data in your data warehouses, lakes, and databases.
Informatica
A comprehensive suite of data integration, quality, and governance tools for enterprises.
Matillion
A cloud-native data integration platform that makes it easy to load, transform, and sync data in the cloud.
Talend
A data integration and data integrity company that provides software solutions for data preparation, data quality, data integration, application integration, data management and big data.
Kubeflow
An open-source project dedicated to making deployments of machine learning workflows on Kubernetes simple, portable, and scalable.
AWS Glue
A serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Fivetran
An automated data movement platform that helps you centralize data from disparate sources into a cloud data warehouse.
Luigi
An open-source Python package for building complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more.
IBM InfoSphere DataStage
An ETL tool and part of the IBM Information Server platform. It uses a graphical notation to construct data integration solutions.
Oracle Data Integrator (ODI)
A comprehensive data integration platform that covers all data integration requirements from high-volume, high-performance batch loads, to event-driven, trickle-feed integration processes, to SOA-enabled data services.
Metaflow
An open-source Python framework, originally developed at Netflix, for building and managing data science projects.