📋

Data Orchestration

Compare 27 data orchestration tools to find the right one for your needs

🔧 Tools

Compare and find the best data orchestration for your needs

Shipyard

The modern data orchestration platform.

A cloud-based data orchestration platform that helps data teams launch, monitor, and share their data workflows.

View tool details →

Mage

Open-source data pipeline tool for transforming and integrating data.

An open-source data pipeline tool that combines the interactivity of notebooks with the reliability of modular code.

View tool details →

Astronomer

The commercial developer of Apache Airflow.

A managed service for Apache Airflow that simplifies the deployment, management, and scaling of data pipelines.

View tool details →

dbt (data build tool)

The T in ELT.

A transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices.

View tool details →

Rivery

The complete ELT platform.

A SaaS ELT platform that provides a single solution for data ingestion, transformation, and orchestration.

View tool details →

Kestra

The open-source data orchestration and scheduling platform.

An open-source, event-driven orchestrator that simplifies data pipelines with a declarative YAML interface.

View tool details →

Flyte

The open-source, structured development platform for complex AI and data products.

An open-source, Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes at scale.

View tool details →

Prefect

The easiest way to orchestrate and observe your data pipelines.

A modern data orchestration platform that allows you to build, run, and monitor data pipelines with Python.

View tool details →

Azure Data Factory

A fully managed, serverless data integration service.

A cloud-based ETL and data integration service that allows you to create data-driven workflows for orchestrating data movement and transforming data at scale.

View tool details →

Argo Workflows

An open source container-native workflow engine for orchestrating parallel jobs on Kubernetes.

An open-source, container-native workflow engine for orchestrating parallel jobs on Kubernetes.

View tool details →

Dagster

The data orchestration platform.

An open-source data orchestrator for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports.

View tool details →

AWS Step Functions

Visual workflows for distributed applications.

A serverless function orchestrator that makes it easy to sequence AWS Lambda functions and multiple AWS services into business-critical applications.

View tool details →

Control-M

Application and data workflow orchestration.

An enterprise-grade application and data workflow orchestration platform that simplifies the management of complex business processes.

View tool details →

Databricks Workflows

Orchestrate any data, analytics, and AI workflow on any cloud.

A fully managed orchestration service for the Databricks Lakehouse Platform that allows you to build, run, and monitor data and AI workflows.

View tool details →

Apache Airflow

A platform to programmatically author, schedule, and monitor workflows.

Open-source platform to create, schedule, and monitor workflows as Directed Acyclic Graphs (DAGs).

View tool details →

Google Cloud Composer

A fully managed workflow orchestration service built on Apache Airflow.

A managed Apache Airflow service that helps you create, schedule, monitor, and manage workflows.

View tool details →

Airbyte

The open-source data integration platform.

An open-source data integration engine that helps you consolidate your data in your data warehouses, lakes, and databases.

View tool details →

Informatica

The enterprise cloud data management leader.

A comprehensive suite of data integration, quality, and governance tools for enterprises.

View tool details →

Matillion

The Data Productivity Cloud.

A cloud-native data integration platform that makes it easy to load, transform, and sync data in the cloud.

View tool details →

Talend

A Qlik Company. The modern, low-code platform for data.

A data integration and data integrity company that provides software solutions for data preparation, data quality, data integration, application integration, data management and big data.

View tool details →

Kubeflow

The Machine Learning Toolkit for Kubernetes.

An open-source project dedicated to making deployments of machine learning workflows on Kubernetes simple, portable, and scalable.

View tool details →

AWS Glue

A serverless data integration service.

A serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

View tool details →

Fivetran

The global leader in data movement.

An automated data movement platform that helps you centralize data from disparate sources into a cloud data warehouse.

View tool details →

Luigi

A Python module that helps you build complex pipelines of batch jobs.

An open-source Python package for building complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more.

View tool details →

IBM InfoSphere DataStage

A flexible data integration tool.

An ETL tool and part of the IBM Information Server platform. It uses a graphical notation to construct data integration solutions.

View tool details →

Oracle Data Integrator (ODI)

High-performance, scalable data integration.

A comprehensive data integration platform that covers all data integration requirements from high-volume, high-performance batch loads, to event-driven, trickle-feed integration processes, to SOA-enabled data services.

View tool details →

Metaflow

A human-friendly Python library for building and managing real-life data science projects.

An open-source Python framework, originally developed at Netflix, for building and managing data science projects.

View tool details →