Data Integration & ETL
Compare 152 data integration & etl tools to find the right one for your needs
📂 Subcategories
🔧 Tools
Compare and find the best data integration & etl for your needs
Polytomic
A no-code Reverse ETL platform focused on simplicity and speed, enabling anyone to sync data to their business tools.
Mage
An open-source data pipeline tool that combines the interactivity of notebooks with the reliability of modular code.
Shipyard
A cloud-based data orchestration platform that helps data teams launch, monitor, and share their data workflows.
RisingWave
A distributed SQL streaming database for real-time analytics and stream processing.
Estuary Flow
A managed platform for real-time, streaming ETL and CDC.
Arcion
High-performance, agentless CDC for enterprise databases.
Estuary Flow
A real-time data integration platform for building streaming ETL pipelines with historical backfills.
Rivery
A SaaS ELT platform for data ingestion, transformation, and orchestration.
Hightouch
Hightouch is a Reverse ETL platform that helps you sync data from your warehouse to your SaaS tools.
BryteFlow
A no-code tool for real-time data replication and integration.
Census
Census is a Reverse ETL platform that syncs data from your data warehouse to your business applications.
Rivery
Rivery is a SaaS ELT platform that provides a fully-managed solution for data ingestion, transformation, and orchestration.
Dataddo
Dataddo is a no-code, cloud-based data integration platform that connects to any online data source and sends the data to a variety of destinations.
Portable
Portable is a cloud-based ELT platform that specializes in building and maintaining connectors for long-tail data sources.
Dataddo
A no-code, cloud-based data integration platform that can send data to a wide range of destinations.
dbt
dbt is a data transformation tool that enables data analysts and engineers to transform data in their warehouse more effectively.
Rivery
A SaaS ELT platform that provides a single solution for data ingestion, transformation, and orchestration.
dbt (Data Build Tool)
A transformation workflow that lets teams quickly and collaboratively deploy analytics code.
Kestra
An open-source, event-driven orchestrator that simplifies data pipelines with a declarative YAML interface.
dbt (data build tool)
A transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices.
Quix
A platform for building and running real-time data applications in Python.
Castled
Castled is an open-source, developer-focused Reverse ETL platform that can be self-hosted or used as a cloud service.
SeekWell
SeekWell is a platform that helps business teams use SQL to send data from databases and warehouses to applications like Google Sheets, Slack, and email.
Weld
Weld is an all-in-one data platform that combines ELT, data modeling, and Reverse ETL in a single product.
Decodable
A serverless real-time data platform that makes it easy to connect, process, and move data between systems.
Rivery
A SaaS ELT platform that provides a single solution for data ingestion, transformation, and orchestration.
Astronomer
A managed service for Apache Airflow that simplifies the deployment, management, and scaling of data pipelines.
Flyte
An open-source, Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes at scale.
Rivery
Rivery is a SaaS ELT platform that offers data integration, transformation, and orchestration, with Reverse ETL as part of its feature set.
Ably
A serverless platform for powering realtime digital experiences like live chat, notifications, and collaborative features.
Materialize
A streaming database that computes and maintains materialized views on streaming data.
Striim
A unified platform for real-time data integration, streaming analytics, and data visualization.
Meltano
An open-source DataOps platform for building, orchestrating, and managing data pipelines.
Keboola
An end-to-end data operations platform designed for building, deploying, and managing analytics projects.
Keboola
An end-to-end data operations platform designed for building, deploying, and managing data products.
Meltano
Meltano is an open-source DataOps platform that helps you manage the entire lifecycle of your data pipelines.
Workato
Workato is an iPaaS (Integration Platform as a Service) that provides extensive automation and integration capabilities, including Reverse ETL patterns.
Azure Data Factory
A hybrid data integration service for orchestrating and automating data movement and transformation.
Striim
An end-to-end streaming data integration and operational intelligence platform.
Striim
A unified platform for real-time data integration and streaming analytics.
Qlik Replicate
Qlik Replicate is a data replication and ingestion tool that enables real-time data movement from a wide range of sources to your target systems.
Hevo Data
A no-code data pipeline platform that helps you move data in real-time.
Hevo Data
Hevo Data is a no-code data pipeline platform that helps you move data from any source to your warehouse in real-time.
Funnel
A no-code data platform for marketers that automatically collects, cleans, and maps marketing data.
Striim
A unified platform for real-time data integration, streaming analytics, and data movement.
Meltano
An open-source platform for building, orchestrating, and managing data pipelines with a code-first approach.
Striim
Striim is a unified data streaming and integration platform that enables real-time data ingestion, processing, and delivery.
Hevo Data
A no-code data pipeline platform that helps you move data from any source to your warehouse in real-time.
Hevo Data
A no-code data pipeline platform that helps you move data from any source to your warehouse in real-time.
Hightouch
A leading Reverse ETL platform that empowers teams to activate customer data from their data warehouse directly into their go-to-market tools.
RudderStack
RudderStack is a customer data platform (CDP) built for developers, with Reverse ETL as a core capability.
Azure Data Factory
A hybrid data integration service for orchestrating and automating data movement and transformation.
Argo Workflows
An open-source, container-native workflow engine for orchestrating parallel jobs on Kubernetes.
Pusher
A set of hosted APIs for adding realtime features like notifications, chat, and collaboration to web and mobile apps.
Tray.io
Tray.io is a flexible, low-code automation platform (iPaaS) that can be used to build powerful Reverse ETL workflows.
Prefect
A modern data orchestration platform that allows you to build, run, and monitor data pipelines with Python.
Azure Data Factory
A cloud-based ETL and data integration service that allows you to create data-driven workflows for orchestrating data movement and transforming data at scale.
Apache Spark Streaming
An extension of the core Spark API that enables scalable, high-throughput, fault-tolerant stream processing of live data streams.
Apache Flink
An open-source stream processing framework for stateful computations over unbounded and bounded data streams.
Segment
Segment is a leading Customer Data Platform (CDP) that collects, cleans, and activates customer data, with Reverse ETL as a key feature.
PubNub
A developer API platform for building and scaling realtime applications.
Control-M
An enterprise-grade application and data workflow orchestration platform that simplifies the management of complex business processes.
Stitch
Stitch is a cloud-based ELT platform, acquired by Talend, that focuses on simple, reliable data ingestion, with some capabilities for Reverse ETL.
Dagster
An open-source data orchestrator for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports.
AWS Step Functions
A serverless function orchestrator that makes it easy to sequence AWS Lambda functions and multiple AWS services into business-critical applications.
Databricks
A unified data and AI platform that includes capabilities for streaming data processing.
Databricks Workflows
A fully managed orchestration service for the Databricks Lakehouse Platform that allows you to build, run, and monitor data and AI workflows.
Confluent Platform
An enterprise-grade data streaming platform built by the original creators of Apache Kafka.
Stitch Data
A cloud-first, developer-focused platform for rapidly moving data from dozens of sources to a data warehouse.
Google Cloud Datastream
Serverless CDC and replication service from Google Cloud.
Stitch Data
A cloud-first, developer-focused, and open-source platform for rapidly moving data.
Airbyte
An open-source ELT tool for moving data from applications, APIs, and databases to data warehouses and lakes.
Stitch
Stitch is a cloud-based ELT platform that provides simple, reliable data pipelines for developers and data analysts.
Airbyte
Airbyte is an open-source ELT platform with a large and growing library of connectors, offering both self-hosted and cloud-managed options.
Azure Data Factory
Azure Data Factory is a cloud-based data integration service that allows you to create, schedule, and orchestrate your ETL and ELT workflows.
Funnel.io
A data platform for marketers to collect, prepare, and analyze their data from all marketing and advertising platforms.
Confluent
An enterprise data streaming platform built on Apache Kafka.
Supermetrics
A data connector that helps marketers get data from various platforms into their reporting and analytics tools.
Census
Census syncs data from your cloud data warehouse to all your go-to-market tools, empowering teams to act with confidence.
Qlik Replicate
Universal data replication and ingestion software.
Qlik Replicate
A data replication and ingestion tool that moves data easily, securely, and efficiently with minimal operational impact.
Informatica Intelligent Data Management Cloud
An enterprise-grade, AI-powered platform for data integration, quality, governance, and more.
Debezium
Open-source distributed platform for CDC.
Airbyte
Open-source data integration platform for ELT pipelines.
Matillion
A cloud-native data integration and transformation platform.
AWS Database Migration Service (DMS)
A cloud service for database migration and replication.
Informatica Intelligent Data Management Cloud
An enterprise cloud data management platform for data integration, quality, and governance.
Informatica PowerExchange
High-performance CDC for diverse and legacy data sources.
StreamSets (Software AG)
A data engineering platform for building and operating data pipelines.
Matillion
A cloud-native data integration platform built specifically for cloud data warehouses.
Precisely Connect
Data integration software with strong legacy and mainframe CDC.
Adverity
An intelligent platform for marketing data integration, analytics, and reporting.
Matillion
Matillion is a cloud-native ELT platform designed to work with cloud data warehouses like Snowflake, Redshift, and BigQuery.
Airbyte
An open-source ELT platform that helps you replicate data from applications, APIs & databases to data warehouses.
Matillion
A cloud-native data integration and transformation platform designed for modern data teams.
Matillion
Matillion is a cloud-native data integration platform built for major cloud data warehouses, offering both ETL and Reverse ETL capabilities.
Airbyte
An open-source data integration engine that helps you consolidate your data in your data warehouses, lakes, and databases.
Matillion
A cloud-native data integration platform that makes it easy to load, transform, and sync data in the cloud.
Apache Kafka
An open-source distributed event streaming platform for high-performance data pipelines, streaming analytics, and data integration.
Azure Stream Analytics
A real-time analytics and complex event-processing engine on Microsoft Azure.
Google Cloud Composer
A managed Apache Airflow service that helps you create, schedule, monitor, and manage workflows.
Apache Airflow
Open-source platform to create, schedule, and monitor workflows as Directed Acyclic Graphs (DAGs).
ksqlDB
A streaming database that enables real-time data processing and stream processing on Apache Kafka using SQL.
StreamSets (IBM)
A data integration platform for building and operating smart data pipelines for streaming and batch data.
Grouparoo
Grouparoo is an open-source Reverse ETL tool, now part of Airbyte, designed to sync data from warehouses to customer-facing tools.
Informatica
A comprehensive suite of data integration, quality, and governance tools for enterprises.
Hevo Data
Hevo is a no-code data pipeline platform that provides both ELT and Reverse ETL capabilities, focusing on ease of use and automation.
Kubeflow
An open-source project dedicated to making deployments of machine learning workflows on Kubernetes simple, portable, and scalable.
AWS Glue
A serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Talend
A data integration and data integrity company that provides software solutions for data preparation, data quality, data integration, application integration, data management and big data.
Amazon Kinesis
A suite of services for collecting, processing, and analyzing real-time streaming data on AWS.
Google Cloud Dataflow
A fully managed service for executing Apache Beam pipelines for stream and batch data processing.
SAP Integration Suite
SAP Integration Suite is SAP's enterprise iPaaS, enabling integration of SAP and third-party applications, and can be used for Reverse ETL.
Integrate.io
Integrate.io is a low-code data platform that offers ETL, ELT, Reverse ETL, and API generation capabilities.
TIBCO Streaming
An enterprise-grade, cloud-ready streaming analytics platform for building real-time applications.
Integrate.io
Integrate.io is a cloud-based data integration platform that offers ETL, ELT, and Reverse ETL capabilities.
Pentaho Data Integration
An open-source data integration platform for ETL, business analytics, and reporting.
Oracle GoldenGate
Comprehensive software for real-time data integration and replication.
Talend
A unified platform for data integration, data integrity, and data governance.
Pentaho
Pentaho is a business intelligence (BI) and data integration platform that provides a comprehensive suite of tools for accessing, preparing, and analyzing data.
Integrate.io
A low-code data platform for ETL, ELT, CDC, and API generation.
AWS Glue
A serverless data integration service that makes it easy to discover, prepare, and combine data for analytics.
Integrate.io
A low-code data platform for ETL, ELT, Reverse ETL, and API generation.
Talend
A unified platform for data integration, data integrity, and data governance.
Talend (Qlik)
A unified platform for data integration, quality, and governance.
AWS Glue
A serverless ETL service that makes it easy to prepare and load data for analytics.
AWS Glue
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy to prepare and load your data for analytics.
Informatica
Informatica is a leading provider of enterprise cloud data management solutions, including data integration, data quality, and master data management.
Talend
Talend is a data integration and management platform that provides a comprehensive suite of tools for ETL, data quality, and data governance.
Airbyte
Airbyte is an open-source data integration platform for ELT that has expanded to include Reverse ETL capabilities.
Fivetran
An automated data integration platform that helps you centralize data from disparate sources into a single destination.
Fivetran
Fivetran is a cloud-based ELT platform that automates data integration from various sources into data warehouses.
Fivetran
Automated data movement platform.
SAP Data Services
SAP Data Services is a data integration and transformation software that helps you move and transform data from various sources to your target systems.
Google Cloud Data Fusion
A cloud-native data integration service for building and managing ETL/ELT data pipelines.
IBM InfoSphere DataStage
IBM InfoSphere DataStage is an ETL tool and part of the IBM InfoSphere Information Server suite. It uses a graphical notation to construct data integration solutions.
Fivetran
Automates data integration from source to destination, making data accessible and actionable.
Google Cloud Data Fusion
A cloud-native data integration service that helps users efficiently build and manage ETL/ELT data pipelines.
Google Cloud Data Fusion
Google Cloud Data Fusion is a fully managed, cloud-native data integration service that helps users efficiently build and manage ETL/ELT data pipelines.
Fivetran
Fivetran is a leader in automated data movement, primarily for ELT, but now offers Reverse ETL capabilities to sync data back to business applications.
IBM InfoSphere DataStage
An ETL tool and part of the IBM Information Server platform. It uses a graphical notation to construct data integration solutions.
Fivetran
An automated data movement platform that helps you centralize data from disparate sources into a cloud data warehouse.
Luigi
An open-source Python package for building complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more.
Oracle Integration Cloud (OIC)
Oracle Integration Cloud is an enterprise-grade iPaaS that connects Oracle and non-Oracle applications, capable of performing Reverse ETL.
Informatica
Informatica is a major enterprise data integration and management platform whose tools can be used to perform Reverse ETL workflows.
Oracle Data Integrator (ODI)
A comprehensive data integration platform that covers all data integration requirements from high-volume, high-performance batch loads, to event-driven, trickle-feed integration processes, to SOA-enabled data services.
Oracle Data Integrator (ODI)
Oracle Data Integrator is a comprehensive data integration platform that covers all data integration requirements, from high-volume, high-performance batch loads to event-driven, trickle-feed integration processes.
IBM InfoSphere Change Data Capture
Enterprise software for log-based CDC and data replication.
Singer.io
An open-source specification that describes how data extraction and loading scripts should communicate.
Metaflow
An open-source Python framework, originally developed at Netflix, for building and managing data science projects.
Omnata
Omnata is a Reverse ETL and data integration platform that pushes data directly from the data warehouse into SaaS applications without a middle-layer.
Bytewax
An open-source Python framework for building stateful stream processing applications.