A data pipeline automates the process of ingesting, cataloging, transforming, and delivering data from source to destination.

Data pipelines make the ETL process more efficient and repeatable.

Services

Data Ingestion Services

  1. Amazon Kinesis Data Streams
  2. Amazon Data Firehose

Data Storage Services

  1. Amazon S3
  2. Amazon Redshift

Data Cataloging Services

Cataloging your data means adding metadata to your data.

  1. AWS Glue Data Catalog

Data Processing Services

  1. AWS Glue
  2. Amazon EMR

Data Analysis and Visualization Services

  1. Amazon Athena
  2. Amazon Redshift
  3. Amazon QuickSight
  4. Amazon OpenSearch Service