41. AWS Data Pipeline

Overview

  • AWS Data Pipeline is a web service that you can use to automate the movement and transformation of data.
  • With AWS Data Pipeline, you can define data-driven workflows, so that tasks can be dependent on the successful completion of previous tasks.
  • You define the parameters of your data transformations and AWS Data Pipeline enforces the logic that you've set up.

Components

  • pipeline definition specifies the business logic of your data management.
  • pipeline schedules and runs tasks by creating Amazon EC2 instances to perform the defined work activities. 
  • Task Runner polls for tasks and then performs those tasks.
  • AWS Data Pipeline works with the following services to store data.
    • Amazon DynamoDB 
    • Amazon RDS 
    • Amazon Redshift 
    • Amazon S3 
  • AWS Data Pipeline works with the following compute services to transform data.
    • Amazon EC2
    • Amazon EMR

Concepts

Pipeline Definition

  • A pipeline definition is how you communicate your business logic to AWS Data Pipeline. It contains the following information:
    • Names, locations, and formats of your data sources
    • Activities that transform the data
    • The schedule for those activities
    • Resources that run your activities and preconditions
    • Preconditions that must be satisfied before the activities can be scheduled
    • Ways to alert you with status updates as pipeline execution proceeds
  • From your pipeline definition, AWS Data Pipeline determines the tasks, schedules them, and assigns them to task runners.

Pipeline Components, Instances, and Attempts

  • Pipeline Components
    • Pipeline components represent the business logic of the pipeline and are represented by the different sections of a pipeline definition.
    • Pipeline components specify the data sources, activities, schedule, and preconditions of the workflow.
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值