Samza-related concepts

For Kafka:

Each topic is divided into partitions; a message's key determines which partition it is written to.

Partitions are stored on brokers; each broker can hold partitions from different topics.

Each consumer group contains one or more consumers, and each consumer can receive data from multiple partitions.

Each producer can write to multiple partitions of a topic.
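The partitioning and consumer-group points above can be sketched in a few lines. This is only an illustration: `partition_for` and `assign_partitions` are hypothetical helper names, and the CRC32 hash is a stand-in assumption, not the hash Kafka's own partitioner uses.

```python
import zlib
from typing import Dict, List


def partition_for(key: str, num_partitions: int) -> int:
    """Map a message key to a partition index.

    Uses CRC32 as an illustrative stable hash (an assumption, not Kafka's hash).
    The same key always lands in the same partition.
    """
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def assign_partitions(num_partitions: int, consumers: List[str]) -> Dict[str, List[int]]:
    """Spread the partitions of one topic round-robin over one consumer group."""
    assignment: Dict[str, List[int]] = {c: [] for c in consumers}
    for p in range(num_partitions):
        assignment[consumers[p % len(consumers)]].append(p)
    return assignment
```

With 4 partitions and 2 consumers, each consumer ends up reading from 2 partitions, which matches the "each consumer receives data from multiple partitions" point.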

For YARN:

NodeManager: responsible for launching processes on its machine.

ResourceManager: talks to all of the NodeManagers to tell them what to run.

ApplicationMaster: application-specific code that runs in the YARN cluster.

 

 

Samza supports two kinds of processing:

Stateless processing: no state associated with the current message is retained after it has been processed.

Stateful processing: some state about a message must be recorded even after it has been processed.
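To make the difference concrete, here is a minimal sketch in plain Python. It does not use Samza's API; `to_upper` and `PageViewCounter` are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def to_upper(message: str) -> str:
    """Stateless: the output depends only on the current message."""
    return message.upper()


class PageViewCounter:
    """Stateful: keeps a running count per user across messages."""

    def __init__(self) -> None:
        self.counts = defaultdict(int)  # state that outlives each message

    def process(self, user_id: str) -> int:
        self.counts[user_id] += 1
        return self.counts[user_id]
```

The stateless function can be restarted anywhere without loss, while the counter's `counts` dictionary would have to be persisted and restored to survive a failure.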

 

Samza supports two notions of time: processing time (when a message is processed) and event time (the timestamp embedded in the message at its source).
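The two notions of time can give different results for the same data. The sketch below groups events into fixed-size windows using either timestamp; the tuple layout and the names `window_start` and `bucket` are assumptions made for this example, not Samza API.

```python
def window_start(timestamp_ms: int, window_ms: int) -> int:
    """Return the start of the fixed-size window containing the timestamp."""
    return timestamp_ms - (timestamp_ms % window_ms)


def bucket(events, window_ms: int, use_source_time: bool):
    """Group values by window.

    events: list of (source_time_ms, processing_time_ms, value) tuples.
    use_source_time: True to window by the embedded timestamp,
                     False to window by when the message was processed.
    """
    windows = {}
    for source_time, processing_time, value in events:
        ts = source_time if use_source_time else processing_time
        windows.setdefault(window_start(ts, window_ms), []).append(value)
    return windows
```

An event created at 1900 ms but processed at 2100 ms belongs to the 1000 ms window by source time and to the 2000 ms window by processing time.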

 

Samza guarantees that each record is processed at least once.
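One common way to get at-least-once behavior is to commit the input offset only after messages have been processed, so a crash causes a replay rather than a loss. The following is a simplified simulation under that assumption; `run`, the `state` dictionary, and the checkpoint interval of 2 are all hypothetical and not taken from Samza.

```python
def run(log, state, fail_at=None):
    """Process `log` starting from the last checkpointed offset.

    state: {"checkpoint": int, "processed": list}
    fail_at: offset at which to simulate a crash (None means no crash).
    """
    for offset in range(state["checkpoint"], len(log)):
        if offset == fail_at:
            raise RuntimeError("simulated crash")
        state["processed"].append(log[offset])
        # Commit the offset only after processing, every 2 messages.
        if (offset + 1) % 2 == 0:
            state["checkpoint"] = offset + 1
```

If the job crashes after processing a message but before the next checkpoint, that message is processed again on restart. Nothing is lost, but duplicates are possible, so downstream logic should tolerate them.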

 

Samza's coordinator supports both an embedded library model (as with Kafka) and a framework model (as with Flink).

 

Samza supports both in-order and out-of-order processing.

Each thread runs one or more tasks.
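As a rough sketch of the threading point, the example below runs more tasks than threads on a standard thread pool, with each task handling its own partition sequentially so that order within a partition is preserved. `run_task` and `run_container` are hypothetical names, and this is plain Python rather than Samza's actual threading implementation.

```python
from concurrent.futures import ThreadPoolExecutor


def run_task(messages):
    """One task: processes its own partition's messages in order."""
    return [m.upper() for m in messages]


def run_container(partitions, num_threads: int):
    """Run one task per partition on a pool of `num_threads` threads.

    With fewer threads than partitions, each thread runs several tasks.
    """
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        # pool.map returns results in the same order as the input partitions.
        return list(pool.map(run_task, partitions))
```

Because tasks share no state here, no coordination between threads is needed.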

 

Reference: http://samza.apache.org/learn/documentation/latest/core-concepts/core-concepts.html
