Kafka的背景:
kafka的诞生,是为了解决linkedin的数据管道问题,起初linkedin的技术架构师jay creps采用了ActiveMQ来进行数据交换,大约是在2010年前后,那时的ActiveMQ还远远无法满足linkedin对数据传递系统的要求,经常由于各种缺陷而导致消息阻塞或者服务无法正常访问,为了能够解决这个问题,linkedin决定研发自己的消息传递系统,当时linkedin的首席架构师jay kreps便开始组织团队进行消息传递系统的研发;
一开始的目的是日志收集和传输(1.互联网页面page view等点击,浏览信息 2.服务器CPU,内存等类似于网管信息)
kafka是什么:
Apache Kafka® is an event streaming platform,capturing data in real-time from event sources like databases, sensors, mobile devices, cloud services, and software applications in the form of streams of events; storing these event streams durably for later retrieval; manipulating, processing, and reacting to the event streams in real-time as well as retrospectively; and routing the event streams to different destination technologies as needed.
Kafka是事件流实时处理平台,支持数据的实时获取,存储,处理。
Kafka combines three key capabilities so you can implement your use cases for event streaming end-to-end with a single battle-tested solution:
- To publish (write) and subscribe to (read) streams of events, including continuous import/export of your data from other systems.
- To store streams of events durably and reliably for as long as you want.
- To process streams of events as they occur or retrospectively
Produc