Apache Kafka

Meet Apache Kafka

Kafka has become a central tool for data at many large organizations. At data-intensive companies like Fiverr and Netflix, Kafka is used simultaneously as: a database, a queue for ordered

Spark and Streaming with Matei Zaharia

Apache Spark is a system for processing large data sets in parallel. The core abstraction of Spark is the resilient distributed dataset (RDD), a working set of data that sits in memory