Apache Kafka with Guozhang Wang

Podcast Thursday, August 6 2015

Apache Kafka is a publish-subscribe messaging system rethought as a distributed commit log.

Kafka serves as the central repository for data streams in a distributed system.

Guozhang Wang is an engineer at Confluent, which offers a stream data platform built using Kafka.

Questions include:

What is a central repository for data streams?
How does Kafka improve transportation between systems?
How does Kafka allow for richer analytical processing?
What are the roles of topics, producers, consumers, and brokers?
Do Spark, Storm, and Samza all use Kafka the same way?
How does Kafka combine queueing and pub-sub into a single abstraction: the consumer group?
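
As a rough illustration of the consumer group question above, here is a toy in-memory model in Python. It is not the Kafka client API: the names `Topic`, `assign`, and `deliver`, the CRC32 partitioner, and the round-robin assignment are all assumptions made for this sketch. The idea it tries to show is that each partition goes to exactly one member within a group (queue behavior), while every group independently sees the full log (pub-sub behavior).

```python
import zlib


class Topic:
    """Toy topic: an append-only log split into partitions (hypothetical model)."""

    def __init__(self, num_partitions):
        self.partitions = [[] for _ in range(num_partitions)]

    def append(self, key, value):
        # Pick a partition from the key. CRC32 is an illustrative choice,
        # not the partitioner that Kafka itself uses.
        index = zlib.crc32(key.encode("utf-8")) % len(self.partitions)
        self.partitions[index].append(value)


def assign(num_partitions, members):
    """Round-robin each partition to exactly one member of a group."""
    assignment = {member: [] for member in members}
    for partition in range(num_partitions):
        assignment[members[partition % len(members)]].append(partition)
    return assignment


def deliver(topic, groups):
    """Return {group: {member: [messages]}} for every consumer group."""
    delivered = {}
    for group, members in groups.items():
        assignment = assign(len(topic.partitions), members)
        delivered[group] = {
            member: [msg for p in parts for msg in topic.partitions[p]]
            for member, parts in assignment.items()
        }
    return delivered


topic = Topic(num_partitions=4)
for i in range(8):
    topic.append(key=f"user-{i}", value=f"event-{i}")

# Two groups subscribe to the same topic. "billing" splits the work
# between two members; "audit" has a single member that reads everything.
result = deliver(topic, {"billing": ["b1", "b2"], "audit": ["a1"]})
for group, members in result.items():
    print(group, members)
```

In this sketch, the two members of `billing` share the messages between them without overlap, while `audit` receives every message on its own. A group with one member behaves like a plain subscriber, and a group with many members behaves like a work queue.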

Links:

A Practical Guide to Kafka, by Jay Kreps
Kafka Documentation
Kafka Podcast on Software Engineering Radio
Kafka Podcast on All Things Hadoop (includes notes and diagrams)
Kafka Podcast on O’Reilly Data

Jeff
