Cassandra with Tim Berglund

cassandra

tim-berglund

“There isn’t any central node in Cassandra. Every node is a peer, there is no master – there is no single point of failure.”

Apache Cassandra can serve as both the real-time data store for online transactional applications, as well as the read-intensive database for data warehousing operations. In order to combine these two use cases into a single database, Apache Cassandra required lots of innovation.

In today’s episode of Software Engineering Daily, we discuss the internals of Cassandra. Tim Berglund takes us through how Cassandra performs reads and writes, how Cassandra offers tunable consistency, and what inspirations Cassandra took from the Amazon Dynamo paper as well as the Google BigTable paper. Tim will also be presenting at Strata + Hadoop World in San Jose. We’re partnering with O’Reilly to support this conference – if you want to go to Strata, you can save 20% off a ticket with our code PCSED.

Questions

  • What is Cassandra?
  • What did Cassandra take away from Dynamo?
  • Have there been any methodologies or tactics around distributed hashing that have evolved since the time of the Dynamo paper?
  • What is the explain how Cassandra is a hybrid between column-oriented and key-value data stores?
  • Why and how can we explore the different axes of tunable consistency?
  • How was Cassandra constructed, taking into account hardware failure?
  • Can you walk me through a simple Cassandra write?
  • What are you building as part of Datastax training?

Links

Software Daily

Software Daily

 
Subscribe to Software Daily, a curated newsletter featuring the best and newest from the software engineering community.