Tag Zookeeper

Replacing Hadoop with Joe Doliner

“There are a lot more people who have the problem that Hadoop solves than there are people using Hadoop.”

Pachyderm is a containerized data analytics platform that seeks to replace Hadoop.

Continue reading…

Databases: Fundamental Answers

Database Week began with a set of fundamental questions. What is a database?

Every interviewee during Database Week has given a different answer to the question of "What is a database?" (SE Daily, @software_daily, August 21, 2015)

One definition: “an application component for storing and retrieving data”. All of the different database companies have this functionality. But the similarities end there.

RethinkDB pushes data to the application
MemSQL is a faster, proprietary version

Continue reading…

Transactions and Analytics with VoltDB’s Ryan Betts

http://traffic.libsyn.com/sedaily/voltdb_rbetts.mp3

Streaming pipelines and in-memory analytics are difficult to support with old database systems. VoltDB provides streaming analytics with transactions.

Questions
How does VoltDB exemplify Michael Stonebraker’s thesis that one size does not fit all?
What is the difference between OLTP and Streaming?
How does VoltDB serve the common Zookeeper-Kafka-Storm-Cassandra stack?
What trends and requirements among OLTP and OLAP systems are changing most

Continue reading…

Graph Databases with Ryan Boyd of Neo4j

http://traffic.libsyn.com/sedaily/neo4j_ryan.mp3

Graph databases use graph structures for semantic queries. Ryan Boyd is a developer advocate for Neo4j, an open-source graph database.

Questions
Why does Monsanto use graph databases?
In a social network graph, how would you query for “people you may know”?
What CAP tradeoffs does Neo4j make?
Why isn’t BASE good enough?

Links
Hadoop and Graph Databases for Bioinformatics
Neo4j availability discussion (explores ZooKeeper option)

Continue reading…

Apache ZooKeeper with Flavio Junqueira

http://traffic.libsyn.com/sedaily/fpj_zookeeper.mp3

Apache ZooKeeper enables highly reliable distributed coordination. Flavio Junqueira is a committer and PMC member of Apache ZooKeeper, and a former VP of ZooKeeper.

Questions include:
Why is master election so important in Hadoop?
How does a new user begin working with ZooKeeper?
How do nodes “watch” each other?
Should ZooKeeper be used as a message queue or notification system?
What is ZooKeeper’s place in a data center

Continue reading…