Big Data

Sort by:

Competition in the Open Source Ecosystem

From Eric Sammer’s answer via Quora: At Cloudera (company) we regularly work on open source code right along side our competitors. I tend to joke that the engineers at our competitors

Hadoop: Past, Present and Future with Mike Cafarella

“HDFS is going to be a cockroach – I don’t think its ever going away.” Hadoop was created in 2003. In the early years, Hadoop provided large scale data processing with MapReduce,

Data Engineering at Airbnb with Maxime Beauchemin

“One big transformation we’re seeing right now is the slow agonizing death of MapReduce.” When a company gets big enough, there is so much data to be processed that an entire data

Benchmarking Stream Processing Frameworks with Bobby Evans

“Benchmarks are all crap, but there are some benchmarks that are better than others.” Continue reading…
stream processing frameworks

Spark in Practice with Holden Karau

“I found Spark and I was really excited because I’m a functional programming nerd, and it was written in Scala.” Continue reading…