Apache Spark Archives - Page 2 of 2 - Software Engineering Daily

Uber’s Big Data Platform: 100+ Petabytes with Minute Latency

Article Thursday, November 1 2018

This article was originally written by Reza Shiftehfar on Uber’s Engineering Blog. Reposted with permission from Uber Engineering. Uber is committed to delivering safer and more

Spark Geospatial Analytics with Ram Sriharsha

Podcast Friday, May 4 2018

Phones are constantly tracking the location of a user in space. Devices like cars, smart watches, and drones are also picking up high volumes of location data. This location data is also

Spark and Streaming with Matei Zaharia

Podcast Monday, February 26 2018

Apache Spark is a system for processing large data sets in parallel. The core abstraction of Spark is the resilient distributed dataset (RDD), a working set of data that sits in memory

MemSQL with Nikita Shamgunov

Podcast Tuesday, August 18 2015

MemSQL is a high-performance, in-memory database that combines the horizontal scalability of distributed systems with the familiarity of SQL. Nikita Shamgunov is co-founder and CTO of