Apache Spark Archives - Software Engineering Daily

Prophecy: Apple of Data Engineering with Raj Bains

Podcast Wednesday, July 28 2021

Prophecy is a complete Low-Code Data Engineering Platform for the Enterprise. Prophecy enables all your teams on Apache Spark with a unique low-code designer. While you visually build

StreamSets: DataOps and Smart Pipelines with Arvind Prabhakar

Podcast Thursday, June 17 2021

The company StreamSets is enabling DataOps practices in today’s enterprises. StreamSets is a data engineering platform designed to help engineers design, deploy, and operate smart data

Data Mechanics: Data Engineering with Jean-Yves Stephan

Podcast Friday, May 14 2021

Apache Spark is a popular open source analytics engine for large-scale data processing. Applications can be written in Java, Scala, Python, R, and SQL. These applications have flexible

Data Lakehouse with Michael Armbrust

Podcast Friday, May 1 2020

A data warehouse is a system for performing fast queries on large amounts of data. A data lake is a system for storing high volumes of data in a format that is slow to access. A typical

Notebooks at Netflix with Matthew Seal

Podcast Tuesday, January 15 2019

Netflix has petabytes of data and thousands of workloads running across that data every day. These workloads generate movie recommendations for users, create dashboards for data analysts